The cloud is not an abstract concept; it is a massive, heavily engineered physical environment. As organizations scale their AI initiatives, hardware reliability becomes the ultimate bottleneck. When training Large Language Models (LLMs) on clusters exceeding 20,000 GPUs, a single hardware failure or network latency issue can add weeks to timelines and millions to operational costs.
As a Data Center Technician at AWS, I focus specifically on this machine learning infrastructure. I manage the physical layer of these systems, ensuring the high availability and seamless operations required to power global AI innovation.
Hardware Maintenance & Diagnostics
My primary responsibility is performing in-depth hardware diagnostics and repairs across complex server architectures and Linux environments.
- Root-Cause Analysis: Identifying the "why" behind failures to prevent recurrence.
- SLA Management: Executing rapid physical repairs and proactive maintenance to meet strict Service Level Agreements.
- System Integrity: Ensuring every node in a massive cluster is performing at its peak capacity.
High-Density Network Deployment
A massive GPU cluster is only as fast as the fabric connecting it. I am cross-trained in network deployment, physically racking, stacking, and cabling the infrastructure that supports our latest AI network fabrics. I help deploy high-density cabling and network links designed to deliver tens of petabits of bandwidth with sub-10 microsecond latency.
Professional Background & Drive
My career is rooted in high-stakes, mission-critical environments, beginning with my service as a U.S. Air Force veteran. Maintaining Fighter Aircraft Weapons systems instilled a disciplined, procedure-driven approach to troubleshooting complex hardware where there is zero margin for error.
Following my military service, I spent over 15 years at the Department of Veterans Affairs, most recently serving as a Senior Management and Program Analyst. In this role, I directed the lifecycle of a $32 million contract and oversaw eight IT service contracts supporting 16 different products. While I found success in management—including automating data workflows with Python and Power BI to reduce manual reporting—my professional drive has always been centered on the "hands-on" technical layer.
I transitioned to my current role as a Data Center Technician at Amazon Web Services (AWS) to return to the technical troubleshooting and systems maintenance that defined my early career. This move allows me to pair my lifelong interest in computer hardware and custom PC building with the operational excellence required to sustain global ML/AI infrastructure.
Today, I leverage a unique blend of Juris Doctor-level analytical thinking, program management oversight, and tactical hardware expertise to ensure the reliability of the physical cloud. When I am not in the data center, I stay sharp by building custom liquid-cooled high-performance PCs, tackling home improvement projects, or spending time outdoors hiking and fishing.