AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting data center operations — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
Key job responsibilities
Own the 3-5 year technology roadmap for DCC Availability Tech, aligned with AWS Boost platform evolution and DCEO operational strategy
Define technology standards and architectural principles for availability-focused systems (CBM, Vector, RCA+, Availability Intelligence HUB)
Drive technology portfolio rationalization, consolidating legacy systems
Manage multi-million dollar budgets and multi-year timelines with 5+ cross-functional teams
Establish success criteria, governance structures, and standardized methodologies for program execution
Drive consensus among leaders on technology priorities through data-driven recommendations
Establish and chair key forums: Availability Tech Architecture Review Board, Boost Integration Steering Committee, Automation Innovation Working Group
Present technology strategies to VP/SVP audiences, translating technical details into business impact
About the team
The Availability team develops, implements, and maintains comprehensive programs that ensure maximum availability and reliability of AWS data center infrastructure through systematic validation, testing, and continuous improvement initiatives.