Join the MODS (Managed Orchestration of Devices & Services) team within Amazon's AEX organization, where you will build and operate device management infrastructure at massive scale. You will design, develop, and maintain distributed systems that process 800M events daily, serve 700K requests/hour, and manage 1.5 million Amazon corporate endpoints across 40+ consumer teams worldwide.
Key job responsibilities
Design and implement scalable serverless architectures using AWS Lambda, DynamoDB, SQS, and SNS to support device telemetry, compliance evaluation, and fleet targeting
Own end-to-end delivery of features across 8 production services including KARL (device data warehouse), UEM Platform (greenfield), CMS (claims management), and ACME (device enrollment)
Build new greenfield initiatives for 2026 including ACME v4 (migrating 152K macOS devices to a unified .NET 8 codebase) and OneIT SDK internal components (modular plugin architecture powering the OneIT Desktop Application)
Participate in oncall rotation, triage production incidents, and drive root cause analysis with preventive action items
Collaborate with 40+ consumer teams (FinTech, HR, Infosec, Asset Management) to deliver reliable APIs and data pipelines
Contribute to the multi-year migration from legacy systems (MEMS+KARL) to a greenfield Unified Endpoint Management platform (UEM+CDS)
A day in the life
You will spend your day designing and shipping features for device management services that run across 4 AWS regions. You will work closely with cross-functional teams including AEA-Core (identity/auth), Client Engineering (deployments), and Helios (software management). A typical week includes sprint planning, code reviews, oncall shadowing, and customer deep-dives with downstream data consumers. You will solve problems at the intersection of device security, fleet orchestration, and large-scale data processing.
About the team
MODS is part of AEG, building self-driving device identity and fleet management without human operational overhead
Team tenets: Fail Fast, Fewer Goals Higher Impact, Assume Positive Intention, and Short-Term Be Reasonable Long-Term Be Right
We value collaboration, operational excellence, and building fault-tolerant systems that scale