The Systems Development Engineer (SysDev) within Data Center Infrastructure Engineering (DCIE) is responsible for developing automation, monitoring, and analytics solutions to support Amazon’s global IT infrastructure within Fulfillment Centers (FCs). This role focuses on building scalable software systems, infrastructure automation, and data-driven insights to enhance the reliability, efficiency, and performance of Amazon’s power, cooling, and structured cabling infrastructure.
SysDevs collaborate closely with DCIE Engineers (electrical, mechanical, telecom, and general), Technical Program Managers (TPMs), and DCIO Engineers to develop custom tools, dashboards, APIs, and automation frameworks that improve real-time monitoring, predictive maintenance, and operational scalability.
Key job responsibilities
Infrastructure System Development & Automation
- Develop custom automation tools to manage and optimize power, HVAC, and structured cabling infrastructure across Amazon FCs.
- Build scalable APIs, microservices, and integration solutions to streamline infrastructure monitoring and control.
- Create automated deployment frameworks for configuring, provisioning, and managing critical infrastructure components.
- Collaborate with TPMs and DCIE Engineers to enhance infrastructure standardization through code-driven deployments.
Monitoring & Data Analytics
- Design and implement real-time dashboards for power monitoring, HVAC performance, and network infrastructure health.
- Develop data analytics pipelines to process and analyze infrastructure telemetry, supporting predictive maintenance and anomaly detection.
- Work with Data Center Infrastructure Management (DCIM) teams to integrate asset tracking, power consumption metrics, and infrastructure lifecycle insights.
- Use machine learning and AI-driven analytics to identify trends, prevent failures, and optimize resource allocation.
Operational Support & Incident Response
- Automate incident detection, alerting, and response workflows to minimize downtime and improve infrastructure reliability.
- Support Sev1/Sev2 incident response, developing automated troubleshooting and remediation tools for infrastructure failures.
- Work with DCIO Engineers to enhance on-call operations through software-defined automation.
- Participate in post-incident reviews (PIRs), implementing software-driven solutions to prevent recurrence.
Infrastructure Integration & Standardization
- Develop and maintain OpenDCIM-based systems to track power, cooling, and network infrastructure assets.
- Create tools to enforce compliance with Amazon’s infrastructure standards, reducing manual audits and deployment errors.
- Support integration of Fault-Managed Power (Project Constellation), Fiber Media Conversion (Project Opti-Bridge), and Split CT Power Monitoring solutions into automated workflows.
Collaboration & Process Improvement
- Partner with TPMs to define software solutions for infrastructure lifecycle management, remediation projects, and scalability initiatives.
- Work with DCIE Engineers to implement self-healing infrastructure capabilities through automation and AI-driven insights.
- Continuously improve DevOps and infrastructure automation practices within DCIE.
A day in the life
As a SysDev in DCIE, you’ll begin your day reviewing infrastructure telemetry dashboards and overnight logs to identify anomalies or performance degradations across Amazon Fulfillment Centers. You’ll then sync with Technical Program Managers (TPMs) and DCIE Engineers to prioritize active workstreams, whether it’s scaling power monitoring pipelines, deploying automation for HVAC lifecycle tracking, or integrating new telemetry sources into the DCIM platform. Midday, you’ll be coding: building out RESTful APIs, refining predictive maintenance models, or developing automation scripts for infrastructure deployment. You’ll participate in design reviews, contribute to PIRs, and resolve automation or telemetry-related blockers raised by on-call engineers. Your day ends with sprint planning or roadmap check-ins, driving progress on scalable, self-healing infrastructure systems that improve uptime, efficiency, and global visibility for thousands of Amazon sites.
Amazon offers a full range of benefits that support you and eligible family members, including domestic partners and their children. Benefits can vary by location, the number of regularly scheduled hours you work, length of employment, and job status such as seasonal or temporary employment.
The benefits that generally apply to regular, full-time employees include:
- Medical, Dental, and Vision Coverage
- Maternity and Parental Leave Options
- Paid Time Off (PTO)
- 401(k) Plan
If you are not sure that every qualification listed for this role describes you exactly, we'd still love to hear from you!
At Amazon, we value people with unique backgrounds, experiences, and skillsets. If you’re passionate about this role and want to make an impact on a global scale, please apply!
About the team
The Data Center Infrastructure Engineering (DCIE) team within Ops Technology Infrastructure Engineering (OTIE) designs, standardizes, and sustains scalable, cost-effective, and resilient IT infrastructure for Amazon Fulfillment and Logistics Operations worldwide.
We enable Operations Technology Solutions (OTS) by delivering high-performance power, cooling, structured cabling, edge compute, and automation solutions that ensure reliable and efficient on-premises hardware operations.
Our work spans Demarcation Rooms, MDFs, IDFs, power systems (UPSs, ATSs, PDUs), fault-managed power, cooling and containment, Computers on Wheels (COWs), telecommunications, and distributed edge compute infrastructure to enhance data processing and reduce latency.
Through automation, predictive analytics, and proactive maintenance, DCIE drives operational excellence, minimizes downtime, and scales infrastructure to support Amazon’s rapid growth while aligning with its efficiency, reliability & safety, sustainability, and scalability objectives.