Looking for an exciting challenge to accelerate your career? Are you customer obsessed? Imagine being part of a team that predicts inventory flows for Amazon’s worldwide supply chain. Amazon’s worldwide inventory planning involves many algorithms to buy inventory in the right quantities, at the right frequencies, from the right vendors, and assigning to the best warehouse to fulfill customer demand forecast. Our massive simulation system lives at the heart of these algorithms, keeping up with the rapid pace of optimization improvements and simulating how they interact with each other. We simulate what these systems will do for months into the future, predicting inventory flows across the network for both labor planning and AB test experimentation, end-to-end from vendors to customers.
The Amazon Supply Chain Optimization Technology (SCOT) organization is looking for a highly motivated Data Engineer (DE). SCOT is a unique opportunity to both create and see the direct impact of your work on billions of dollars worth of inventory, in one of the world’s most advanced supply chains, and at massive scale.
As an DE in SCOT Simulation team, you will work closely with some of the brightest software engineers, scientists, economists, and product managers, to solve highly complex supply chain challenges. You will work on one of the largest distributed systems in the world in order to support new use cases and design its future state, you will think of how we can leverage AWS technologies (specifically technologies such as ECS and EMR), that will allow our next generation system to run elastically at a much larger scale and on demand, as well as define new frameworks (data APIs, plug and play technology) that will allow supply chain teams to integrate their components in simulation in a low touch manner. You will be responsible for building and maintaining data pipelines, warehouses, and lakes across our Simulation tool set, successfully delivering reliable data infrastructure and meeting program goals. You will design, implement, and optimize ETL processes, data models, and analytical solutions for Amazon Supply Chain, using SQL, PySpark, Amazon Big Data Technologies (BDT), AWS services (such as Redshift, EMR, Glue), and data visualization tools.
Finally, here are some more reasons you might find this opportunity exciting:
- Being part of a team that is at the heart of the most sophisticated supply chain and decision making system on the planet
- Working at the intersection of science and engineering to simulate and predict future supply chain flows in order to ultimately impact Amazon user experience (by helping supply chain teams run experiments that allow for speed and same day availability for example), saving hundreds of millions of dollars along the way
- Closely collaborating with some of the brightest software engineers, scientists, and economists, in order to solve complex supply chain problems and implement systems to tackle these challenges at scale (billions of simulation events per day), and an evolving need for faster and more scalable applications
- Being part of a team that values creativity and welcomes outside of the box thinking (thinking big!)
- Being part of a team with a high emphasis on a positive work culture of collaboration
- Being part of a fast growing team with many growth opportunities
Key job responsibilities
- Work closely with data scientists, business intelligence engineers, product managers, and software engineers to create robust data architectures and pipelines
- Take ownership of the design, creation, and maintenance of business critical metrics, tables, and jobs
- Optimize data and queries to improve simulation performance runtime and cost structure
- Develop and manage scalable, automated, and fault-tolerant data solutions using technologies such as Cradle, EMR, Redshift, Glue, Andes, and S3
- Lead/orchestrate org-wide data migration and data privacy campaigns