At Frontier AI & Robotics, we're not just advancing robotics – we're reimagining it from the ground up. Our team is building the future of intelligent robotics through frontier foundation models and end-to-end learned systems. We tackle some of the most challenging problems in AI and robotics, from developing sophisticated perception systems to creating adaptive manipulation strategies that work in complex, real-world scenarios.
What sets us apart is our unique combination of ambitious research vision and practical impact. We leverage Amazon's massive computational infrastructure and rich real-world datasets to train and deploy state-of-the-art foundation models. Our work spans the full spectrum of robotics intelligence – from multimodal perception using images, videos, and sensor data, to sophisticated manipulation strategies that can handle diverse real-world scenarios. We're building systems that don't just work in the lab, but scale to meet the demands of Amazon's global operations.
Join us if you're excited about pushing the boundaries of what's possible in robotics, working with world-class researchers, and seeing your innovations deployed at unprecedented scale.
Key job responsibilities
In this role you will build and maintain the data infrastructure that powers our robotics manipulation research. You'll work alongside our existing team of platform engineers to extend the systems that turn raw robot session data into curated, trainable episodes. This team owns streaming ingestion pipelines, platform and schema design, heterogeneous data sources, data curation and quality controls, full-stack inspection and dataset-builders that researchers and human annotators actually use, and tools to let scientists go from dataset to training job without leaving the platform. We run on a modern cloud-native stack — distributed compute on Kubernetes, streaming data infrastructure, columnar lakehouse storage, and a TypeScript/React frontend. We’re looking for engineers willing and eager to work on the full stack in a fast iteration cycle while working with researchers as close customers.
What matters is that you can ship full-stack data infrastructure real users depend on, treat researchers as collaborators rather than customers, and have a strong bias toward iteration in a flat org where engineers pick up science-driven work directly instead of waiting for approval layers.