Our team leads the development and optimization of on-device ML models for Amazon's hardware products, including audio, vision, and multi-modal AI features. We work at the critical intersection of ML innovation and silicon design, ensuring AI capabilities can run efficiently on resource-constrained devices.
Currently, we enable production ML models across multiple device families, including Echo, Ring/Blink, and other consumer devices. Our work directly impacts Amazon's customer experiences in consumer AI device market. The solutions we develop determine which AI features can be offered on-device versus requiring cloud connectivity, ultimately shaping product capabilities and customer experience across Amazon's hardware portfolio.
This is a unique opportunity to shape the future of AI in consumer devices at unprecedented scale. You'll be at the forefront of developing industry-first model architectures and compression techniques that will power AI features across millions of Amazon devices worldwide. Your innovations will directly enable new AI features that enhance how customers interact with Amazon products every day. Come join our team!
Key job responsibilities
As a Principal Applied Scientist, you will:
• Own the technical architecture and optimization strategy for ML models deployed across Amazon's device ecosystem, from existing to yet-to-be-shipped products.
• Develop novel model architectures optimized for our custom silicon, establishing new methodologies for model compression and quantization.
• Create an evaluation framework for model efficiency and implement multimodal optimization techniques that work across vision, language, and audio tasks.
• Define technical standards for model deployment and drive research initiatives in model efficiency to guide future silicon designs.
• Spend the majority of your time doing deep technical work - developing novel ML architectures, writing critical optimization code, and creating proof-of-concept implementations that demonstrate breakthrough efficiency gains.
• Influence architecture decisions impacting future silicon generations, establish standards for model optimization, and mentor others in advanced ML techniques.