AWS Neuron ML Kernel Engineer

toronto, on, Canada • Posted June 04, 2026

Job Type: Full-time
Location: toronto, on
Posted: June 04, 2026
Category: Other-General
Application Deadline: July 14, 2026

Role Description

Join the Annapurna Labs team at Amazon Web Services as an ML Kernel Performance Engineer focused on optimizing deep learning performance with AWS Neuron on custom hardware. Shape the future of AI acceleration technology while working on critical machine learning projects.
In this engineering role, you'll contribute to enhancing the performance of AWS's ML accelerators, Inferentia and Trainium. Your expertise in designing high-performance compute kernels and optimizing kernel-level performance will be essential. Collaborate with cross-functional teams and directly interface with customers to maximize their ML models’ efficiency on AWS.
Key Responsibilities:
• Design and implement high-performance compute kernels for ML operations
• Analyze and optimize kernel performance on Neuron hardware
• Conduct detailed performance analysis using profiling tools
• Implement compiler optimizations like tiling and scheduling
• Collaborate with customers to enhance their ML models...

Interested in this role?

Click the button below to start your application for AWS Neuron ML Kernel Engineer at Amazon Development Centre Canada ULC.

Apply Now