Machine Learning Performance Engineer, Annapurna Labs

Tel Aviv, Israel • Posted May 27, 2026

Job Type: Full-time
Location: Tel Aviv, Israel
Posted: May 27, 2026
Category: other-general
Application Deadline: June 08, 2026

Role Description

Our team is responsible for the AWS Neuron software stack, which powers Generative AI and other advanced ML workloads on AWS's custom-built ML accelerators — Inferentia and Trainium. These accelerators deliver best-in-class performance and cost-efficiency for ML inference and training in the cloud.
We're building a new core group of engineers in TLV (Tel Aviv) to drive innovation in ML systems performance and software. As a Machine Learning Performance Engineer, you'll help shape the direction of the team from the ground up and work on:

Optimizing system performance across the entire ML software stack
Analyzing high-performance ML workloads running on Annapurna hardware
Developing high-performance kernels for critical ML operations
Enhancing the Neuron SDK to improve developer experience and system capabilities
Collaborating across Compiler, Frameworks, and Hardware teams to maximize end-to-end performance

As part of the Performance Engin...
