Inference Performance Engineer, ML Systems & Optimization

toronto, on, Canada • Posted June 04, 2026

Job Type: Full-time
Location: toronto, on
Posted: June 04, 2026
Category: Engineering
Application Deadline: July 14, 2026

Role Description

A leading AI technology company in Toronto is seeking an experienced software engineer to join their inference model team. This role involves prototyping AI architectural tweaks, developing benchmarking automation, and collaborating closely with silicon teams. Candidates should have over 3 years of experience in high-performance software and a solid understanding of AI tools. Join the forefront of groundbreaking advancements in AI and enjoy a non-corporate culture with job stability.
#J-18808-Ljbffr

Interested in this role?

Click the button below to start your application for Inference Performance Engineer, ML Systems & Optimization at Cerebras Systems.

Apply Now