Senior Software Engineer, AI Inference

Toronto, Canada, Canada • Posted June 01, 2026

Job Type: Full-time
Location: Toronto, Canada
Posted: June 01, 2026
Category: other-general
Application Deadline: June 05, 2026

Role Description

Help us push the boundaries of AI inference at NVIDIA — where your systems expertise shapes both the technology and the teams building on top of it!


We're looking for a Senior Software Engineer to work at the frontier of large-scale LLM serving, partnering directly with some of the world's most technically demanding customers to unlock the full performance potential of NVIDIA's inference stack. In this role, you'll combine deep systems knowledge with hands-on customer engagement — profiling real deployments, benchmarking across GPU clusters, and turning insights into improvements that ripple across the open-source ecosystem. Do you love digging into performance problems that don't have obvious answers, and want your work to have an impact far beyond a single codebase? We'd love to talk. Unlike traditional customer-facing engineering roles, we expect you to go far deeper — contributing to vLLM, NVIDIA Dynamo, and the tooling that makes every engineer on your team more eff...

Interested in this role?

Click the button below to start your application for Senior Software Engineer, AI Inference at NVIDIA.

Apply Now