Senior Software Engineer AI Inference at NVIDIA

toronto, on, Canada • Posted June 01, 2026

Job Type: Full-time
Location: toronto, on
Posted: June 01, 2026
Category: Other-General
Application Deadline: July 11, 2026

Role Description

Advance AI technology as a Senior Software Engineer at NVIDIA, focusing on building efficient AI inference systems. Utilize your skills in GPU optimization and multi-cloud deployment to impact large-scale models.
As a key part of NVIDIA, you'll design and implement high-performance inference stacks and collaborate across various teams. Your expertise will contribute to optimizing GPU kernels, developing benchmarking tools, and driving industry standards. This role requires someone skilled in programming languages like Python and C/C++, emphasizing performance engineering and distributed systems.
Key Responsibilities:
• Architect AI inference systems for large-scale deployment
• Optimize GPU kernels and compilers for performance
• Develop benchmarking methodologies for MLPerf Inference
• Build orchestration for containerized inference on GPU clusters
• Conduct research to integrate advanced ML concepts
Requirements:
• Bachelor’s in CS, CE, or SE with 7+ years ...

Interested in this role?

Click the button below to start your application for Senior Software Engineer AI Inference at NVIDIA at NVIDIA.

Apply Now