Senior Software Engineer, AI Inference
toronto, on, Canada • Posted June 01, 2026
Job Type:
Full-time
Location:
toronto, on
Posted:
June 01, 2026
Category:
IT & Technology
Application Deadline:
July 11, 2026
Role Description
Senior Software Engineer, AI Inference page is loaded## Senior Software Engineer, AI Inferencelocations:
Canada, Torontotime type:
Full timeposted on:
Posted 2 Days Agojob requisition id:
JR Help us push the boundaries of AI inference at NVIDIA — where your systems expertise shapes both the technology and the teams building on top of it!**What You'll be doing:*** Work directly with customer engineering teams through long-term technical partnerships, understanding their LLM serving architectures and performance goals, then designing and implementing end-to-end benchmarking campaigns across Kubernetes and Slurm environments to surface actionable insights.* Set up and operate vLLM serving deployments on GPU clusters, tuning configurations for throughput, latency, and efficiency — and collect Nsight Systems / Nsight Compute profiling traces to identify performance gaps relative to reference frameworks.* Develop detailed performance plans based on profili...
Canada, Torontotime type:
Full timeposted on:
Posted 2 Days Agojob requisition id:
JR Help us push the boundaries of AI inference at NVIDIA — where your systems expertise shapes both the technology and the teams building on top of it!**What You'll be doing:*** Work directly with customer engineering teams through long-term technical partnerships, understanding their LLM serving architectures and performance goals, then designing and implementing end-to-end benchmarking campaigns across Kubernetes and Slurm environments to surface actionable insights.* Set up and operate vLLM serving deployments on GPU clusters, tuning configurations for throughput, latency, and efficiency — and collect Nsight Systems / Nsight Compute profiling traces to identify performance gaps relative to reference frameworks.* Develop detailed performance plans based on profili...
Interested in this role?
Click the button below to start your application for Senior Software Engineer, AI Inference at NVIDIA Corporation.
Apply Now