Senior Software Engineer, Machine Learning Inference

Santa Clara, CA, United States • Posted June 04, 2026

Job Type: Full-time
Category: other-general
Application Deadline: June 11, 2026

Role Description

At NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some of the world’s most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators.


As a Senior Software Engineer on the TensorRT team, you will design and implement inference software optimizations that power AI applications on NVIDIA GPUs. If you're ready to take on challenging projects and make a significant impact in a company that values creativity, excellence, and collaboration, we want to hear from you!


What you’ll be doing:
+ Design, develop, and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for data centers, workstations, and PCs.
+ Develop software in C++, Python, and CUDA for seamless and efficient deployment of state-of-the-art LLMs and Generative A...
