Deep Learning Performance Architect, CUTLASS DSL Testing
Shanghai, China, China • Posted June 03, 2026
Job Type:
Full-time
Location:
Shanghai, China
Posted:
June 03, 2026
Category:
other-general
Application Deadline:
June 07, 2026
Role Description
Are you excited about building world-class quality systems for advanced GPU software? Do you enjoy combining automation, product validation, and cod e analysis to support fast-moving compiler and kernel innovation? We are seeking a strong test engineer to develop the NVIDIA CUTLASS DSL testing framework, shape product test strategy, and ensure end-to-end code quality across the MLIR-based compilation pipeline. In this role, you will drive automated testing, and regression detection to make sure every code change is validated for correctness, and the product is ready for shipping at any time.
What you'll be doing:
+ Develop and evolve the NVIDIA CUTLASS DSL testing framework for next-generation GPU software
+ Define, refine, and execute robust product test strategies for shipping to the open-source community
+ Ensure end-to-end code quality across the MLIR-based compilation pipeline and related functional coverage infrastructure
+ B...
What you'll be doing:
+ Develop and evolve the NVIDIA CUTLASS DSL testing framework for next-generation GPU software
+ Define, refine, and execute robust product test strategies for shipping to the open-source community
+ Ensure end-to-end code quality across the MLIR-based compilation pipeline and related functional coverage infrastructure
+ B...
Interested in this role?
Click the button below to start your application for Deep Learning Performance Architect, CUTLASS DSL Testing at NVIDIA.
Apply Now