AI SW Stack Deployment Architect
Bengaluru, Karnataka, India • Posted May 28, 2026
Job Type:
full-time
Location:
Bengaluru, Karnataka
Posted:
May 28, 2026
Category:
Computer Occupations
Application Deadline:
July 07, 2026
Role Description
Job Description
Role Overview
We are looking for a Software Architect (12+ years experience) to lead the application/framework layer and deployment stack for the Next Generation Accelerator AI platform. This role owns how models run on Next Generation Accelerator—from vLLM / PyTorch / TensotFlow/XLA to production deployment—ensuring correctness, performance, and scalability.
Key Responsibilities
- Architect integration of vLLM, PyTorch, and TensorFlow, JAX/XLA into Next Generation Accelerator stack
- Define framework → compiler → runtime APIs and contracts
- Own LLM execution behavior (batching, KV cache, streaming inference)
- Design and implement end-to-end deployment workflows (packaging, versioning, reproducibility)
- Drive performance optimization across mod...
Interested in this role?
Click the button below to start your application for AI SW Stack Deployment Architect at Sandisk.
Apply Now