Inference Serving Engineer — Scalable AI Infra
toronto, on, Canada • Posted June 04, 2026
Job Type:
Full-time
Location:
toronto, on
Posted:
June 04, 2026
Category:
IT & Technology
Application Deadline:
July 14, 2026
Role Description
A technology firm specializing in AI is seeking a Software Engineer – Inference Serving. This entry-level role involves building software infrastructure for an inference serving cluster. Responsibilities include adapting open-source inference servers and implementing efficient solutions for AI models. Ideal candidates should have a relevant degree and familiarity with Python, ML, and low-level programming.
#J-18808-Ljbffr
#J-18808-Ljbffr
Interested in this role?
Click the button below to start your application for Inference Serving Engineer — Scalable AI Infra at Taalas.
Apply Now