Inference Serving Engineer — Scalable AI Infra

toronto, on, Canada • Posted May 28, 2026

Job Type: Full-time
Location: toronto, on
Posted: May 28, 2026
Category: Other-General
Application Deadline: July 07, 2026

Role Description

A technology firm specializing in AI is seeking a Software Engineer – Inference Serving. This entry-level role involves building software infrastructure for an inference serving cluster. Responsibilities include adapting open-source inference servers and implementing efficient solutions for AI models. Ideal candidates should have a relevant degree and familiarity with Python, ML, and low-level programming.
#J-18808-Ljbffr

Interested in this role?

Click the button below to start your application for Inference Serving Engineer — Scalable AI Infra at Taalas.

Apply Now