Senior ML Inference Engineer — Model Efficiency
montreal, montreal (administrative region), Canada • Posted June 01, 2026
Job Type:
Full-time
Location:
montreal, montreal (administrative region)
Posted:
June 01, 2026
Category:
Other-General
Application Deadline:
July 11, 2026
Role Description
A leading AI technology company is seeking a Member of Technical Staff to enhance model efficiency. This role involves improving performance metrics, optimizing bottlenecks, and collaborating with various teams. The ideal candidate has 5+ years in high-performance coding, strong skills in C++ or Python, and familiarity with large language models. Competitive perks include a flexible work environment, health benefits, and generous vacation time.
#J-18808-Ljbffr
#J-18808-Ljbffr
Interested in this role?
Click the button below to start your application for Senior ML Inference Engineer — Model Efficiency at Cohere.
Apply Now