Senior ML Inference Engineer — Model Efficiency

montreal, montreal (administrative region), Canada • Posted June 01, 2026

Job Type: Full-time
Location: montreal, montreal (administrative region)
Posted: June 01, 2026
Category: Other-General
Application Deadline: July 11, 2026

Role Description

A leading AI technology company is seeking a Member of Technical Staff to enhance model efficiency. This role involves improving performance metrics, optimizing bottlenecks, and collaborating with various teams. The ideal candidate has 5+ years in high-performance coding, strong skills in C++ or Python, and familiarity with large language models. Competitive perks include a flexible work environment, health benefits, and generous vacation time.
#J-18808-Ljbffr

Interested in this role?

Click the button below to start your application for Senior ML Inference Engineer — Model Efficiency at Cohere.

Apply Now