Senior ML Inference Engineer — Model Efficiency

Montreal, Montreal (administrative region), Canada • Posted June 01, 2026

Job Type: Full-time

Location: Montreal, Montreal (administrative region)

Posted: June 01, 2026

Category: Other-General

Application Deadline: July 11, 2026

Role Description

A leading AI technology company is seeking a Member of Technical Staff to improve model efficiency. The role involves improving performance metrics, eliminating bottlenecks, and collaborating with other teams. The ideal candidate has 5+ years of experience writing high-performance code, strong skills in C++ or Python, and familiarity with large language models. Competitive perks include a flexible work environment, health benefits, and generous vacation time.

Interested in this role?

Applications for Senior ML Inference Engineer — Model Efficiency at Cohere are open until July 11, 2026.