Machine Learning Engineer at Red Hat
Toronto, ON, Canada • Posted June 06, 2026
Job Type: Full-time
Location: Toronto, ON
Posted: June 06, 2026
Category: Engineering
Application Deadline: July 16, 2026
Role Description
Drive innovations in AI with Red Hat as a Machine Learning Engineer focusing on open-source LLMs. Contribute to cutting-edge projects that enhance AI for enterprise applications.
As a key member of the AI Inference team, you will design and develop inference optimisation algorithms in projects like LLM-compressor and vLLM. Your role includes implementing model compression pipelines and maintaining decoding frameworks to boost inference speed. Work collaboratively with research scientists to transform ideas into robust systems while profiling LLM performance.
Key Responsibilities:
• Contribute to optimisation algorithms for AI projects
• Design and implement model compression techniques
• Develop and maintain decoding frameworks to improve inference speed
• Collaborate on translating research into production systems
• Benchmark LLM performance and optimise for hardware
Requirements:
• Strong knowledge of ML and deep learning fundam...