Senior AI Inference Engineer 100% Remote
multan, punjab, Pakistan • Posted May 29, 2026
Job Type:
Full-time
Location:
multan, punjab
Posted:
May 29, 2026
Category:
IT & Technology
Application Deadline:
July 08, 2026
Role Description
Responsibilities
- Deploy machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx.
- Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments.
- Integrate AI features into existing products, enriching them with the latest advancements in machine learning.
Qualifications
- Excellent programming skills in C++; experience in Javascript is a bonus.
- Strong experience with Llama.cpp and ggml inference engines, facilitating the deployment of models to specific GPU architectures.
- Good understanding of deep learning concepts and model architectures.
- Experience with transformers, LLMs, Diffusion models.
- Demonstrated ability to rapidly assimilate new technologies and techniques.
- A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track r...
Interested in this role?
Click the button below to start your application for Senior AI Inference Engineer 100% Remote at Framework Ventures.
Apply Now