Senior AI Inference Engineer 100% Remote

multan, punjab, Pakistan • Posted May 29, 2026

Job Type: Full-time
Location: multan, punjab
Posted: May 29, 2026
Category: IT & Technology
Application Deadline: July 08, 2026

Role Description

Responsibilities

  • Deploy machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx.
  • Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments.
  • Integrate AI features into existing products, enriching them with the latest advancements in machine learning.

Qualifications

  • Excellent programming skills in C++; experience in Javascript is a bonus.
  • Strong experience with Llama.cpp and ggml inference engines, facilitating the deployment of models to specific GPU architectures.
  • Good understanding of deep learning concepts and model architectures.
  • Experience with transformers, LLMs, Diffusion models.
  • Demonstrated ability to rapidly assimilate new technologies and techniques.
  • A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track r...

Interested in this role?

Click the button below to start your application for Senior AI Inference Engineer 100% Remote at Framework Ventures.

Apply Now