Senior AI Research Engineer Model Inference Remote

multan, punjab, Pakistan • Posted June 02, 2026

Job Type: Full-time
Location: multan, punjab
Posted: June 02, 2026
Category: Engineering
Application Deadline: July 12, 2026

Role Description

About the job

We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine‑tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine‑tuning for language models with a strong focus on mobile and integrated GPU acceleration using Vulkan.

Responsibilities

  • Implement and optimize custom inference and fine‑tuning kernels for small and large language models across multiple hardware backends.
  • Implement and optimize full and LoRA fine‑tuning for small and large language models across multiple hardware backends.
  • Design and extend datatype and precision support (int, float, mixed precision, ternary QTypes, etc.).
  • Design, customize, and optimize Vulkan compute shaders for quantized operators and fine‑tuning workflows.
  • Investigate and resolve GPU acceleration issues on Vulkan and integrated/mobile GPUs.

Interested in this role?

Click the button below to start your application for Senior AI Research Engineer Model Inference Remote at Framework Ventures.

Apply Now