Fullstack AI Engineer
Lahore, Pakistan • Posted May 24, 2026
Job Type: Full-time
Location: Lahore, Pakistan
Posted: May 24, 2026
Category: other-general
Application Deadline: July 03, 2026
Role Description
Requirements:
- 4 years of experience as a fullstack or backend engineer
- Strong proficiency in Python and JavaScript/TypeScript
- Experience with FastAPI / Django / Node.js and React / Next.js
- Solid understanding of distributed systems and async architectures
- Hands-on experience deploying LLMs such as GPT-4/4.1, Claude, LLaMA, Mistral, Mixtral
- Experience serving models using vLLM, Triton, TGI, or similar frameworks
- Strong understanding of transformer models and inference trade-offs
- Experience with embeddings, vector search, and RAG architectures
- Experience with AWS, GCP, or Azure (GPU workloads preferred)
- Strong Docker and Kubernetes experience
- Familiarity with CI/CD pipelines for ML systems
- Experience with observability tools (Prometheus, Grafana, OpenTelemetry)
- Experience with multimodal AI (audio, video, image models)
- Experience optimizing LLM inference...
Applications for Fullstack AI Engineer at HR POD - Hiring Talent Globally are open until July 03, 2026.