AI Engineer for LLM Ops & Evaluation (m/f/d)

Munich, Bavaria, Germany • Posted June 19, 2026

Job Type: CDI
Location: Munich, Bavaria
Posted: June 19, 2026
Category: Computer Occupations
Application Deadline: July 29, 2026

Role Description

You'll join an early-stage, AI-native startup with a product that has already proven market fit. We build cutting-edge AI solutions for Governance, Risk and Compliance (GRC) for enterprises around the world.


Our customers are auditors, risk managers, and compliance teams, which means evaluation rigor, auditability, and EU AI Act readiness aren't afterthoughts for us. They're product requirements.


Tasks


As our AI Engineer for LLMOps & Evaluation, you'll own the LLMOps pipeline end-to-end and work directly alongside our founding team.


You will:



  • Own the LLMOps pipeline: Evaluate infrastructure, prompt optimization loop, and the production integration that turns experiments into reliable customer-facing features

  • Design evaluation strategy per output type: Decide when to use deterministic evals (exact match, schema validation, embeddings) vs. LLM-as-judge, and build the rubrics, test datasets, and human-review ...

Interested in this role?

Click the button below to start your application for AI Engineer for LLM Ops & Evaluation (m/f/d) at Auxilius.ai.

Apply Now