Freelance Agent Evaluation Engineer
Australia, Queensland, Australia • Posted May 31, 2026
Job Type:
Part time
Location:
Australia, Queensland
Posted:
May 31, 2026
Category:
Computer Occupations
Application Deadline:
July 10, 2026
Role Description
Please submit your CV in English and indicate your level of English proficiency.
Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.
What this opportunity involves
We're building a dataset to evaluate AI coding agents - how well a model handles real-world developer tasks.
You'll create challenging tasks and evaluation criteria within realistic simulated environments:
- Build realistic developer environments - a virtual company with codebase, infrastructure, and context (tickets, docs, conversations) that forms a believable development history
- Design tasks from intermediate states of these environments - craft the prompt, define what solved means, and ensure the task is solvable by an AI agent
- Write tests t...
Interested in this role?
Click the button below to start your application for Freelance Agent Evaluation Engineer at Mindrift.
Apply Now