Evaluation Scenario Writer - AI Agent Testing Specialist

madrid, comunidad de madrid, Spain • Posted May 31, 2026

Job Type: Full-time
Location: madrid, comunidad de madrid
Posted: May 31, 2026
Category: Informática y tecnología
Application Deadline: July 10, 2026

Role Description

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems.

Participation isproject-based, not permanent employment.

What This Opportunity Involves

You’ll create challenging coding test cases that push AI coding systems to their limits:

  • Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements and information sources
  • Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks
  • Craft fair but hard challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required)
  • Analyze AI failures to understand what the mod...

Interested in this role?

Click the button below to start your application for Evaluation Scenario Writer - AI Agent Testing Specialist at Mindrift.

Apply Now