Reinforcement Learning & Optimization Intern

hyderabad, telangana, India • Posted June 05, 2026

Job Type: Full-time
Location: hyderabad, telangana
Posted: June 05, 2026
Category: Technology, Information and Internet
Application Deadline: July 15, 2026

Role Description

Program structure

Track: Research engineering

Reports to: Staff research engineer, EOS Intelligence Plane team

Duration: 20–24 weeks, full-time preferred

Primary languages: Python (PyTorch or JAX), familiarity with Stable Baselines / CleanRL / TorchRL

Outcome: A trained, sim-validated routing policy that demonstrably improves utility- per-dollar over the production baseline

Compensation: stipend per internal scale; conversion to full-time considered for strong performers.

Mentorship: each intern is paired with a senior engineer or researcher who is the technical owner of the area.


How to apply: Send

• Resume / CV (PDF).

• A link to a GitHub profile, portfolio, or representative project.

• The role number(s) you are applying for. You can apply for up to two.

Interested in this role?

Click the button below to start your application for Reinforcement Learning & Optimization Intern at CloudNuro.

Apply Now