Innovative Site Reliability Engineer for Cloud and AI Solutions

toronto, on, Canada • Posted June 06, 2026

Job Type: Full-time
Location: toronto, on
Posted: June 06, 2026
Category: Engineering
Application Deadline: July 16, 2026

Role Description

Lead the charge in site reliability engineering focusing on cloud systems and AI-driven observability. Leverage your strong Python scripting and experience with tools like PagerDuty and Moogsoft.

In this position, you will utilize your strong understanding of SRE principles and distributed systems. Your role will involve working with Ansible and Git for automation, while also exploring Kubernetes and Docker environments. Engaging with generative AI and event-driven architectures will enhance operational efficiency.

Key Responsibilities:
• Implement and manage observability with Splunk and Dynatrace
• Develop automation scripts using Python and Ansible
• Collaborate within distributed cloud systems
• Utilize Git and GitHub Actions for version control
• Engage with container solutions like Kubernetes

Requirements:
• Excellent scripting skills in Python
• Experience with AI/ML observability platforms
• ...

Interested in this role?

Click the button below to start your application for Innovative Site Reliability Engineer for Cloud and AI Solutions at Themesoft Inc..

Apply Now