Site Reliability Engineering (SRE)
Bel Air, Andalucía, Spain • Posted June 01, 2026
Job Type: Full-time
Location: Bel Air, Andalucía
Posted: June 01, 2026
Category: Computing and technology
Application Deadline: July 11, 2026
Role Description
Position at Ant Group
Key Responsibilities
- Ensuring Payment System Stability and High Availability: Lead technical initiatives to strengthen the reliability of our payment systems by designing and implementing monitoring tools, logging frameworks, dashboards, diagnostic utilities, and disaster recovery plans.
- Incident Handling and Emergency Response: Conduct routine drills, develop contingency strategies, and participate in on-call rotations to ensure rapid response to and resolution of production issues across regions.
- Analyze and Optimize Production Issues: Investigate and analyze real-world production cases, such as performance bottlenecks or system inefficiencies, to derive actionable insights and establish technical best practices. Contribute to the evolution of a hi...