Platform Site Reliability Engineer

Location: Abu Dhabi
Experience: 7+ Years

Role Overview:

This role combines Site Reliability Engineering and automated testing to ensure AI platforms are resilient, scalable, and production-ready.

Key Responsibilities:

  • Build automated validation frameworks.
  • Implement chaos, performance, and resilience testing.
  • Define and monitor SLIs/SLOs.
  • Manage observability and alerting systems.
  • Ensure compliance and production readiness.

Required Skills:

  • SRE
  • Python
  • Azure / AWS
  • Automated Testing
  • Observability
  • Reliability Engineering

📩 Apply Now: [email protected]

Job Application