Opportunity Description
We are seeking an experienced Lead Site Reliability Engineer to spearhead our infrastructure reliability initiatives and guide a team of talented engineers. In this role, you will shape technical strategy, mentor team members and drive operational excellence across our cloud-based platforms and distributed services.
Responsibilities
- Lead the design and evolution of resilient, scalable infrastructure across multiple cloud providers
- Mentor and guide a team of engineers, fostering technical growth and best practices
- Define reliability standards, SLOs and operational policies for production environments
- Architect automation frameworks to streamline deployments and infrastructure management
- Oversee CI/CD strategy and ensure efficient software delivery workflows
- Coordinate incident response efforts and lead post-mortem analyses to prevent recurrence
- Partner with engineering leadership to align r...
Ready to Apply?
Submit your application for Lead Site Reliability Engineer at EPAM Systems
Apply for this Position