Opportunity Description
Job Description
Purpose of the role: To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them.
Accountabilities
Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning.
Resolution, analysis and response to system outages and disruptions, and implementation of measures to prevent similar incidents from recurring.
Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency and improving system resilience.
Monitoring and optimisation of system performance and resource usage, identifying and addressing bottlenecks, and implementing best practices for performance tuning.
Collaboration with development teams to integrate best practices for reliability, scalabili...
Ready to Apply?
Submit your application for Site Reliability Engineer at Jobleads-UK
Apply for this Position