Opportunity Description
**Do you want to shape the future of AI infrastructure?**
**Ready to define the reliability architecture for AI products, from GPU compute to globally distributed inference, ensuring performance and reliability at scale.**
**Join the Akamai AI Team**
Akamai's Cloud Technology Group offers AI infrastructure globally. The GPU compute platform provides dedicated resources, from single GPUs to full clusters. These resources support training, simulation, inference, and various workloads. Site Reliability Engineering is integrated early to guarantee production-grade reliability and performance.
**Partner with the best**
As Senior Principal SRE for AI, this role involves setting technical direction for building, operating, and scaling AI services. Responsibilities include writing code, designing systems, and solving complex reliability issues. Additionally, mentoring team members, defining technical standards, and promoting engineering best practic...
**Ready to define the reliability architecture for AI products, from GPU compute to globally distributed inference, ensuring performance and reliability at scale.**
**Join the Akamai AI Team**
Akamai's Cloud Technology Group offers AI infrastructure globally. The GPU compute platform provides dedicated resources, from single GPUs to full clusters. These resources support training, simulation, inference, and various workloads. Site Reliability Engineering is integrated early to guarantee production-grade reliability and performance.
**Partner with the best**
As Senior Principal SRE for AI, this role involves setting technical direction for building, operating, and scaling AI services. Responsibilities include writing code, designing systems, and solving complex reliability issues. Additionally, mentoring team members, defining technical standards, and promoting engineering best practic...
Ready to Apply?
Submit your application for Senior Principal Site Reliability Engineer at Akamai Technologies, Inc.
Apply for this Position