Principal Software Engineer, CoreAI Workload Engines

Microsoft Corporation

Redmond, WA, United States | Full-time | June 03, 2026

Opportunity Description

**Overview**

The CoreAI Workloads team builds the foundational inference engines and APIs that power large-scale AI inference across Azure, from cutting-edge startups to Fortune 500 enterprises and Microsoft Copilots and agents. Our mission is to deliver secure, reliable, and highly efficient GPU inference that enables multitenant AI systems at global scale while maximizing utilization, performance, and developer productivity. We own inference serving and performance for OpenAI and other state-of-the-art large language models (LLMs), and we work directly with OpenAI, serving some of the largest workloads on the planet with trillions of inferences per day. Our converged AI fabric and engines deliver inference capabilities for all LLMs in the Microsoft catalog (https://azure.microsoft.com/en-us/products/ai-model-catalog#Models), including OpenAI, Anthropic, Mistral, Cohere, Llama, and more.

This role sits at the intersection of LLM inference fleets, serving efficiency, rapid...