Opportunity Description
**Overview**
Join Microsoft’s AI Core team building high performance runtime systems that serve OpenAI chat and multimodal AI models at scale. This role focuses on systems level optimization for large scale LLM inferencing with deep C++ expertise.
**Responsibilities**
Join Microsoft’s AI Core team building high performance runtime systems that power OpenAI chat and multimodal AI models at scale. This role focuses on systems level optimization for largescale LLM inferencing with deep C++ expertise.
+ Design and implement high performance microservices and runtime components in C++.
+ Optimize AI inferencing systems for latency, throughput, cost, and reliability at large scale.
+ Debug and resolve complex production issues related to performance, scaling, and service reliability.
+ Collaborate with cross-functional partners to integrate model inference pipelines into scalable infrastructure.
+ Contribute to state-of-...
Join Microsoft’s AI Core team building high performance runtime systems that serve OpenAI chat and multimodal AI models at scale. This role focuses on systems level optimization for large scale LLM inferencing with deep C++ expertise.
**Responsibilities**
Join Microsoft’s AI Core team building high performance runtime systems that power OpenAI chat and multimodal AI models at scale. This role focuses on systems level optimization for largescale LLM inferencing with deep C++ expertise.
+ Design and implement high performance microservices and runtime components in C++.
+ Optimize AI inferencing systems for latency, throughput, cost, and reliability at large scale.
+ Debug and resolve complex production issues related to performance, scaling, and service reliability.
+ Collaborate with cross-functional partners to integrate model inference pipelines into scalable infrastructure.
+ Contribute to state-of-...
Ready to Apply?
Submit your application for Principal Software Engineer, CoreAI at Microsoft Corporation
Apply for this Position