Opportunity Description
We are now looking for a Senior Performance Architect for Nemotron! At NVIDIA, we are redefining the future of AI systems through deep model–system–hardware co-design. We are looking for a forward-thinking Nemotron Performance Architect to shape the next generation of Nemotron models through performance modeling, analysis, and forward projections. In this role, you will predict before we build - developing high-fidelity models to evaluate how architectural choices translate into real-world deployment efficiency. You will ensure that future models achieve Pareto-optimal trade-offs across accuracy, throughput, and interactivity on target platforms.
Recent efforts such as LatentMoE (https://research.nvidia.com/labs/nemotron/LatentMoE/) architectures and the Nemotron Super (https://developer.nvidia.com/blog/introducing-nemotron-3-super-an-open-hybrid-mamba-transformer-moe-for-agentic-reasoning/) model exemplify the kind of performance-driven co-design you will help advance...
Recent efforts such as LatentMoE (https://research.nvidia.com/labs/nemotron/LatentMoE/) architectures and the Nemotron Super (https://developer.nvidia.com/blog/introducing-nemotron-3-super-an-open-hybrid-mamba-transformer-moe-for-agentic-reasoning/) model exemplify the kind of performance-driven co-design you will help advance...
Ready to Apply?
Submit your application for Senior Performance Architect, Nemotron at NVIDIA
Apply for this Position