Opportunity Description
We are seeking a Deep Learning Research Engineer to join our team and help develop the next generation of Large Language Model (LLM) inference algorithms. You will work on technologies that directly enhance NVIDIA's software, making the latest LLMs more efficient and accessible to users worldwide. This role is designed for someone with strong research foundations who also wants to build software that runs and scales into production systems across the world.
By joining us, you will be part of a strategic effort to establish NVIDIA as the definitive platform for high-performance LLM inference. The work requires a combination of research taste, experimental rigor, and engineering ownership: you will explore new ideas, run rigorous evaluations, and help transform successful approaches into tools and implementations.
What you'll be doing:
+ Develop and improve benchmarks, profiling workflows, and evaluation pipelines that make inference performance measurable and...
By joining us, you will be part of a strategic effort to establish NVIDIA as the definitive platform for high-performance LLM inference. The work requires a combination of research taste, experimental rigor, and engineering ownership: you will explore new ideas, run rigorous evaluations, and help transform successful approaches into tools and implementations.
What you'll be doing:
+ Develop and improve benchmarks, profiling workflows, and evaluation pipelines that make inference performance measurable and...
Ready to Apply?
Submit your application for Senior Deep Learning Research Engineer, LLM Inference at NVIDIA
Apply for this Position