Senior Software Engineer, Machine Learning Inference

NVIDIA

Santa Clara, CA, United States | Full-time | May 30, 2026

Opportunity Description

At NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some of the world’s most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators.


As a Senior Software Engineer in the TensorRT team, you will be responsible for designing and implementing inference software optimizations to power AI applications on NVIDIA GPUs. If you're ready to take on challenging projects and make a significant impact in a company that values creativity, excellence, and collaboration, we want to hear from you!


What you’ll be doing:
+ Design, develop, and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for datacenters, workstations, and PCs.
+ Develop software in C++, Python, and CUDA for seamless and efficient deployment of state-of-the-art LLMs and Generative A...