AI Computing Development Engineer, TensorRT and TensorRT-LLM

NVIDIA

Shanghai, China Full-time June 03, 2026

Opportunity Description

NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like generative AI, computer vision, speech recognition, recommender systems, and large-scale language and multimodal models. Join the team building the inferencing software (TensorRT/TensorRT-LLM) that will be used across our product lines. The ability to work in a fast-paced, delivery-focused environment is required, and excellent interpersonal skills are a must.

What you'll be doing:
+ Design and develop robust inferencing software (TensorRT/TensorRT-LLM) optimized for functionality and performance across platforms
+ Perform performance analysis, optimization, and tuning of deep learning inference workloads
+ Track academic and industry advancements in AI and update TensorRT/TensorRT-LLM features accordingly
+ Provide feedback into archit...

Location Shanghai, China

Country China

Type Full-time

Category other-general

Posted June 03, 2026

Deadline June 07, 2026
