Opportunity Description
Job Description
Role Overview
We are looking for a Software Lead (8+ years’ experience) to own the runtime and neural network (NN) layer of a next-generation AI accelerator platform. This role focuses on designing, optimizing, and implementing NN operators and developing new ops using CUDA/custom runtime APIs to deliver high-performance execution on custom AI hardware.
Key Responsibilities
- Design and optimize NN operators for performance-critical workloads
- Develop new NN ops using CUDA/custom runtime APIs
- Drive runtime-level optimizations across compute, memory, and scheduling
- Own runtime ↔ NN layer interfaces and execution model
- Implement and optimize operator fusion (e.g., matmul + bias + LayerNorm) for efficient hardware utilization<...
Ready to Apply?
Submit your application for AI Software Lead – PyTorch & CUDA Runtime (Next-Gen Accelerator) at Sandisk
Apply for this Position