W

Staff ML Performance Engineer (Inference Optimisation)

Wayve

london, england, United-Kingdom Full-time May 31, 2026
Apply Now

Opportunity Description

The role

As a Staff ML Performance Engineer, you’ll play a key role in high-impact projects, optimising ML inference for edge accelerators and GPUs. The focus of this team is to run large transformer-based models efficiently on low-cost, low-power edge devices to enable Wayve’s first driving product. You’ll help set the technical direction for turning these models into production systems that run reliably on in-vehicle compute. This is a hands-on role working across ML systems, compilers, runtimes, kernels, and embedded deployment, contributing to several early-stage, high-impact projects at Wayve.

Key Responsibilities

  • Profile and pinpoint bottlenecks across the full inference stack (model graph, compiler/runtime, kernel execution, memory movement) and deliver measurable improvements.
  • Implement and validate optimisations in compilers, runtimes, and/or kernels (e.g. operator fusion, scheduling, quantisation-aware performance, custom kern...
Full-time IT & Technology

Ready to Apply?

Submit your application for Staff ML Performance Engineer (Inference Optimisation) at Wayve

Apply for this Position