Staff ML Performance Engineer (Inference Optimisation)

Wayve

london, england, United-Kingdom Full-time May 31, 2026

Opportunity Description

The role

As a Staff ML Performance Engineer, you’ll play a key role in high-impact projects, optimising ML inference for edge accelerators and GPUs. The focus of this team is to run large transformer-based models efficiently on low-cost, low-power edge devices to enable Wayve’s first driving product. You’ll help set the technical direction for turning these models into production systems that run reliably on in-vehicle compute. This is a hands-on role working across ML systems, compilers, runtimes, kernels, and embedded deployment, contributing to several early-stage, high-impact projects at Wayve.

Key Responsibilities

Profile and pinpoint bottlenecks across the full inference stack (model graph, compiler/runtime, kernel execution, memory movement) and deliver measurable improvements.
Implement and validate optimisations in compilers, runtimes, and/or kernels (e.g. operator fusion, scheduling, quantisation-aware performance, custom kern...

Full-time IT & Technology

Ready to Apply?

Submit your application for Staff ML Performance Engineer (Inference Optimisation) at Wayve

Apply for this Position

Location london, england

Country United-Kingdom

Type Full-time

Category IT & Technology

Posted May 31, 2026

Deadline July 10, 2026

Staff ML Performance Engineer (Inference Optimisation)

Opportunity Description

The role

Key Responsibilities

Ready to Apply?

Opportunity Details

About Wayve

Wayve

Share This Opportunity