Opportunity Description
Desay SV Automotive Singapore Pte. Ltd. is an innovative organization committed to exploring frontier technologies. While the company has a strong background in automotive electronics, this role is exclusively focused on advancing applications in large language models and on-device AI inference.
Duties/ Responsibilities
- On-Device Inference Engine Development. Design, develop, and optimize LLM inference engines for embedded, mobile, and edge devices — covering operator development, graph optimization, memory management, and multi-backend adaptation
- Model Compression & Lightweight Deployment. Research and apply quantization (INT4/INT8/FP16), pruning, distillation, and KV Cache compression techniques to achieve efficient inference on resource-constrained hardware
- Heterogeneous Hardware Optimization. Conduct operator-level performance tuning for ARM CPU, NPU, GPU, and DSP; use profiling tools to identify bottlenecks and ...
Ready to Apply?
Submit your application for Senior AI Engineer - Large-Scale Foundation Models (LLM / VLM) at Desay SV
Apply for this Position