Machine Learning Engineer II – Autonomous Driving, Inference Runtime

Worldwide Salaried Open

Job Description:

Deploy and Optimize Machine Learning model architectures across May’s Autonomous Driving training and inference stacks.
Own the model-compilation and deployment pipeline end-to-end.
Establish and defend latency/throughput budgets across the AV stack, including profiling, regression and integrity tests.

Requirements:

Bachelor’s or Master’s degree in Robotics, Computer Science, Computer Engineering, or a related field with strong mathematical and engineering foundations.
A minimum of 2 years writing software to interface with GPU and ML systems.
Proficiency in C/C++/CUDA/PyTorch and experience in Linux environments.
Familiarity with basic Perception and Planning concepts in Autonomous Driving.
Familiarity with NVIDIA compute architectures (Ada, Hopper, Blackwell, etc).
Familiarity with common profiling tools such as Nsight, Pytorch Profiler, flamegraph.
Understanding of Quantization (INT8/FP8/FP16) and other compression techniques.
Familiarity with NVIDIA DRIVEOS architecture and SoCs (Orin/Thor).
Familiarity with techniques for scaling training throughput (batching, FSDP, streaming dataloaders).

Benefits:

Comprehensive healthcare suite including medical, dental, vision, life, and disability plans. Domestic partners who have been residing together at least one year are also eligible to participate.
Health Savings and Flexible Spending Healthcare and Dependent Care Accounts available.
Rich retirement benefits, including an immediately vested employer safe harbor match.
Generous paid parental leave as well as a phased return to work.
Flexible vacation policy in addition to paid company holidays.
Total Wellness Program providing numerous resources for overall wellbeing

Apply tot his job Apply To this Job

More jobs