Machine Learning Engineer
Software Engineering
San Francisco, CA, USA
About XOR
XOR is a platform that helps world-class companies pushing the frontier of AI hire exceptional ML, RL, and AI engineering talent.
About Our Client
Our client is a well-funded AI startup working on next-generation training systems for large language models. The team is small, technical, and moving fast, with a strong focus on hands-on engineering over process.
About the Role
This team designs and builds training tasks that safely advance model capabilities in machine learning research and engineering - specifically, teaching frontier models to do the work of an ML engineer or researcher. The role blends research and engineering: staying current with the latest research, developing novel approaches, and realizing them in code, with full ownership and autonomy over what you build. Work includes designing and implementing training tasks, conducting experiments and evaluations, delivering work into production training runs, and collaborating with other researchers and engineers. This role is for experienced ML engineers (a separate track exists for new graduates).
What You'll Do
- Design and build training tasks and scoring functions that produce clean, learnable signals for frontier models on ML research and engineering tasks
- Build deep expertise across the frontier of ML research, training, and inference infrastructure
- Collaborate with others to brainstorm and create new ideas and tools to improve the task-building process
What We're Looking For
- Strong ML fundamentals and broad research interests - you read many papers or tutorials, understand topics deeply, and have the creativity to translate them into rigorous, verifiable problems
- Proficiency in Python and systems programming, and at least one of PyTorch or JAX
- Ownership mentality and ability to drive solutions end-to-end
- Passion for staying current with the rapidly evolving ML infrastructure landscape
- Ability to meet throughput expectations and respond quickly to feedback
Nice to Have
- Expert knowledge in an active DL/ML research area, with publications or public code to show for it - research experience (PhD, MS) is a big plus
- Deep understanding of transformer internals, training/inference of modern LLMs, experience with inference libraries (vLLM, SGLang, etc.)
- Strong expertise in kernel development (CUDA, Triton, Pallas)
- Experience building complex interactive evaluation or training environments
