The RoleWe're looking for a
Machine Learning Scientist to push the limits of small, high-performance language models. This is a deeply technical role focused on advancing the capabilities of our models for retrieval, application, and code generation.
The ideal candidate has a strong background in ML research and engineering, is comfortable working with both theory and production systems, and thrives in an environment where ideas turn into deployed infrastructure fast. This person should be excited to work on training methodology, optimization, evaluation, and model architecture at scale - and collaborate directly with infrastructure and product teams to get breakthroughs into production quickly.
This role is best suited for someone who loves both mathematical elegance and real-world impact.
Requirements- Strong background in machine learning, deep learning, or related fields.
- 2+ years of experience working on ML research or production systems.
- Fluency in Python and frameworks like PyTorch or JAX.
- Experience with training and optimizing large or efficient models.
- Strong understanding of applied optimization, distributed training, or model evaluation.
- Familiarity with code models, retrieval systems, or language modeling a plus.
- Advanced degree (MS or PhD) in a quantitative field, or equivalent industry experience.
- Willingness to work in-person from our SF office in FiDi.