Member of Technical Staff, Pretraining

Mirendil

$350K — $500K *
Technical Services
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of experience in machine learning or related engineering field
  • Strong background in large-scale distributed systems and GPU computing
  • Proficiency in designing and optimizing model architectures and training algorithms
  • Experience with data processing and pipeline development
  • Familiarity with scientific experimentation and statistical analysis techniques

Responsibilities

  • Implement and iterate on model architectures and training algorithms based on research insights
  • Scale distributed training across thousands of GPUs to enhance performance
  • Optimize training throughput for new attention mechanisms and architecture variants
  • Design and build large-scale data pipelines for model efficiency
  • Run and analyze experiments to understand architecture and data impacts on model performance

Benefits

  • Meaningful equity grant based on experience and background
  • Access to competitive health and wellness benefits
  • Opportunity to work at the forefront of AI research and engineering
  • Collaborative environment fostering both research and practical application
Full Job Description
The Role

We are looking for an engineer to work at the intersection of research and systems on our pretraining stack. You'll contribute across the full pipeline, from data processing and model architecture to distributed training infrastructure and low-level optimization, and help determine how we scale our next generation of models. Some example areas you might work on (not limited to):

  • Implement and iterate on model architectures, training algorithms, and optimizer research in large-scale pretraining runs
  • Scale distributed training jobs across thousands of GPUs
  • Optimize training throughput for novel attention mechanisms, architecture variants, and compute efficiency improvements
  • Design and build large-scale data pipelines for efficient model consumption and dataset curation
  • Run and analyze scientific experiments to advance understanding of how architecture and data choices affect model capabilities


If you're excited about working across research and engineering to push the frontier of what large models can do, we'd love to hear from you.

We offer a base salary of $350,000-$500,000 USD and a meaningful equity grant, depending on experience and background, along with competitive benefits.

Similar Jobs

More Jobs at Mirendil

More Technical Services Jobs

Find similar Member of Technical Staff, Pretraining jobs: