Machine Learning Engineer, Data Mining

Motional

$144K — $192K *
US-AnywhereRemote in Pittsburgh, PA
Technical Services
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • BS or MS in Computer Science, Machine Learning, or a related field.
  • Hands-on experience with PyTorch (preferred) or TensorFlow/JAX.
  • Strong proficiency in Python, with skills in writing clean, modular code.
  • Knowledge of version control, unit testing, and basic software design patterns.
  • Experience with large datasets, SQL, and data libraries like Pandas and NumPy.
  • Understanding of the full ML lifecycle from data cleaning to deployment.
  • Commitment to learning and adaptability in a dynamic environment.

Responsibilities

  • Develop, train, and fine-tune machine learning models for multimodal sensor data.
  • Create scalable data preprocessing and augmentation pipelines for efficient model deployment.
  • Build embedding-based search tools and 'active learning' workflows for critical driving scenarios.
  • Monitor model health and performance through dashboard creation.
  • Participate in software engineering best practices, including code reviews and documentation.
  • Collaborate with senior engineers to translate model prototypes into scalable engineering solutions.

Benefits

  • Medical, dental, and vision insurance.
  • 401k plan with company match.
  • Health savings accounts and life insurance.
  • Pet insurance options available.
  • Flexible hybrid work schedule with remote options.
Full Job Description
Mission Summary:

As a Machine Learning Engineer on the Data Mining team, your mission is to help build the "Brain" of this engine. You will work with state-of-the-art foundation models to extract insights from Motional's driving data, working at the intersection of large-scale representation learning and data retrieval. By building smarter mining tools and efficient data pipelines, you will accelerate the model improvement lifecycle for teams working on post-training analysis, error diagnosis, and dataset curation.

What You'll Do:
  • Build and Train ML Pipelines: Develop, train, and fine-tune machine learning models for multimodal sensor data (e.g., vision, LiDAR). Focus on implementing supervised and self-supervised learning approaches to improve data search and retrieval.
  • Support Model Deployment: Implement scalable data preprocessing and augmentation pipelines. Assist in applying standard optimization techniques (e.g., batch inference, quantization) to ensure models run efficiently in production environments.
  • Data Mining & Analysis: Help develop embedding-based search tools and "active learning" workflows to identify critical driving scenarios.
  • Monitor Production Performance: Help build and maintain dashboards to monitor model health, data drift, and system performance. Identify regressions and assist in the operational support of our data mining services.
  • Learn and Apply Best Practices: Follow software engineering standards (version control, CI/CD, unit testing) for ML code. Participate in code reviews and contribute to technical documentation.
  • Collaborate Across Teams: Work closely with senior engineers and machine learning engineers to translate model prototypes into maintainable, scalable engineering solutions.

What We're Looking For (Must-Haves):
  • BS or MS in Computer Science, Machine Learning, or a related field.
  • Hands-on experience with PyTorch (preferred) or TensorFlow/JAX. You should be comfortable training models and evaluating them using standard metrics.
  • Strong proficiency in Python with the ability to write clean, modular, and well-documented code.
  • Working knowledge of version control, unit testing, and basic software design patterns.
  • Experience working with large datasets, including proficiency in SQL and data libraries like Pandas and NumPy.
  • A solid grasp of the full ML lifecycle, from data cleaning and feature engineering to validation and deployment basics.
  • A proactive learner who thrives on constructive feedback and is eager to grow within a high-stakes engineering environment.

Bonus Points (Nice-to-Haves):
  • MS/PhD in Computer Science, Machine Learning, or related field.
  • Experience with agentic systems, autonomous reasoning, chain-of-thought models, or LLM-based planning.
  • Background in autonomous driving, robotics, or real-time decision-making systems.
  • Familiarity with multimodal learning, sensor fusion, or embodied AI.
  • Experience building active learning loops, using the model to find the data that breaks the model.
  • Experience with ML-based data mining, active learning, or contrastive learning.
  • Knowledge of model serving tools (TF Serving, Triton, TorchServe) and MLOps platforms.
  • Publication in top-tier conferences (e.g., ICCV, CVPR, ECCV)

We encourage a hybrid schedule with in-office time at one of our locations in Boston, Pittsburgh, or Las Vegas to support collaboration, or this role can be fully remote.

The salary range for this role is an estimate based on a wide range of compensation factors including but not limited to specific skills, experience and expertise, role location, certifications, licenses, and business needs. The estimated compensation range listed in this job posting reflects base salary only. This role may include additional forms of compensation such as a bonus or company equity. The recruiter assigned to this role can share more information about the specific compensation and benefit details associated with this role during the hiring process.

Candidates for certain positions are eligible to participate in Motional's benefits program. Motional's benefits include but are not limited to medical, dental, vision, 401k with a company match, health saving accounts, life insurance, pet insurance, and more.

Salary Range

$144,000-$192,000 USD

Similar Jobs

More Jobs at Motional

More Technical Services Jobs

Find similar Machine Learning Engineer, Data Mining jobs: