Research Scientist, RL & Simulation

Mecka AI

$100K — $150K *
Technical Services
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • MSc/PhD in robotics, ML, or related field.
  • Hands-on experience with robot simulation and policy learning.
  • Proficient in Python with strong engineering principles.
  • Ability to manage end-to-end processes from environment to evaluation.
  • Familiarity with manipulation, dexterous hands, or locomotion as strong signals.

Responsibilities

  • Build and maintain simulation environments for robotics learning.
  • Decide priority of environments to enhance learning velocity.
  • Convert human demonstrations into robot-executable trajectories.
  • Explore various retargeting approaches including IK and optimization.
  • Train policies using imitation learning and reinforcement learning.
  • Define and track evaluation metrics and stress tests for policies.
  • Drive sim-to-real transfer through various modeling techniques.

Benefits

  • Opportunity to shape scalable robot learning signals from human behaviors.
  • Significant ownership of projects with rapid iteration.
  • Direct work with real-world datasets to inform research.
Full Job Description
The Role

We are looking for a Research Scientist, RL & Simulation to own the RL + simulation engine that turns large-scale human demonstrations into scalable robot learning signals.

This is a research-meets-systems role: you'll build simulation environments, retarget human motion to robot actions, train and evaluate policies, and drive sim-to-real transfer with clear metrics.
What You'll Work On
Simulation Environments
  • Build and maintain simulation environments for robotics learning (e.g., Isaac Sim / Isaac Gym, MuJoCo, Genesis, Habitat, ManiSkill).
  • Decide what environments and assets to build first to maximize learning velocity.
Retargeting (Human 12 Robot)
  • Convert human demonstrations into robot-executable trajectories.
  • Explore IK-based, optimization-based, and learning-based retargeting approaches.
Policy Learning & Evaluation
  • Train policies from demonstrations using imitation learning + RL:
    • Behavior Cloning, DAgger-style aggregation, Offline RL
    • PPO / SAC (or similar) when online fine-tuning is required
  • Define evaluation: success metrics, stress tests, generalization, and regression tracking.
Sim-to-Real
  • Drive transfer via domain randomization, system identification, contact modeling, and failure-mode analysis.
  • Use real data to identify domain gaps that matter.
Who You Are
Required Background
  • MSc/PhD (or equivalent research experience) in robotics, ML, or a related field.
  • Strong hands-on experience with robot simulation and policy learning.
  • Proficiency in Python; solid engineering discipline (reproducible experiments, clean code, debugging).
  • Comfort working end-to-end: environment 12 data 12 training 12 evaluation.

Strong Signals:
  • Experience with manipulation, dexterous hands, or locomotion.
  • Experience with retargeting, IK, trajectory optimization, or differentiable simulation.
  • Deep intuition for what makes sim-to-real succeed or fail.
Why This Role
  • Define how Mecka turns egocentric human behavior into scalable robot learning signals.
  • High ownership, fast iteration, and direct connection to real-world datasets.

Similar Jobs

More Jobs at Mecka AI

More Technical Services Jobs

Find similar Research Scientist, RL & Simulation jobs: