Google

Research Engineer, Benchmarking, Robotics, DeepMind

Google: $147K - $211K *
Consumer Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science, Robotics, or similar experience.
  • 2 years of experience in machine learning tools, specifically deploying LLMs/VLMs.
  • Hands-on experience in software engineering, AI/ML engineering, or solutions architecture.
  • Proficient in Python and familiar with AI-assisted development tools for rapid prototyping.
  • Preferred: Experience with ROS/ROS2 and on-device deployment constraints.

Responsibilities

  • Design, implement, and maintain frameworks for evaluating robot policies.
  • Collaborate with researchers to optimize benchmark content for evaluation.
  • Develop diagnostic tools for identifying policy failures and performance issues.
  • Set evaluation standards for model releases, ensuring their readiness for demos.
  • Innovate processes for faster and reproducible real-world hardware evaluations.

Benefits

  • Comprehensive medical, dental, and vision insurance.
  • Generous paid time off and parental leave.
  • Retirement savings plan with a match.
  • Access to wellness programs and resources.
  • Opportunities for ongoing education and personal development.

Full Job Description

Minimum qualifications:
  • Bachelor's degree in Computer Science, Robotics, or equivalent practical experience.
  • 2 years of experience with machine learning tools and algorithms, specifically deploying LLMs/VLMs and deep learning models.
  • Experience in a technical role (software engineering, AI/ML engineering, or solutions architecture).
  • Experience with Python, and with modern AI-assisted development tools to accelerate prototyping.

Preferred qualifications:
  • Experience with ROS/ROS2, or on-device deployment constraints (Jetson, TPU).
  • Experience managing large-scale multimodal datasets, time-series telemetry data, or building automated pipelines for hardware-in-the-loop testing.
  • Familiarity with the operational realities of modern vision-language-action (VLA) models or behavior cloning policies, and with their common pitfalls, such as task overfitting.
  • A deep-seated interest in the future of embodied AI and a desire to build the testing bedrock for robotics development.


About the job

At Google, research-focused Software Engineers are embedded throughout the company, allowing them to set up large-scale tests and deploy promising ideas quickly and broadly. Ideas may come from internal projects as well as from collaborations with research programs at partner universities and technical institutes all over the world.

From creating experiments and prototyping implementations to designing new architectures, engineers work on real-world problems including artificial intelligence, data mining, natural language processing, hardware and software performance analysis, improving compilers for mobile platforms, as well as core search and much more. But you stay connected to your research roots as an active contributor to the wider research community by partnering with universities and publishing papers.

Our mission is to bring advanced AI into the physical realm by building generalist robots that perceive, reason, and act naturally alongside humans.

As a Research Engineer, you will manage the practical challenges of benchmarking foundation models for robotics. You will have an understanding of how modern robotics foundation models work and where they currently fall short. Your mission is to design evaluation protocols, tooling, and frameworks that extract meaningful signals from the messiness of physical policy execution. You will build the infrastructure that allows the engineering team to effectively hillclimb and gives leadership a clear, data-driven understanding of technological readiness.

US: $147,000 - $211,000 (USD) + 15% bonus target + equity + benefits

Learn more about benefits at Google.

Responsibilities
  • Design, implement, and maintain scalable, robust frameworks to enable large-scale evaluation of robot policies across offline open-loop testing and real-world hardware evaluations.
  • Partner with researchers to design the content of various benchmarks in order to maximize evaluation signal and stress-test model capabilities.
  • Build diagnostic and visualization tools that allow the team to easily root-cause policy failures and track performance regressions.
  • Establish evaluation criteria for model releases and own the stability and benchmarking of models slated for critical demos.
  • Innovate on how to make real-world hardware evaluation faster, more reproducible, and less reliant on manual human intervention.


About Google

Google is a multinational technology company that specializes in Internet-related services and products. These include online advertising technologies, search engine, cloud computing, software, and hardware. Google was founded in 1998 by Larry Page and Sergey Brin while they were Ph.D. students at Stanford University. The company has grown tremendously since then and has become one of the most valuable companies in the world. Google's mission is to organize the world's information and make it universally accessible and useful.
Size: 156,500 employees
Market Cap: $1,115.4 billion
Industry
Net Income: $40.2 billion
Founded: 1998
5 Year Trend: +23.3%
Revenue: $182.5 billion
NASDAQ
