Research Engineer, Frontier Safety Loss of Control, DeepMind

Google • $174K — $253K *

San Francisco, CA 94112 • In-Person

Information Technology

5 - 7 years of experience

1 week ago


Job Overview by Ladders

Qualifications

Bachelor's degree in Computer Science, Machine Learning, or a related technical field, or equivalent practical experience.
5 years of experience in engineering and agentic assistance, including software development in Python.
Experience in a frontier AI research and development environment.
Professional experience in a software engineering or research team setting.
Familiarity with technical stakeholders and managing relationships.
Experience in frontier model risk.

Responsibilities

Identify potential harms from misaligned agents and develop detection and prevention strategies.
Implement technical controls to monitor agent thoughts and behavior, and respond to mitigate risks.
Integrate behavioral signals from multiple agents to inform response policies.
Conduct adversarial testing of control systems.
Collaborate with internal product teams to ensure the adoption of control systems on high-risk AI surfaces.

Benefits

Comprehensive health coverage and wellness programs.
Generous paid time off and sick leave policies.
Retirement savings options with company matching.
Professional development opportunities and educational reimbursements.
Access to cutting-edge technology and tools.

Full Job Description

Minimum qualifications:

Bachelor's degree in Computer Science, Machine Learning, or a related technical field, or equivalent practical experience.
5 years of experience in engineering and agentic assistance, including software development in Python.
Experience working in a frontier AI research and development environment.
Experience working in a professional software engineering or research team environment.
Experience working with technical stakeholders.
Experience in frontier model risk.

Preferred qualifications:

Experience in engineering or product design for AI tools or assistants, especially those focused on ML Research and Development (R&D).
Experience with cybersecurity detection and response.
Experience collaborating on or leading an applied ML project.
Experience with Large Language Model (LLM) training and inference.
Knowledge of AI control, chain-of-thought and other monitoring, faithfulness, monitorability, and related research areas.

About the job

Our team develops monitoring and control for potentially misaligned AI to mitigate risks of extreme harms. Currently, this primarily involves: designing, building, and testing monitors for potentially dangerous behaviours; developing and implementing response policies to preserve AI usefulness while mitigating risks; and foreseeing ways in which our control tools might be bypassed or degraded. We are looking for an engineer who can rapidly iterate to solve never-before-seen problems with creativity and thoroughness.

The Loss of Control team contributes to a defense in depth against the risk of misaligned AI systems being deployed. We take the possibility of very advanced AI seriously. We don't think control is a suitable alternative to alignment in the limit of advancing intelligence. But while AI remains effectively monitorable, we think that control is an important part of an overall strategy for building safe AI.

We are looking for a research engineer for the Frontier Safety Loss of Control team within the AGI Safety and Alignment Team based in either San Francisco or London.

In this role, the core responsibility is to help Google prepare for the internal use of potentially misaligned AI systems. That means building defense-in-depth against AI that might persistently pursue goals that users and system developers did not intend.

US: $174,000 - $253,000 (USD), plus 15% bonus target, equity, and benefits.

Learn more about benefits at Google.

Responsibilities

Identify potential harms from misaligned agents and develop strategies for detection and prevention.
Implement technical controls to monitor agent thoughts and behaviour, and to respond in ways that mitigate potential harms.
Integrate various agent behaviour signals from across the organisation to inform response policies.
Conduct adversarial testing of controls.
Work with internal product teams to ensure that control systems are adopted across all high-risk AI surfaces.

About Google

Google is a multinational technology company that specializes in Internet-related services and products. These include online advertising technologies, a search engine, cloud computing, software, and hardware. Google was founded in 1998 by Larry Page and Sergey Brin while they were Ph.D. students at Stanford University. The company has grown substantially since then and has become one of the most valuable companies in the world. Google's mission is to organize the world's information and make it universally accessible and useful.


Size: 156,500 employees
Market Cap: $1,115.4 billion
Industry: Enterprise Technology
Net Income: $40.2 billion
Founded: 1998
5 Year Trend: +23.3%
Revenue: $182.5 billion
NASDAQ: GOOGL

* Ladders Estimates
