OpenAI

Researcher, Safety & Privacy

OpenAI
$120K to $180K *
Consumer Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • PhD or equivalent in Computer Science, Cryptography, Security, Machine Learning, or related fields
  • Deep understanding of privacy, security, and AI safety
  • Skill in translating ambiguous problems into formal frameworks and deployable systems
  • Proficiency in privacy-preserving computation techniques (e.g., secure enclaves, MPC, differential privacy)
  • Experience with AI safety, jailbreak detection, or model alignment

Responsibilities

  • Design and implement privacy-first architectures for harmful model behavior detection
  • Build frameworks for auditable private identification of high-risk content
  • Develop strict mechanisms triggered only by harm signals
  • Drive development of automated safety systems preserving privacy

Benefits

  • Engage in cutting-edge research at the intersection of AI and privacy
  • Collaborate on foundational problems in a pioneering field
  • Contribute to the development of systems that protect user data
  • Play a central role in shaping the future of AI safety and security

Full Job Description

About the Role:

We are seeking a Researcher in Privacy-Preserving Safety to help design and build the next generation of privacy-preserving safety systems for frontier AI models. This role sits at the intersection of AI safety, security, and privacy, with a focus on developing auditable, privacy-first mechanisms that enable robust harm detection and mitigation without exposing sensitive user data.

You will help define and operationalize frameworks for identifying and addressing frontier risks (e.g., bioweapon instructions, malware creation, suicide/self-harm risks, jailbreaks), while ensuring that privacy guarantees remain intact, even under adversarial conditions.

This role is central to our long-term goal of scaling our automated privacy-preserving safety systems to mitigate potential harms while minimizing human review.

You'll work on foundational problems such as privacy-preserving monitoring, algorithmic auditing, secure enclaves, and adversarially robust safety enforcement protocols, helping ensure that safety systems scale without compromising user trust.

In this role, you will:
  • Design and implement privacy-first architectures for detecting and mitigating harmful model behaviors.
  • Build frameworks for auditable private identification of high-risk content (jailbreaks, cyber threats, or weaponization instructions).
  • Develop strict, auditable mechanisms triggered only by harm signals.
  • Drive the development of automated safety systems that preserve privacy at every level.

You might thrive in this role if you:
  • Are a researcher with deep interest in privacy, security, and AI safety, motivated by building systems that are both trustworthy and effective at scale.
  • Hold a PhD or equivalent experience in Computer Science, Cryptography, Security, Machine Learning, or related fields.
  • Have the ability to translate ambiguous problem spaces into formal frameworks and deployable systems.
  • Demonstrate proficiency in one or more of:
    • Privacy-preserving computation (e.g., secure enclaves, MPC, differential privacy)
    • Security and adversarial systems
    • Machine learning safety or alignment
    • Design of robust systems under adversarial threat models
  • Have experience with AI safety, jailbreak detection, or model alignment.
  • Are familiar with privacy-preserving machine learning techniques, algorithmic auditing, and/or secure system design.


About OpenAI

OpenAI is an artificial intelligence research laboratory consisting of the for-profit corporation OpenAI LP and its parent company, the non-profit OpenAI Inc. The company was founded in 2015 by a group of technology leaders, including Elon Musk, Sam Altman, Greg Brockman, Ilya Sutskever, and John Schulman. OpenAI's mission is to develop and promote friendly AI for the betterment of humanity. The company has developed a number of cutting-edge AI technologies, including GPT-3, a language processing system that can generate human-like text. OpenAI has received funding from a number of high-profile investors, including LinkedIn co-founder Reid Hoffman and venture capitalist Peter Thiel.
Size: 100 employees
Founded: 2015
