OpenAI

RE/RS, Data Understanding (MM)

Salary: $120K to $180K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Strong track record in machine learning through publications or applied research.
  • Proven ability to drive a research agenda independently.
  • Experience with multimodal data problems.

Responsibilities

  • Synthesize multimodal content such as images, audio, and video.
  • Improve noisy data pipelines for better quality control.
  • Automate data preparation using advanced models.
  • Measure dataset changes to evaluate their impact on model performance.
  • Develop high-quality datasets for training large models.

Benefits

  • Opportunity to work on cutting-edge AI research and its applications.
  • Collaborative and empirical research environment at OpenAI.
  • Ability to influence the development of multimodal learning.

Full Job Description

About The Team

The Data Understanding team is responsible for creating high-quality datasets and their quantized representations for OpenAI. This includes synthesizing multimodal data, building vector-quantized (VQ) representations, and processing, filtering, deduplicating, quality-controlling, and tokenizing data so that it can be used effectively in large model training runs.

About The Role

We're looking to advance how OpenAI prepares, curates, synthesizes, and understands multimodal data at scale. You'll work on research and production problems such as synthesizing multimodal content (images, audio, and video) along with its supervision signals, improving noisy data pipelines, building better quality filters, using models to automate data preparation, and measuring whether changes to a dataset improve model performance.

We Expect You To

  • Have a strong track record of developing new or improved ML ideas, shown through publications, projects, or applied research.
  • Own and drive a research agenda, from choosing the right multimodal data problems to carrying long-running work through to impact.
  • Be excited by OpenAI's empirical, collaborative approach to research.

Nice To Have

  • Experience with multimodal learning, audio, vision, video, synthetic data, or data-centric ML.
  • Thoughtfulness about AI's impact, including privacy, provenance, and data quality.
  • Experience building high-performance deep learning or large-scale data processing systems.

About OpenAI

OpenAI is an artificial intelligence research laboratory consisting of the for-profit corporation OpenAI LP and its parent company, the non-profit OpenAI Inc. The company was founded in 2015 by a group of technology leaders, including Elon Musk, Sam Altman, Greg Brockman, Ilya Sutskever, and John Schulman. OpenAI's mission is to develop and promote friendly AI for the betterment of humanity. The company has developed a number of cutting-edge AI technologies, including GPT-3, a large language model that can generate human-like text. OpenAI has received funding from a number of high-profile investors, including LinkedIn co-founder Reid Hoffman and venture capitalist Peter Thiel.
Size: 100 employees
Founded: 2015
