Dolby Laboratories

PhD Research Intern - Multimodal AI (Fall 2026)

$129K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Currently enrolled in a PhD program in Computer Science, Electrical Engineering, Applied Mathematics, or a closely related field
  • Strong background in deep learning applied to multimedia research
  • Familiarity with leading AI model paradigms, including large language models and generative models
  • Proficiency in Python and at least one deep learning framework
  • Strong mathematical foundation
  • Excellent written and verbal communication skills

Responsibilities

  • Develop and apply multimodal AI architectures for joint understanding and generation of audio, visual, and language modalities
  • Design, implement, and train advanced multimodal AI models for spatial media content creation and generation
  • Prepare and curate high-quality datasets through data augmentation and synthetic data generation
  • Evaluate proposed models against state-of-the-art research benchmarks
  • Prototype and validate developed algorithms in realistic use cases
  • Present research findings and contribute to patent applications and scientific publications

Benefits

  • Opportunity to work in a cutting-edge research environment
  • Hands-on experience with advanced multimodal AI concepts
  • Collaboration with experts in deep learning and signal processing
  • Potential publication and patent contributions
  • Flexible application review process with a strong emphasis on timely submissions

Full Job Description

The Multimodal Lab is looking for a talented, self-motivated PhD student to explore multimodal AI models for multimodal source separation and for spatial media content creation and generation. This is a research-focused role ideal for candidates passionate about pushing the boundaries of audio-visual AI at the intersection of deep learning, signal processing, and generative media.

Responsibilities

  • Develop and apply multimodal AI architectures that integrate audio, visual, and/or language modalities for joint understanding and generation
  • Design, implement, and train advanced multimodal AI models for spatial media content creation and generation, including multimodal source separation and localization
  • Prepare and curate high-quality datasets through data augmentation and synthetic data generation
  • Evaluate proposed models against state-of-the-art research benchmarks
  • Prototype and validate the developed algorithms in realistic use cases
  • Present research findings and contribute to patent applications and scientific publications

Qualifications

  • Currently enrolled in a PhD program in Computer Science, Electrical Engineering, Applied Mathematics, or a closely related field
  • Strong background in deep learning with a proven ability to apply it to multimedia research challenges
  • Deep familiarity with leading AI model paradigms, including large language models (LLMs) and generative models (e.g., diffusion, VAE, GAN)
  • Proficiency in Python and at least one deep learning framework
  • Strong mathematical foundation
  • Excellent written and verbal communication skills

We will review applications on a rolling basis. For the best chance to have your resume reviewed and considered, we recommend submitting your application by June 26, 2026.

Eligibility

Currently enrolled in a PhD program. Recent graduates who are within 6 months of graduation are also eligible to apply. Must be available to work full-time, Monday through Friday, for 12 weeks between September 2026 and December 2026.

The start date for this internship is as follows (please note that this date is not flexible):

  • September 21, 2026

The San Francisco/Bay Area base hourly range for this internship position is $62/hr and can vary if outside of this location. Our hourly ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific hourly range and perks and benefits for your location during the hiring process.

About Dolby Laboratories

Dolby Laboratories, Inc. creates audio and imaging technologies that transform entertainment and communications at the cinema, at home, at work, and on mobile devices. The company develops and licenses its audio technologies, such as AAC & HE-AAC, a digital audio codec solution used for TVs, set-top boxes (STBs), personal computers (PCs), gaming consoles, mobile devices, and digital radio; AVC, a digital video codec with high bandwidth efficiency used in media devices; Dolby AC-4, an audio coding technology that delivers new audio experiences to a range of playback devices; and Dolby Atmos technology for home theaters, cinemas, device speakers, mobile devices, and headphones. Its audio technologies also comprise Dolby Digital, a digital audio coding technology that provides multichannel sound in the home; Dolby Digital Plus, a digital audio coding technology that delivers audio quality for streaming, downloaded, and broadcast content; Dolby TrueHD, a digital audio coding technology for content providers; Dolby Vision, an imaging technology for cinema, digital TV, and other consumer devices; and HEVC, a digital video codec with high bandwidth efficiency to support delivery of Ultra HD and other video content. In addition, the company designs and manufactures audio and imaging products, such as digital cinema servers, Dolby Cinema audio products, and other products for the film production, cinema, television broadcast, and entertainment industries. Further, it offers services to support theatrical and television production for cinema exhibition, broadcast, and home entertainment. The company serves film studios, content creators, post-production facilities, cinema operators, broadcasters, and video game designers. Dolby Laboratories, Inc. was founded in 1965 and is headquartered in San Francisco, California.
Size: 2,368 employees
Market Cap: $6.5 billion
Net Income: $317.8 million
Founded: 1965
5 Year Trend: +3%
Revenue: $1.2 billion
Stock exchange: NASDAQ
