Video AI Engineer

Zoom Video Communications, Inc.

$137K to $275K
Consumer Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • PhD or Master's degree in Electrical Engineering, Computer Science, Applied Mathematics, or related fields.
  • Experience with C/C++ or Objective-C, and Python, with released projects and publications.
  • Hands-on experience in video generation and neural rendering techniques like NeRF and 3D Gaussian Splatting.
  • Experience with machine learning techniques including generative and diffusion models.
  • Understanding of deepfake detection methods, including biometric analysis and vision-language models.
  • Familiarity with multi-threaded programming and communication mechanisms.
  • Knowledge of multimedia stream data processing flows, particularly in 3D scene or point cloud workflows.

Responsibilities

  • Build and develop video and generative video processing applications for desktop and mobile.
  • Conduct research and evaluate performance of video processing and generation algorithms.
  • Design and implement algorithms within Zoom's video and 3D reconstruction pipelines.
  • Write modular, production-ready code for video, neural rendering, and 3D Gaussian Splatting algorithms.
  • Optimize algorithms for real-time performance on multiple platforms.
  • Integrate and deploy deep learning models across various operating systems.
  • Set up testing environments and develop unit tests for video and 3D pipeline components.

Benefits

  • Comprehensive benefits program focusing on physical, mental, emotional, and financial health.
  • Support for work-life balance through various perks and options.
  • Recognition as part of an award-winning workplace culture.
  • Opportunities for community contribution and meaningful engagement.
Full Job Description
What you can expect

As a Video AI Engineer, you'll enhance video codecs, video generation, and real-time 3D reconstruction to improve video quality, immersion, and performance in Zoom products. You will work across our stack, developing software ranging from the web server to the business application layers of our distributed, cloud-hosted backend. Working alongside leading experts in the field, you'll deliver happiness to our users and grow your knowledge base every day.

Responsibilities
  • Building and developing video and generative video processing applications on both desktop and mobile systems
  • Participating in research and performance evaluation of video processing, video generation, and 3D reconstruction algorithms
  • Designing and developing algorithms in Zoom's video and 3D reconstruction processing pipelines at both module and system levels
  • Implementing video, neural rendering, and 3D Gaussian Splatting algorithms with modular, well-organized, and production-ready code
  • Optimizing video, generative, and 3D reconstruction algorithms to achieve real-time performance on corresponding platforms
  • Customizing, integrating, and shipping deep learning models (including video generative models and 3D neural rendering models) across Mac, Windows, iOS, and Android
  • Setting up test environments, developing test tools, and designing unit tests for runtime verification of video and 3D pipeline components


What we're looking for
  • Hold either a PhD or Master's degree in Electrical Engineering, Computer Science, Applied Mathematics, or related fields
  • Have experience with C/C++ or Objective-C, and Python, as well as talking avatar/head/portrait work (with released projects and top conference papers)
  • Have hands-on experience with video generation or video diffusion models, neural rendering techniques (e.g., NeRF, 3D Gaussian Splatting), and 3D reconstruction systems
  • Have hands-on experience with machine learning techniques such as generative models, diffusion models, discriminative models, or transfer learning
  • Possess experience with or a solid understanding of at least one deepfake detection approach. This includes biometric analysis-based methods, vision-language model (VLM)-based techniques, interactive behavior analysis, or multimodal signal modeling that leverages visual, temporal, and audio cues.
  • Have familiarity with multi-threaded programming and communication mechanisms
  • Have an understanding of multimedia stream data processing flows, ideally including 3D scene or point cloud pipelines


Salary Range or On Target Earnings:

Minimum:
$137,700.00

Maximum:
$275,400.00

In addition to the base salary and/or OTE listed, Zoom has a Total Direct Compensation philosophy that takes into consideration base salary, bonus, and equity value.

Note: Starting pay will be based on a number of factors and commensurate with qualifications & experience.

We also have a location-based compensation structure; there may be a different range for candidates in this and other locations.

At Zoom, we offer a window of at least 5 days for you to apply because we believe in giving you every opportunity. Below is the potential closing date, just in case you want to mark it on your calendar. We look forward to receiving your application!

Anticipated Position Close Date:

07/08/26

Ways of Working
Our structured hybrid approach is centered around our offices and remote work environments. The work style of each role (Hybrid, Remote, or In-Person) is indicated in the job description/posting.

Benefits
As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways.
