AI Research Scientist, Audio-Visual Understanding, FAIR

Meta

$130K — $180K *
Consumer Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science or a related field
  • PhD in AI, computer science, or a relevant technical area
  • Experience in an industry, postdoctoral or research position
  • Research background in AI, machine learning, or applied mathematics
  • Publications demonstrating research experience
  • Proficiency in Python or similar programming languages
  • Experience in data analysis and collection

Responsibilities

  • Develop advanced audio-visual understanding systems for perception
  • Build and evaluate audiovisual language models for social interactions
  • Contribute to benchmarks for visual social understanding
  • Train and optimize machine learning methodologies
  • Collaborate on global research projects

Benefits

  • Access to cutting-edge technology and research facilities
  • Opportunity to work in an interdisciplinary team
  • Innovative work environment focused on advancing AI
  • Engagement in meaningful research impacting the future

Full Job Description

Meta is seeking a Research Scientist to join Fundamental AI Research (FAIR), a research organization focused on making significant advances in AI. Our organization is driven by advancing the science of intelligence and developing technology toward achieving superintelligence. We are seeking researchers with experience in computer vision, speech and multimodal learning to join our team and help build the perceptual foundations for real-time embodied conversational agents. This role offers the opportunity to collaborate with a highly interdisciplinary team of scientists, engineers, and cross-functional partners, with access to cutting-edge technology, resources, and research facilities.

Responsibilities

• Develop joint audio-visual understanding systems that integrate visual and auditory signals for advanced perception
• Build and evaluate audiovisual language models for social interactions and understanding, including predicting social intent, semantic function, and reasoning from human-centric inputs
• Contribute to benchmarks and evaluation frameworks for visual social understanding and interactions
• Train and optimize state-of-the-art machine learning and neural network methodologies
• Conduct and collaborate on research projects within a globally distributed team

Minimum Qualifications

• Bachelor's degree in Computer Science, Computer Engineering, a relevant technical field, or equivalent practical experience
• A PhD in AI, computer science, data science, or related technical fields
• Experience holding an industry, postdoctoral, faculty, or government researcher position
• Research background in machine learning, artificial intelligence, computational statistics, applied mathematics, or related areas
• Research publications reflecting experience in theoretical or empirical research
• Experience in developing and debugging in Python or similar programming languages
• Experience in analyzing and collecting data from various sources
• Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment

Preferred Qualifications

• Demonstrated research and software engineering experience via an internship, work experience, coding competitions, or widely used contributions in open source repositories (e.g., GitHub)
• Experience with audio-visual learning or multimodal fusion techniques
• Familiarity with human action recognition, social signal processing, or human-centric video understanding
• Experience with long-form video understanding, video-language models, or streaming perception systems
• Experience with vision-language models (VLMs) such as LLaVA, GPT-4V, Gemini, or similar architectures
• Experience with temporal modeling, video transformers, or recurrent architectures for sequential data

About Meta

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today: beyond the constraints of screens, the limits of distance, and even the rules of physics.

Equal Employment Opportunity

Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics.
