Research Engineer, Computer Vision

Job Overview by Ladders

Qualifications

Bachelor's degree in Computer Science, Computer Engineering, or relevant field, or equivalent experience (must be completed prior to joining)
Proficiency in C++ and/or Python with knowledge of modern features
Experience with deep learning frameworks such as PyTorch and TensorFlow
Collaborative experience in cross-functional teams
Master's degree or higher in a relevant technical field is preferred
Experience with vision-language models or multi-modal transformers is a plus
Familiarity with large language models integration with visual systems

Responsibilities

Design and implement systems that integrate vision, language, and other sensory inputs
Develop algorithms for cross-modal learning and to enhance human-AI interaction
Lead the curation and management of diverse multi-modal datasets
Oversee ground truth annotation workflows and ensure data quality
Execute medium to large-scale features independently
Collaborate with research and engineering teams to spark multi-modal innovation
Write organized code with testing and documentation for production systems

Benefits

Flexible work hours and opportunities for remote work
Health, dental, and vision insurance
Generous vacation and paid time off
Retirement savings plans and matching
Access to professional development resources
Employee wellness programs and initiatives

Full Job Description

As a Research Engineer focused on Multi-Modal Understanding, you will develop advanced algorithms that integrate computer vision with other modalities such as language, audio, and sensor data. You will also drive the curation of multi-modal datasets and ground truth annotation pipelines to support model training and evaluation. You will work closely with our research team to bring innovative multi-modal solutions to production, bridging the gap between visual perception and holistic contextual understanding for immersive applications.

Responsibilities

Design and implement multi-modal understanding systems that combine vision, language, and other sensory inputs to enable richer contextual awareness
• Develop algorithms for cross-modal learning, fusion, and reasoning to improve human-AI interaction
• Lead the curation and management of multi-modal datasets, ensuring data quality and diversity across vision, language, and sensor modalities
• Design and oversee ground truth annotation workflows and quality assurance processes for multi-modal data
• Complete medium to large features spanning multiple tasks independently with minimal to no guidance
• Collaborate with researchers and engineers across computer vision and machine learning teams to drive multi-modal innovation
• Develop well-organized code with proper testing and documentation, building production-ready multi-modal systems

Minimum Qualifications
• Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
• Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
• Proven experience with C++ and/or Python, including experience with modern features
• Experience working with deep learning frameworks such as PyTorch and TensorFlow
• Demonstrated experience working collaboratively in cross-functional teams

Preferred Qualifications
• Master's degree in Computer Science, Computer Vision, Machine Learning, or related field
• Experience with vision-language models or multi-modal transformers
• Publications or contributions to multi-modal understanding research
• Familiarity with large language models and their integration with visual understanding systems
• Experience with data curation, annotation tools, or ground truth labeling pipelines

* Ladders Estimates

Similar Jobs

Applied Research Mathematician
$49K — $290K *
Gem.com
Annapolis, MD 21401 (Anne Arundel County)
Reposted Today
Sr. Research Chemist, Photochromic Dye
$85K — $115K *
PPG Industries
Monroeville, PA 15146 (Allegheny County)
Today
Marine Minerals Analyst / Domain Specialist (Geotechnical & Geophysical)
$80K — $120K *
Essnova Solutions, Inc.
Washington, DC 20011 (District Of Columbia County)
Today
Research Scientist 2
$49K — $290K *
Gem
College Park, MD 20740 (Prince Georges County)
Today
Associate Scientist at Gelest
$75K — $99K *
Eastman Chemical Company
Morrisville, PA 19067 (Bucks County)
Today
Senior Statistician
$92K — $146K *
Yale University
Whitney Point, NY 13862 (Broome County)
Today

Get Ready For Your
Next Interview

More Jobs at Meta

Mechanical Engineer, Data Center Design Engineering
$120K — $160K *
Menlo Park, CA 94025 (San Mateo County)
Today
Technical Services
In-Person
Research Engineer, Computer Vision
$90K — $130K *
Pittsburgh, PA 15237 (Allegheny County)
Today
Consumer Technology
In-Person
Software Engineer, Machine Learning RecSys
$130K — $180K *
Sunnyvale, CA 94087 (Santa Clara County)
Today
Information Technology
In-Person
ASIC Engineer, Physical Design
$120K — $160K *
Sunnyvale, TX 75182 (Dallas County)
Today
Consumer Technology
In-Person
ASIC Engineer, Physical Design
$130K — $180K *
Sunnyvale, CA 94087 (Santa Clara County)
Today
Enterprise Technology
In-Person

More Consumer Technology Jobs

Chief Product and Innovation Officer
$250K — $400K *
Blueair
New York, NY 10007 (New York County)
Reposted Today
Product Manager
$120K — $150K *
Clipboard Health
San Francisco, CA 94112 (San Francisco County)
Today
Head of Global Social Media
$130K — $180K *
Western Digital Technologies
Irvine, CA 92620 (Orange County)
Today
Growth Marketing Director (Remote)
$138K — $179K *
Cengage Learning
Remote
Reposted Today
Administrative Coordinator, Global E-Commerce
$80K — $98K *
TikTok
Seattle, WA 98115 (King County)
Today

Find similar Research Engineer, Computer Vision jobs:

Nationwide Pittsburgh, PA

Research Engineer, Computer Vision

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Research Engineer, Computer Vision jobs:

Get Ready For Your
Next Interview