Google

Software Engineer III, Multimodal Agentic AI, XR

Google$147K — $211K *
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree or equivalent practical experience.
  • 2 years of software development experience in Python or C.
  • 1 year of experience with ML infrastructure, including model deployment and debugging.
  • Experience with Generative AI techniques such as LLMs and computer vision.
  • Master's degree or PhD in Computer Science or related field (preferred).
  • 2 years' experience with data structures and algorithms (preferred).
  • Experience in applied research for enhancing language and multimodal models (preferred).

Responsibilities

  • Design, develop, and deploy scalable agentic AI solutions for multimodal conversational AI on smart glasses.
  • Understand the Gemini Live and Astra tech stacks; optimize agent architecture for efficient deployment.
  • Take ownership of AI quality in production, defining metrics and driving data-driven improvements.
  • Implement and enhance AI techniques focused on multimodal conversational quality and reasoning tasks.

Benefits

  • Comprehensive healthcare coverage.
  • Flexible working hours and remote work options.
  • Retirement savings plan with company matching.
  • Generous parental leave and family support programs.
  • Continuous learning and development opportunities.
Full Job Description
Minimum qualifications:
  • Bachelor's degree or equivalent practical experience.
  • 2 years of experience with software development in Python or C .
  • 1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).
  • Experience with GenAI techniques (e.g., LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (language modeling, computer vision).

Preferred qualifications:
  • Master's degree or PhD in Computer Science, or a related technical field.
  • 2 years of experience with data structures and algorithms.
  • Experience conducting applied research to enable new functionality and improve the quality and efficiency of large language and multimodal models.
  • Knowledge of machine learning and statistics.


About the job
Our team is at the forefront of building the next generation of conversational AI. We're developing agentic AI solutions for smart glasses, utilizing Gemini Live and Astra to create a unique and trusted multimodal experience. This technology delivers instant, natural conversational intelligence directly to the user's eye, allowing them to navigate their world more immersively than ever.

In this role, you will design multimodal agentic solutions focused on goal-oriented reasoning tasks. You will enhance and develop new multimodal tools and extensions. You will define and execute the strategy for data, evaluation, and post-tuning of the Gemini model to enhance its impact for smart glasses use cases.
For decades, the computing revolution has reshaped our world driven by
breakthroughs in compute, connectivity, mobile, and now, AI. Google's XR team is at the forefront of the next major leap - the convergence of AI and XR. This is more than just new devices - it's about reimagining how we interact with the world around us. We're building a future where
lightweight XR devices like smart glasses and headsets pair with helpful AI to augment human intelligence, offering personalized, conversational, and contextually aware experiences.Individual pay is determined by factors including job-related skills, experience, and relevant education or training.

US: $147000 - $211000 (USD) 15% bonus target bonus equity benefits

Learn more about benefits at Google .

Responsibilities
  • Design, develop, and deploy scalable and agentic AI solutions for high-value, real-world multimodal conversational AI use cases on smart glasses.
  • Gain an understanding of the Gemini Live and Astra tech stack and infrastructure. Optimize agent architecture/orchestration to ensure efficient deployment and operation at scale, with a focus on inference cost optimization.
  • Take ownership of AI quality for production systems. This includes defining technical metrics, implementing evaluation frameworks, analyzing loss patterns, and driving improvements through data collection and smart data generation and model enhancements.
  • Implement, optimize, and advance AI techniques, with a focus on multimodal conversational quality, multimodal tool use, and multimodal goal-oriented reasoning.


About Google

Google is a multinational technology company that specializes in Internet-related services and products. These include online advertising technologies, search engine, cloud computing, software, and hardware. Google was founded in 1998 by Larry Page and Sergey Brin while they were Ph.D. students at Stanford University. The company has grown tremendously since then and has become one of the most valuable companies in the world. Google's mission is to organize the world's information and make it universally accessible and useful.
Learn more about Google
Size
156,500 employees
Market Cap
$1,115.4 billion
Industry
Net Income
$40.2 billion
Founded
1998
5 Year Trend
+23.3%
Revenue
$182.5 billion
NASDAQ

Similar Jobs

More Jobs at Google

More Enterprise Technology Jobs

Find similar Software Engineer III, Multimodal Agentic AI, XR jobs: