ML Ops Engineer (Boston, MA)

Foundation EGI

$120K — $150K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • BS in Computer Science or a related field
  • 5+ years of experience in AI/ML Ops or related roles
  • Expert-level skills in Python and TypeScript
  • Proficient with Docker, Kubernetes, and Terraform
  • Deep understanding of machine learning models, especially LLMs
  • Experience in designing CI/CD pipelines for ML models
  • Strong written and verbal communication skills

Responsibilities

  • Architect and operate end-to-end ML pipelines on Google Cloud and AWS
  • Define and maintain logging, monitoring, and alerting for models
  • Automate CI/CD for ML artifacts using GitHub Actions
  • Collaborate with cross-functional teams including engineers and researchers
  • Write clean, maintainable code with proper documentation
  • Ensure high availability and performance of systems

Benefits

  • Opportunities for professional growth and development
  • Flexible work environment
  • Collaboration with diverse, talented teams
  • Access to cutting-edge technologies and tools
  • Participation in innovative projects in the AI/ML space
Full Job Description
Requirements:

  • Architect, build, and operate end-to-end ML pipelines for training, validation and deployment on Google Cloud and AWS.
  • Define, instrument, and maintain logging, monitoring, and alerting for model performance and data drift.
  • Automate CI/CD for ML artifacts and infrastructure using GitHub Actions or equivalent.
  • Collaborate with cross-functional teams, including frontend engineers, backend engineers, research engineers, and infrastructure engineers.
  • Write clean, well-documented, fast, and maintainable code.
  • Help ensure our systems have high availability and performance.
  • Experience in computer graphics or physics-based simulation.
  • Background in setting up Prometheus/Grafana, ELK, or similar monitoring stacks.
  • Experience with Vertex AI.
  • Experience working with custom Domain-Specific Languages.

What we're looking for

  • BS in Computer Science or a related field.
  • 5+ years of experience as a AI/ML Ops, DevOps, Infrastructure Engineer or equivalent.
  • Expert-level Python and TypeScripts skills.
  • Experience with Docker, Kubernetes, Terraform, Google Cloud and AWS.
  • Deep understanding of machine learning models, including LLMs.
  • Experience designing and maintaining CI/CD pipelines to fine-tune or train ML models.
  • Excellent written and verbal communication skills.


Bonus Points

  • Experience in computer graphics or physics-based simulation.
  • Background in setting up Prometheus/Grafana, ELK, or similar monitoring stacks.
  • Experience with Vertex AI.
  • Experience working with custom Domain-Specific Languages.


Our tech stack

  • Google Cloud, AWS
  • Python, TypeScript
  • Protobuf, gRPC
  • Next.JS, React.JS
  • GitHub Actions
  • Docker, Kubernetes, Spinnaker
  • PostgreSQL


Similar Jobs

More Jobs at Foundation EGI

  • Software Architect
    $130K — $180K *
    Boston, MA 02115 (Suffolk County)
    Information Technology
    In-Person
  • ML Ops Engineer (Boston, MA)
    $120K — $150K *
    Boston, MA 02115 (Suffolk County)
    Information Technology
    In-Person

More Information Technology Jobs

Find similar ML Ops Engineer (Boston, MA) jobs: