Staff Machine Learning Infrastructure Engineer in Mountain View, CA

Intuit Inc   •  

Mountain View, CA 94035

Industry: Enterprise Technology

  •  

Less than 5 years

Posted 55 days ago

Overview

Intuit AI is paving the way in Machine Learning and Artificial Intelligence for all facets of the Intuit product suite, allowing Intuit to power prosperity for everyone. Intuit AI Engineering is a dynamic and innovative group whose mission is to enhance overall engineering capabilities for the purpose of improving efficiency, productivity, and impact of AI practitioners. In this team, you'll be tasked with owning and delivering cutting edge platforms, frameworks, and services that power AI at Intuit.

Machine Learning Infrastructure Engineers work at the intersection of software and systems engineering to design and build large scale consumer experiences which are reliable, operable, secure, highly available, disaster-ready, and performant. Join a world-class operations team and utilize your programming and operations talents to apply the latest patterns in continuous delivery, cloud operations, containerization, and serverless technology and help us build the next generation AI/ML platform and delight millions of customers.

In this role, you'll be part of a vibrant team of data scientists, product developers, and machine learning engineers. You'll be expected to help architect, code, optimize, and deploy machine learning models at scale using the latest industry tools and techniques. You will be responsible for defining and maintaining the day to day as well as the future operations and reliability for a variety of use-cases. You will also help automate, deliver, monitor, and improve machine learning solutions while ensuring data and models are secure.


Responsibilities

  • Design and build systems which improve machine learning scalability, usability, and performance.
  • Work cross-functionally with product managers, data scientists, and engineers to understand, implement, and deploy machine learning pipelines.
  • Work with data scientists and MLE to build pipelines to train and deploy models.
  • Effectively communicate results to peers and leaders.
  • Explore the state-of-the-art technologies and apply them to deliver customer benefits.
  • Embed into development projects and assist teams in delivering new features, services, and consumer experiences through production with speed and quality. You honor operational rigor and think first about customer experiences
  • Consider security, reliability, operability, and performance when designing and implementing technical solutions
  • Full stack system engineering. Utilize your deep knowledge of networking, systems, and software engineering coupled with your understanding of service-oriented architectures, mastery of CI/CD and AWS to steer operational approaches and solutions
  • Work on significant assignments which are broad in scope and complexity, span organizational boundaries, and cover a wide range of technology
  • Lead Failure Mode Effect Analysis (FMEA) working sessions and automate chaos engineering within the Intuit AI ecosystem
  • Expertly build and operate services within AWS. Assist project teams with designing and setting up highly available cloud environments
  • Think pragmatically when introducing new technology. Balance the demand for realistic, short-term goals to meet project and seasonal deadlines, with the need for longer-term, potentially more disruptive innovation

Qualifications

  • BS, MS, in Computer Science or a related field, or equivalent practical experience.
  • 3+ years of cloud experience, working collaboratively with development teams, assisting with build and production support, writing deployment automation, setting up CI/CD pipelines, etc.
  • Knowledge of data query and data processing tools (i.e. SQL)
  • 2+ years of solid Configuration Management experience: (For example Chef, Puppet, Salt, Ansible, Cfengine, etc
  • Knowledge of agile project methodologies. Experience with writing user stories in Jira, grooming project backlogs, planning sprints, and working with Scrum or Kanban
  • Solid understanding of networking including TCP/IP stack, basic switching/routing concepts, and load-balancing
  • Experience with CI/CD tools (Jenkins/Jenkins2, Spinnaker, CodePipeline, TeamCity, Bamboo, etc)
  • Experience with big data stores and NoSQL databases (EMR (spark, hive, hadoop) – preferred, Vertica – preferred, Cassandra, Redshift, Kinesis, or AWS Elasticsearch)
  • Experience with software container technology (e.g., Kubernetes, Docker)


Valid Through: 2019-11-12