AI/ML SRE Lead in New York, NY

$150K - $200K(Ladders Estimates)

J.P. Morgan Chase & Co   •  

New York, NY 10001

Industry: Finance & Insurance

  •  

8 - 10 years

Posted 62 days ago

This job is no longer available.

As an experienced Software Engineer, your mission is to help lead our team of innovators and technologists toward creating next-level solutions that improve the way our business is run. Your deep knowledge of design, analytics, development, coding, testing and application programming will help your team raise their game, meeting your standards, as well as satisfying both business and functional requirements. Your expertise in various technology domains will be counted on to set strategic direction and solve complex and mission critical problems, internally and externally. Your quest to embracing leading-edge technologies and methodologies inspires your team to follow suit. And best of all, you'll be able to harness massive amounts of brainpower through our global network of technologists from around the world.


Manage a team of software engineers focused on improving and promoting the availability, stability and performance of our infrastructure, systems and applications.

  • Leads the design, analysis, development, support and/or delivery of AI/ML products and services
  • Cultivates trust through personal and team relationships with senior management and key stakeholders inclusive of MD's and responsible for periodic reporting, KPI reporting
  • Troubleshoots priority incidents, conducts post-mortems and ensures permanent closure of the incidents
  • Engages with development team throughout the life cycle to help develop software for reliability
  • Designs and conducts the performance tests, identifies the bottlenecks, opportunities for optimization and the capacity demand
  • Contributes to the definition of the strategic roadmap and its execution; inclusive of R&D of emerging industry trends
  • Applies analytics on the past data like incidents and usage patterns for predicting issues and takes proactive actions
  • Defines and drives adoption of a best in class monitoring frameworks to accomplish end to end flow monitoring and noiseless alerting
  • Deploys the software and product upgrades
  • Facilitates maximum speed of delivery by objectively binding to error budgets of the service
  • Manages the effort split between manual operational work and engineering work
  • Be part of the 24x7 support coverage as needed
  • Articulate complex AI/ML and data science problems and comfortable presenting solutions to Senior Management in business language while driving resolution
  • Embrace & promote cultural embodiment of group and firm


  • BS or MS degree or equivalent experience in computer science
  • A minimum of 8 years of hands-on leadership of high-performing, agile-based engineering teams
  • 6+ years of experience architecting integrated stack solutions (storage, network, compute) within an enterprise scale production environment
  • 6+ years of experience in performance engineering and monitoring using tools such as AppDynamics, Splunk, Apica, Jmeter and Blaze meter etc.
  • Experience in Anaconda, Jupyter, open source framework.
  • Experienced in at least one programming language, preferably python
  • Cloud computing: Google Cloud, Amazon Web Service, Azure, Docker, Kubernetes.
  • Experience working in an Agile Development environment
  • Experience in setting CI/CD pipeline.
  • Proven ability to understand and troubleshoot complex problems under pressure
  • Familiarity with AWS ML/Sagemaker, Azure ML, Google AI would be preferred.
  • 8+ years of incident resolution experience in an large scale operations environment
  • Experience in big data technologies.

Valid Through: 2019-9-16