Coupa Software

Lead Data Platform Engineer - 11623

Coupa Software$125K — $174K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's Degree with 8+ years in Big Data and Data Pipelines management.
  • Strong Linux administration and troubleshooting skills.
  • 5+ years of cloud platform management experience, primarily AWS and Azure.
  • Familiarity with generative AI technologies and MLOps.
  • Experience with container orchestration tools such as ECS, EKS, or AKS.
  • Proficient in Infrastructure as Code (IaC) tools like Terraform and Chef.
  • Excellent communication skills with a proactive problem-solving approach.

Responsibilities

  • Manage end-to-end data pipelines (ETL jobs) within agreed SLAs.
  • Oversee AWS core and big data services including S3 and Redshift.
  • Run applications in containerized environments (ECS, Docker).
  • Lead operational life cycles for Machine Learning infrastructures, ensuring reliability and performance.
  • Automate platform maintenance and standardize infrastructure configurations.
  • Administer applications on Linux operating systems for improved observability.
  • Support ETL jobs and participate in on-call rotations for emergencies.

Benefits

  • Opportunity to work with cutting-edge technology in AI and big data environments.
  • Remote work flexibility allowing for better work-life balance.
  • Collaborative team culture focused on continuous learning and professional growth.
  • Involvement in designing and implementing innovative AI-driven features.
Full Job Description
The Impact of a Lead Data Platform Engineer at Coupa:

If you are passionate about new technologies, have a strong technical background and you are looking for an environment where you can continuously expand your knowledge, you are the right fit for this role. At Coupa, the "Cloud team" is looking for an engineer who is ready to constantly question the status quo with a mixture of system design, code development, deployment, automation, networking, and experience in managing big data/ Machine Learning/GenAI platforms.

What You'll Do:

  • Manage end-to-end Data pipeline (ETL jobs) within agreed SLAs.
  • Manage AWS core and big data services (S3, IAM, EMR, Redshift, etc..)
  • Running applications in containers (ECS, Docker)
  • Lead Day 2 operational lifecycle for ML and GenAI infrastructure. This includes designing, deploying, and maintaining high-availability production LLM serving platforms, implementing automated scaling, self-healing, and infrastructure-as-code patterns. Focus on proactive reliability, model performance observability, and continuous cost optimization for high-compute AI workloads.
  • Collaborate closely with our product development and engineering teams to create AI-driven features
  • Drive cloud operations consistency by automating platform maintenance, standardizing infrastructure configurations (IaC), and implementing robust release management processes to minimize drift across multi-cloud environments.
  • Manage AWS infrastructure using code (Terraform, Chef, etc..)
  • Administering applications running in Linux operating system.
  • Enable application and system monitoring for better observability.
  • Application and infrastructure support for ETL jobs and data pipelines including participating in an on-call rotation for after-hours emergencies.
  • Collaborate with platform and Dev teams to plan and deploy product releases and patch Linux/ECS clusters.
  • Ability to participate in design reviews, code reviews, and troubleshooting incidents.
  • Ability to operate in a high-pressure environment and troubleshoot complex issues quickly while successfully handling multiple priorities.
  • Ability to record, write, and review RCAs.


What You Will Bring to Coupa:

  • Bachelor's Degree and at least 8+ years of experience managing Big Data technologies and Data Pipelines.
  • Sound knowledge and experience in Linux administration and troubleshooting.
  • 5+ years of experience in managing cloud infrastructure and platforms, such as AWS and Azure
  • Familiar with the current engineering landscape in the generative AI space and have a strong interest in AI and related technologies.
  • Strong expertise in MLOps and production-grade LLM operations. Proven track record in managing high-availability model inference clusters, automating model lifecycle management, and implementing advanced observability (latency, throughput, and error rate monitoring) specifically for AI workloads.
  • Have Bash or Python scripting experience
  • Experience with containerization, Amazon ECS, EKS/ Azure AKS
  • Experience with tools like Chef, Ansible, Jenkins, Rundeck, or equivalent
  • Experience with source control systems such as Git and operating in complex branching strategies
  • Experience with Infrastructure as Code products like Terraform, helm charts
  • Good understanding of DNS and Load balancers setup and troubleshooting
  • Experience in Big Data platforms/Data lakes and managing Business Intelligence tools (like looker..)
  • Knowledge in ApacheSpark architecture and troubleshooting Java applications.
  • Basic understanding of MySQL Server and general database knowledge
  • Excellent written and verbal communication with a passion for solving the problem
  • Confidence in your ability to own and deliver projects and issues to resolution on your own & can think and act globally
  • Deep experience in Day 2 cloud operations, including automated incident remediation, capacity planning, and managing large-scale production cloud environments with a focus on performance and reliability.


$125,000 - $174,333 a year

The successful candidate's starting salary will be determined based on permissible, non-discriminatory factors such as skills, experience, and geographic location within the state.

#LI-Remote

#LI-TC1

About Coupa Software

Coupa Software is a cloud-based business spend management platform that helps companies control their spending and generate savings. The company's platform provides procurement, invoicing, and expense management solutions that enable businesses to manage their spending across various categories, including travel and expense, contingent workforce, and marketing. Coupa Software's platform is designed to be user-friendly and accessible from any device, making it easy for employees to manage their spending and for businesses to track their expenses. The company serves customers in a variety of industries, including healthcare, retail, and manufacturing.
Learn more about Coupa Software
Size
3,076 employees
Market Cap
$5.9 billion
Industry
Net Income
-$142.7 million
Founded
2006
5 Year Trend
+40.2%
Revenue
$489.5 million
NASDAQ

Similar Jobs

More Jobs at Coupa Software

More Information Technology Jobs

Find similar Lead Data Platform Engineer - 11623 jobs: