EPAM Systems

Senior Site Reliability Engineer - Data Science - Onsite

EPAM Systems$120K — $150K *
US-AnywhereRemote in Nashville, TN
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 2+ years administering data science platforms like Alteryx, Dataiku, or Azure ML
  • Strong skills in MongoDB performance monitoring and automation
  • Proficiency in DevOps practices with Unix Shell, Python, PowerShell, or similar
  • Ability to analyze complex issues and propose scalable solutions
  • Experience influencing IT and business stakeholders
  • Willingness for on-call and weekend support as needed
  • Optional Unix and/or Windows administration experience

Responsibilities

  • Design and implement robust infrastructure for global data science platforms
  • Provide expert-level support for MongoDB-based data science environments
  • Develop automation to enhance system reliability and performance monitoring
  • Troubleshoot technical incidents as a Problem Manager to minimize disruptions
  • Collaborate on improving infrastructure and modern data initiatives
  • Ensure compliance of platform changes with operational guidelines

Benefits

  • Subsidized Medical, Dental, and Vision Insurance
  • Health Savings and Flexible Spending Accounts
  • Company-provided Short- and Long-Term Disability
  • Life and AD&D Insurance provided by the company
  • Employee Assistance Program for support services
  • Unlimited access to LinkedIn learning solutions
  • Matched 401(k) Retirement Savings Plan
  • Paid Time Off policy
  • Legal Plan and Identity Theft Protection options
  • Accident Insurance available
  • Employee Discounts and Pet Insurance offerings
  • Employee Stock Purchase Program
Full Job Description
Shape the future of global data science infrastructure as a Site Reliability Engineer at EPAM. You'll architect, implement, and support cutting-edge platforms like Alteryx, Dataiku, and Azure Machine Learning that power critical business insights across wealth management, investment banking, and corporate functions.

At EPAM, you'll work on cutting-edge technologies, solve complex challenges, and shape the future of digital innovation. With access to continuous learning, mentorship, and global projects, your expertise will drive meaningful change.

RESPONSIBILITIES
  • Design and implement robust infrastructure solutions to support enterprise-scale data science platforms across multiple global regions
  • Provide expert-level production support for engineering teams and business stakeholders using MongoDB-based data science environments
  • Develop automation frameworks that enhance system reliability, performance monitoring, and incident response capabilities
  • Troubleshoot and resolve complex technical incidents as a Problem Manager, ensuring minimal disruption to business operations
  • Collaborate with cross-functional teams to continuously improve core infrastructure and implement modern data science initiatives
  • Ensure all platform changes and enhancements adhere to operational guidelines and compliance requirements across international jurisdictions

REQUIREMENTS
  • 2+ years of hands-on administrative experience with data science platforms such as Alteryx Server, Dataiku, or Azure Machine Learning
  • Strong MongoDB performance monitoring and optimization skills with focus on automation and reliability
  • Demonstrated proficiency in DevOps practices using Unix Shell, Python, PowerShell scripting, or other programming languages
  • Proven ability to analyze complex problems, design effective solutions, and implement technical improvements at scale
  • Experience influencing IT stakeholders and business partners in enterprise technology environments
  • Willingness to participate in occasional on-call rotation and weekend support for critical activities
  • Unix and/or Windows administration experience (Optional)

WE OFFER
  • Medical, Dental and Vision Insurance (Subsidized)
  • Health Savings Account
  • Flexible Spending Accounts (Healthcare, Dependent Care, Commuter)
  • Short-Term and Long-Term Disability (Company Provided)
  • Life and AD&D Insurance (Company Provided)
  • Employee Assistance Program
  • Unlimited access to LinkedIn learning solutions
  • Matched 401(k) Retirement Savings Plan
  • Paid Time Off
  • Legal Plan and Identity Theft Protection
  • Accident Insurance
  • Employee Discounts
  • Pet Insurance
  • Employee Stock Purchase Program

About EPAM Systems

EPAM Systems, Inc. is a leading global provider of digital platform engineering and development services. The company has a strong presence in North America, Europe, and Asia, and serves clients in a variety of industries, including financial services, healthcare, and retail. EPAM's services include software engineering, product development, and digital platform engineering, and the company has a reputation for delivering high-quality solutions that help its clients achieve their business goals. EPAM has been recognized as a leader in the digital services industry by a number of independent research firms, and the company has won numerous awards for its work.
Learn more about EPAM Systems
Size
58,824 employees
Market Cap
$18.2 billion
Industry
Net Income
$327.1 million
Founded
1993
5 Year Trend
+26.5%
Revenue
$2.6 billion
NASDAQ

Similar Jobs

More Jobs at EPAM Systems

More Enterprise Technology Jobs

Find similar Senior Site Reliability Engineer - Data Science - Onsite jobs: