EPAM Systems

Senior Data Engineer with Java, Apache Spark

EPAM Systems
$120K - $150K *
US-Anywhere
Remote in Georgia, US
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of experience in data engineering or backend platform development
  • Proficiency in Java and/or Scala, with Python as a secondary language
  • Expertise in Apache Spark, including Spark SQL and Spark Streaming
  • Skills in Apache Kafka and event-driven architectures
  • Competency in SQL and data modeling
  • Familiarity with AWS cloud services and serverless computing

Responsibilities

  • Design, develop, and maintain Apache Spark batch pipelines and Kafka streaming solutions
  • Build serverless microservices using AWS Lambda and Azure Functions
  • Develop data ingestion connectors for databases, file systems, and financial data feeds
  • Write production-grade code in Java, Scala, and Python
  • Manage infrastructure using Terraform across multiple environments
  • Containerize and deploy applications using Docker and Kubernetes (EKS/AKS)
  • Implement and maintain CI/CD pipelines in GitLab CI

Benefits

  • Opportunity to work within a technology-driven organization
  • Focus on large-scale data ingestion and analytics solutions
  • Engage with critical downstream analytics and reporting use cases
  • Access to cutting-edge tools and technologies
  • Work in a collaborative environment with exposure to cloud services

Full Job Description

We are seeking a Senior Data Engineer to join a technology-driven organization operating in the financial data domain, building a production-grade analytics and data ingestion platform processing large-scale financial datasets. This role focuses on designing, building and operating large-scale data ingestion and analytics solutions for batch and real-time processing of financial data, supporting critical downstream analytics and reporting use cases.

Responsibilities

  • Design, develop and maintain Apache Spark batch pipelines and Kafka streaming solutions
  • Build serverless microservices using AWS Lambda and Azure Functions
  • Develop data ingestion connectors for databases, file systems, message queues and financial data feeds
  • Write production-grade code in Java, Scala and Python
  • Manage infrastructure using Terraform across multiple environments
  • Containerize and deploy applications using Docker and Kubernetes (EKS/AKS)
  • Implement and maintain CI/CD pipelines in GitLab CI
  • Write and maintain unit and integration tests with high code coverage
  • Monitor platform health and troubleshoot production issues in distributed systems

Requirements

  • 5+ years of experience in data engineering or backend platform development
  • Proficiency in Java and/or Scala as primary languages with Python as secondary
  • Expertise in Apache Spark including Spark SQL and Spark Streaming
  • Skills in Apache Kafka and event-driven architectures
  • Competency in SQL and data modeling
  • Familiarity with AWS cloud services and serverless computing

Nice to have

  • Knowledge of Terraform for Infrastructure as Code
  • Skills in Docker and Kubernetes (EKS/AKS)
  • Familiarity with CI/CD pipelines such as GitLab CI, Jenkins or equivalent
  • Background in cloud data warehouses such as Amazon Redshift
  • Understanding of Microsoft Azure services and Azure-native tooling
  • Knowledge of Iceberg, Hudi or Delta Lake

About EPAM Systems

EPAM Systems, Inc. is a leading global provider of digital platform engineering and development services. The company has a strong presence in North America, Europe, and Asia, and serves clients in a variety of industries, including financial services, healthcare, and retail. EPAM's services include software engineering, product development, and digital platform engineering, and the company has a reputation for delivering high-quality solutions that help its clients achieve their business goals. EPAM has been recognized as a leader in the digital services industry by a number of independent research firms, and the company has won numerous awards for its work.

Size: 58,824 employees
Market Cap: $18.2 billion
Industry
Net Income: $327.1 million
Founded: 1993
5 Year Trend: +26.5%
Revenue: $2.6 billion
NASDAQ
