UnitedHealth Group

Data Engineer - Remote

UnitedHealth Group$72K — $130K *
US-AnywhereRemote in Minnetonka, MN
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3+ years of experience in data engineering with pipeline and ETL process implementation.
  • 2+ years of SQL experience focusing on query optimization and data manipulation.
  • 1+ year of experience with Azure and Databricks for managing cloud data platforms.
  • Familiarity with streaming data technologies like Spark Streaming, Kafka, or Azure Event Hubs.
  • Effective communication skills to interact with technical and non-technical stakeholders.
  • Proven ability to identify and resolve data issues and implement data quality checks.

Responsibilities

  • Design, develop, and maintain scalable data pipelines for analytics using Azure Data Factory and PySpark.
  • Utilize the Medallion architecture for organized data processing stages, ensuring schema enforcement.
  • Optimize Spark jobs by tuning configurations and managing resources for high performance.
  • Build and maintain ETL processes integrating diverse data sources and handling data anomalies.
  • Implement data quality management with automated validations and alerts for data inconsistencies.
  • Leverage modern pipeline frameworks to enhance productivity in data flow development.
  • Document data engineering workflows and ensure compliance with governance and security policies across the data lifecycle.

Benefits

  • Remote work flexibility for U.S. employees, with office days required for Minneapolis and D.C. areas.
  • Comprehensive benefits package including healthcare and retirement plans.
  • Incentive and recognition programs to reward performance.
  • Equity stock purchase options available.
  • 401(k) contributions to support long-term savings.
Full Job Description
This role is responsible for designing, developing, and maintaining scalable and reliable data pipelines that support both batch and real-time analytics within an Azure-based data platform. The position operates as part of a collaborative data engineering team, working closely with fellow data engineers and data science & reporting partners to meet evolving data requirements. The scope of the role includes end-to-end pipeline development using Azure Data Factory, Databricks, PySpark, and streaming technologies; implementation of the Medallion (Bronze/Silver/Gold) architecture; enforcement of data quality, reliability, and performance standards; and adherence to enterprise data governance, security, and documentation practices across the data lifecycle.

You'll enjoy the flexibility to work remotely * from anywhere within the U.S. as you take on some tough challenges. For all hires in the Minneapolis or Washington, D.C. area, you will be required to work in the office a minimum of four days per week.

Primary Responsibilities:

  • Pipeline Development: Design, develop, and maintain robust pipelines to ingest data from various sources (both streaming and batch) into the analytics environment using using Azure Data Factory and PySpark via Databricks. Set up real-time data ingestion using tools like Spark Structured Streaming and batch ETL jobs for periodic data loads. Ensure these pipelines are scalable, efficient, and fault-tolerant to handle growing data volumes and velocity
  • Implement Data as per Medallion Architecture: Utilize the Medallion (Bronze/Silver/Gold) architecture principles to organize data processing stages. Establish raw data capture (bronze), perform cleansing and transformations (silver), and curate refined datasets for analysis and machine learning (gold). Apply best practices in each layer, such as schema enforcement and checkpointing for streaming data
  • Optimize Spark jobs by tuning configurations, improving query logic, and managing resources to achieve high throughput and low latency. Address bottlenecks in streaming pipelines (e.g., by scaling clusters or tweaking batch intervals) and ensure timely data delivery. Optimize job scheduling and cluster utilization to balance timely data delivery with cost-effectiveness
  • ETL Development & Maintenance: Build and maintain data pipelines with an emphasis on data cleaning steps. Integrate data from various sources (APIs, databases, file feeds, IoT streams, etc.) into the data platform, writing transformations that handle anomalies (e.g., missing or corrupt values) and standardize datasets. Collaborate with the other data engineer to share responsibility across different pipelines or sources, ensuring redundancy and knowledge transfer
  • Data Quality Management: Implement comprehensive data validation rules and checks within pipelines. For example, verify schema correctness, check value ranges for sensor or health data, and ensure referential integrity where applicable. Set up automated alerts or logs that flag inconsistent or bad data, enabling quick intervention. Over time, build a library of data quality tests that run as part of the pipeline (for both streaming and batch processes) to catch issues early
  • Emerging Pipeline Frameworks: Leverage modern pipeline frameworks and tools to improve development productivity. For example, use Databricks Delta Live Tables or Lakehouse pipelines to declaratively define data flows where applicable. Explore the use of Spark Declarative Lakeflow Pipelines or similar technologies to simplify the orchestration of complex data processes
  • Reliability & Collaboration: Implement monitoring and alerting for pipeline health. Investigate and resolve problems such as data delays, pipeline failures, or data inconsistencies. Use logs, error messages, and analytics to identify root causes (e.g., source system changes, bug in transformation logic) and implement fixes. Work closely with the other data engineer and data science team members to understand data requirements and adjust pipelines accordingly. Document data engineering workflows and ensure proper data governance (security, privacy, access controls) is in place
  • Documentation & Governance: Maintain clear documentation of data pipelines, including data source details, transformation logic, and data destination schemas. Ensure that data lineage is tracked so one can trace how data moved and changed through the system. Adhere to data governance policies - for instance, ensure sensitive data is properly masked or encrypted in non-production environments, and that access controls are in place. Work with leadership to periodically review and improve data management practices


You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in.

Required Qualifications:

  • 3+ years of experience in data engineering role designing and implementing data pipelines and ETL processes. Should have understanding on how to handle incremental data loads and maintain history (CDC - change data capture)
  • 2+ years of experience in SQL for data manipulation and query optimization. Knowledge of Python and Apache Spark (using PySpark) for building data pipelines; ability to write efficient code for batch and streaming data transformations
  • 1+ years of experience using Azure, Databricks or an equivalent cloud-based data platform. Comfortable with managing clusters, using notebooks, and working with Delta Lake or Parquet files. Familiarity with cloud data services and tools for pipeline orchestration is expected
  • Experience working in a team environment with agile methodologies. Ability to communicate effectively with both technical peers and non-technical stakeholders (explaining data issues in plain language). Should be comfortable using version control systems and participating in collaborative development (code reviews, pair programming when needed)
  • Familiar with streaming data technologies. This could include Spark Streaming, Kafka, Azure Event Hubs, or similar platforms for real-time data ingestion.
  • Demonstrated ability to detect and correct data issues - for instance, identifying when a data source has stopped updating, or when an upstream change has altered data format. Experience implementing validation checks or using frameworks to enforce data quality standards


Preferred Qualifications:

  • Experience with any declarative pipeline frameworks or data workflow management tools (e.g., Databricks Delta Live Tables). This can indicate readiness to adopt advanced tools in our environment
  • Experience integrating data quality checks into pipelines (such as using assertions or Great Expectations tests) to ensure accuracy and completeness of data. Familiarity with data security practices, encryption, and handling of sensitive data
  • Familiarity with streaming data handling (even if assisting, should understand basics of Spark Streaming or message queue systems) is expected
  • Demonstrated skill in performance tuning for Spark or SQL queries. For example, experience in partitioning strategies, caching, or troubleshooting shuffle issues to optimize heavy data workloads


*All employees working remotely will be required to adhere to UnitedHealth Group's Telecommuter Policy

Pay is based on several factors including but not limited to local labor markets, education, work experience, certifications, etc. In addition to your salary, we offer benefits such as, a comprehensive benefits package, incentive and recognition programs, equity stock purchase and 401k contribution (all benefits are subject to eligibility requirements). No matter where or when you begin a career with us, you'll find a far-reaching choice of benefits and incentives. The salary for this role will range from $72,800 to $130,000 annually based on full-time employment. We comply with all minimum wage laws as applicable.

Application Deadline: This will be posted for a minimum of 2 business days or until a sufficient candidate pool has been collected. Job posting may come down early due to volume of applicants.

About UnitedHealth Group

UnitedHealth Group is a medical facility. They offer health technology, health finance, and pharmacy services. They are utilizing clinical data and intelligence to assist in the redesign, automation, and deployment of technology to streamline administrative operations and clinical decision-making. Payment systems for both consumers and providers are critical components of a health-care system.

UnitedHealth Group Careers

Joining UnitedHealth Group means becoming part of a diverse team dedicated to making a difference. As a leader in health services and innovation, our company offers a variety of job opportunities that allow professionals to leverage their skills, drive innovation, and improve lives. Work You’ll Do At UnitedHealth Group, you’ll contribute to a mission-focused environment where your expertise will influence the health and well-being of people worldwide. Our employment opportunities span across a wide range of disciplines, from healthcare specialists to data analysts, ensuring that your career journey is both dynamic and rewarding. Transform Healthcare with Your Expertise UnitedHealth Group stands at the forefront of health innovation. Our team collaborates to deliver solutions that lead to better patient outcomes. Working with us, you’ll find yourself at the intersection of technology, healthcare, and leadership, providing key insights that drive industry transformation. Join Our Global Team As part of our team, you’ll engage with over 300,000 professionals globally, dedicated to building a diverse and inclusive workplace. UnitedHealth Group is not just a company; it’s a community where you can grow your career through continuous learning and leadership opportunities. Our commitment to diversity training ensures that every team member can thrive. UnitedHealth Group Career Development We are committed to your professional growth. Explore career paths filled with promising job opportunities and internships that will harness your potential and expand your capabilities. Whether you’re a seasoned professional or a recent graduate, you’ll find that our career development programs support your ambition at every level. Innovative Work Environment At UnitedHealth Group, innovation is at the core of our operations. We encourage our team to bring forward-thinking ideas that challenge the status quo and lead to breakthrough improvements in patient care. Be Part of a Great Team Our culture fosters a collaborative and supportive environment where every member’s contribution is valued. Enjoy the benefits of being part of a global team that’s committed to making a difference in people’s lives. Future-Proof Your Career With UnitedHealth Group, your career is future-proof. Dive into a range of positions that offer both challenges and rewards. Our robust support system includes unmatched training, development programs, and certification support to propel your career forward. Stay Connected Join Our Team Discover the right position that matches your skills and interests. We are always hiring and look for passionate, curious, and solution-driven team players. Search UnitedHealth Group jobs today and take the first step towards a fulfilling career. Keep Up to Date Stay informed with career tips, insider perspectives, and industry-leading insights you can use today—all from the people who work here. Read Careers Blog Job Alert Emails Customize your subscription to receive job alerts, the latest news, and insider tips tailored to your preferences. Explore the exciting and rewarding opportunities that await at UnitedHealth Group.
Learn more about UnitedHealth Group
Size
350,000 employees
Market Cap
$493.1 billion
Industry
Net Income
$15.4 billion
Founded
1974
5 Year Trend
+9.2%
Revenue
$257.1 billion
NASDAQ

Similar Jobs

More Jobs at UnitedHealth Group

More Information Technology Jobs

Find similar Data Engineer - Remote jobs: