The Hartford Financial Services Group, Inc

Principal Reliability Engineer - EDS

US-Anywhere
+ 2 other locationsRemote
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 10+ years of experience in data, cloud, platform engineering, or site/reliability engineering.
  • Proficiency with cloud platforms like AWS and GCP, including architectural resilience and security.
  • Deep knowledge of platforms such as Snowflake, EMR, and Hadoop/Spark.
  • Scripting experience in Python for automation and reliability frameworks.
  • Familiarity with Infrastructure-as-Code tools like Terraform or CloudFormation.

Responsibilities

  • Define and lead the Reliability Engineering strategy for data platforms and cloud environments.
  • Establish long-term RE roadmaps and architectural patterns to support organizational growth.
  • Serve as the highest technical escalation point for systemic reliability issues.
  • Architect cost-efficient and high-performing cloud platforms across AWS and GCP.
  • Develop AI-driven automation for anomaly detection and predictive capacity management.
  • Implement enterprise-wide observability frameworks for data platforms and pipelines.
  • Lead the definition of RE best practices for modern data products and operational analytics.

Benefits

  • Hybrid work schedule with office presence required 3 days a week.
  • Opportunities for leadership mentoring and influence in engineering culture.
  • Access to work on advanced AI-driven operations and observability tools.
  • Engagement in cross-organizational technical initiatives and strategic vision setting.
Full Job Description
Principal Reliability Engineering - IE06JE

The Enterprise Data Services (EDS) organization is seeking a Principal Reliability Engineer (Principal RE) to serve as the senior technical authority responsible for the reliability, resilience, availability, and performance of all data platforms, cloud infrastructure, data products, and data pipelines across the enterprise data organization. This role sets the strategic vision for Reliability Engineering within EDS and leads the definition, implementation, and continuous evolution of RE practices, tooling, automation, observability frameworks, and AIOps/AI-driven operations.

As the Principal RE, you will influence architectural direction, lead large-scale, cross-organizational technical initiatives, and drive a culture of engineering excellence, automation-first operations, and proactive reliability improvement. You will partner closely with platform engineering, data engineering, security, architecture, and product teams to embed RE principles into every stage of the data product lifecycle.

This role will have a Hybrid work schedule, with the expectation of working in an office (Columbus, OH, Chicago, IL, Hartford, CT or Charlotte, NC) 3 days a week (Tuesday through Thursday).

Key Responsibilities

Enterprise Reliability Strategy & Leadership
  • Work closely with the AVP, RE & Production Support, EDS defining the Reliability Engineering strategy for data platforms, data cloud environments, and data products.
  • Establish long-term RE roadmaps, target operating models, and architectural patterns that scale with organizational growth.
  • Serve as the highest-level technical escalation point for systemic reliability issues, influencing executive stakeholders and engineering leaders.


Platform & Cloud Reliability (AWS, GCP, Snowflake, EMR, Hadoop, ETL/ELT)
  • Leverage Enterprise provided standards and building blocks to Architect and evolve highly reliable, performant, and cost-efficient cloud-based platforms across AWS and GCP for all EDS services.
  • Influence and work directly with Platform Solution Architecture on new product enablement, hyper automation (end to end blueprint automation).
  • Oversee reliability controls and fail-safe patterns for Snowflake, EMR, Hadoop/Spark clusters, container platforms (e.g., Kubernetes), and mission-critical data systems.
  • Lead the creation and enforcement of SLO/SLI frameworks that span the entire data lifecycle.


AI-Enabled Operations, AIOps & Intelligent Automation
  • Develop and implement AI-driven automation for anomaly detection, alert correlation, autonomous remediation, and predictive capacity management.
  • Leverage LLMs, prompt engineering, and cloud-native AI services (AWS Bedrock, SageMaker, Vertex AI) to build intelligent runbooks, advanced troubleshooting agents, and generative-AI-enabled operational tooling.
  • Champion the adoption of machine learning-based observability and reliability analytics.


End-to-End Observability & Operational Excellence
  • Adopt and architect enterprise-wide data observability frameworks-including logging, metrics, tracing, distributed profiling, and event pipelines-for all data platforms and pipelines.
  • Establish gold-standard incident response patterns, post-incident reviews, and continuous improvement processes.
  • Drive elimination of toil across EDS, focusing on self-healing systems, proactive detection, and autonomous operations.


Data Pipeline & Data Product Reliability
  • Define RE best practices for modern data products, governed data pipelines, real-time/streaming systems, and operational analytics platforms.
  • Ensure data quality, data timeliness, and SLAs for data products through automated checks, lineage-informed alerting, and pipeline reliability tooling.
  • Partner with Data Engineering to embed resilience patterns (idempotency, checkpointing, replayability, disaster recovery) into pipeline architectures.


Engineering Standards, Governance & Cross-Org Influence
  • Set and enforce standards for IaC, CI/CD, platform automation, reliability frameworks, operational readiness, and runbook quality across EDS.
  • Provide technical leadership and mentorship to Staff/Senior Engineers in the RE team and Production Support teams, influencing engineering culture and helping grow RE capabilities across the organization.
  • Represent Reliability Engineering in architectural reviews, enterprise governance forums, and executive-level discussions.


Technical Experience
  • 10+ years in one or more of the following areas: data, cloud, platform engineering, site/reliability engineering, or large-scale distributed systems, with experience in leadership or technology leader roles.
  • Proficiency with data or cloud platforms, including architectural patterns for resilience, networking, security, and distributed data infrastructure.
  • Deep experience supporting or engineering platforms such as Snowflake, EMR, Hadoop/Spark, Data Integration, and cloud-native data ecosystems.
  • Scripting and programming (preferably Python) for large-scale automation, platform tooling, and reliability frameworks.
  • Experience with Infrastructure-as-Code (Terraform, CloudFormation) and enterprise CI/CD.


Preferred Qualifications

  • Experience in regulated or highly complex enterprise environments (financial services, insurance, healthcare).
  • Prior experience as a Senior Staff Engineer, Engineering or Architecture leader with hands on experience, or similar senior technical role.
  • Knowledge of data governance, metadata, lineage systems, and data quality engineering practices.
  • Certifications in AWS, GCP, Kubernetes, or SRE/DevOps frameworks.


AI & AIOps
  • Background applying machine learning to operations-anomaly detection, event correlation, predictive modeling, and automated remediation.
  • Understand of AI-enabled developer/operations tools using LLMs, prompt engineering, or cloud AI services for reliability improvements.


Observability & Platform Operations
  • Expertise with enterprise observability stacks (Prometheus, Grafana, Datadog, Splunk, Dynatrace, OpenTelemetry).
  • Ability to design and enforce advanced SLI/SLO frameworks across complex data ecosystems.


Leadership & Cross-Functional Influence
  • Demonstrated ability to lead technical strategy at scale, influence senior engineering leaders, and set enterprise-wide standards.
  • Strong capability in mentoring engineers, providing architectural guidance, and fostering engineering excellence.
  • Exceptional communication skills for interacting with executives, senior architects, product leaders, and engineering teams.


Candidate must be authorized to work in the US without company sponsorship. The company will not support the STEM OPT I-983 Training Plan endorsement for this position.

Compensation

The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford's total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is:

$152,800 - $229,200

About The Hartford Financial Services Group, Inc

The Hartford is an industry leading provider of property and casualty insurance, group benefits and mutual funds. Throughout our rich history of more than 200 years, countless businesses and individuals, including Robert E. Lee, Abraham Lincoln and Babe Ruth, have turned to our company for protection. The Hartford celebrated its 200th anniversary in 2010.

The Hartford Financial Services Group, Inc. Careers

Join the esteemed team at The Hartford Financial Services Group, Inc., a leader in investment and insurance, where we offer more than just job opportunities—we provide a platform for professional growth and innovation. As one of the most respected names in the financial services industry, The Hartford is dedicated to fostering a culture of diversity, leadership, and continuous development.

Work You’ll Do

Embark on a career with The Hartford and contribute to our mission of helping customers achieve amazing financial outcomes. You will have the chance to work alongside a team of experts who are not only skilled in their fields but are also passionate about making a difference.

Transform Your Career

At The Hartford, we believe in nurturing talent through comprehensive training programs and robust career development opportunities. Our commitment to professional growth is evident in our dynamic leadership and diversity training programs that prepare you for the future.

Innovate with Us

Innovation is at the heart of everything we do at The Hartford. Join us and bring your unique perspective to help shape the future of financial services. Our collaborative environment encourages creativity and is the perfect place to advance your skills in groundbreaking ways.

Be Part of a Great Team

The Hartford is not just a company; it's a community. We pride ourselves on a workplace culture that upholds the values of inclusivity and teamwork. By joining us, you’ll work on diverse teams that value your insights and encourage networking and mutual support.

Future-Proof Your Career

With a wide range of job opportunities, from internships to full-time positions, The Hartford offers a path for everyone. Whether you’re just starting out or looking to take your career to the next level, we provide the tools and support needed to succeed. Our benefits package is designed to ensure that our team members are well taken care of, not only at work but in all aspects of life.

Explore Job Opportunities and Internships

Whether you're polishing your resume, preparing for an interview, or seeking to enhance your employment experience, The Hartford has a position to match your skills and ambitions. We are continuously hiring and looking for new talent to join our thriving team.

Stay Connected

Join Our Team Search open positions that match your skills and interest at The Hartford. We look for passionate, curious, creative, and solution-driven team players.

Keep Up to Date

Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today—all from the people who work here.

Job Alert Emails

Personalize your subscription to receive job alerts, latest news, and insider tips tailored to your preferences. Discover the exciting and rewarding career opportunities that await at The Hartford Financial Services Group, Inc. Join us at The Hartford—where careers thrive and futures are made.
Learn more about The Hartford Financial Services Group, Inc
Size
18,500 employees
Market Cap
$24.1 billion
Industry
Net Income
$1.7 billion
Founded
1810
5 Year Trend
+6.5%
Revenue
$20.5 billion
NASDAQ

Similar Jobs

More Jobs at The Hartford Financial Services Group, Inc

More Information Technology Jobs

Find similar Principal Reliability Engineer - EDS jobs: