Observability Platform Engineering Technical Lead

Fidelity Investments

$120K — $150K *
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years of experience in software engineering or a related field.
  • Strong understanding of distributed systems principles and architecture.
  • Proven experience with AWS native services and building production solutions.
  • Expertise in observability, particularly with metrics, tracing, and logging.
  • Hands-on experience with CI/CD and coding in languages like Python, Java, Go, or Node.js.
  • Solid networking knowledge, particularly in security and VPC design.
  • Experience in developing real-time data streaming pipelines.

Responsibilities

  • Design and implement scalable and fault-tolerant distributed systems.
  • Lead the architecture and development of high-throughput streaming data solutions.
  • Develop observability practices including metrics, logging, and alerting.
  • Implement application instrumentation and collaborate on standardization of telemetry.
  • Utilize AWS services to create robust and efficient architectures.
  • Design secure network architectures and enforce firewall rules.
  • Write production-quality code and manage CI/CD pipelines.
  • Mentor engineers and enhance team practices around system reliability.

Benefits

  • Collaborative work environment with cross-functional teams.
  • Opportunities for mentorship and skill enhancement.
  • Access to advanced technologies and platforms related to observability.
  • Involvement in significant project leadership and execution.
Full Job Description

Job Description:

Note: Fidelity will not provide immigration sponsorship for this position.

The Role

We are seeking a highly experienced, hands-on technical leader and platform builder. You will report to the engineering lead and will be responsible for delivering core platform capabilities within Fidelity’s Enterprise Observability Platform. You’ll own system architecture, observability, security, and delivery while collaborating closely with product owners and engineering teams.

What you’ll do (responsibilities):

  • Design and implement highly scalable, fault-tolerant distributed systems and services that operate at large scale and low latency.
  • Lead the architecture and build of high-throughput data streaming pipelines (real-time/event streaming), including ingestion, processing, and durable storage.
  • Develop and own observability for systems: metrics, tracing, structured logging, dashboards, alerts and SLOs.
  • Implement application instrumentation and collaborate with platform teams to standardize telemetry and monitoring practices.
  • Use AWS native services (MSK, Lambda, ECS/EKS, EC2, S3, DynamoDB, RDS, IAM, CloudWatch, etc.) to deliver robust solutions.
  • Design and enforce secure networking and firewall architectures (VPCs, subnets, security groups, NACLs, load balancers, private endpoints).
  • Write production-quality code and tests in at least one major language (Python, Java, Go, or Node.js). Own CI/CD pipelines and release automation.
  • Drive projects end-to-end: translate product requirements, create execution plans, identify risks, and coordinate across product, security, infra, and operations teams.
  • Mentor engineers, run design reviews, and improve team practices around reliability, scalability, and operability.

The Expertise and Skills You Bring:

  • Strong understanding of distributed systems fundamentals: consensus, partitioning, replication, consistency models, leader election, backpressure, and fault tolerance.
  • Demonstrated experience designing and operating highly available, highly scalable production systems.
  • Deep knowledge of observability: metrics, tracing, logging formats specially OpenTelemetry , alerting, and SLO/SLI design.
  • Experience implementing application instrumentation libraries and sidecars; familiarity with sampling, tagging, and context propagation.
  • Solid networking knowledge: TCP/IP, load balancing, NAT, DNS, VPC design, security groups, firewalls, and TLS.
  • Proven experience building solutions using AWS native services (design patterns and tradeoffs).
  • Experience designing and building real-time/high-speed streaming pipelines capable of processing large volumes of data; familiarity with Kafka, Kinesis, Flink, Spark Streaming, or similar.
  • Hands-on coding: able to implement, debug, and ship production code; strong test discipline and experience with CI/CD.
  • Experience with building containerized applications using Docker and container orchestration (Kubernetes/EKS).
  • Excellent written and verbal communication; able to drive projects with product managers and stakeholders.

Nice-to-have

  • Experience with observability platforms like Grafana, Prometheus, Datadog, or Splunk.
  • Background in performance tuning, JVM internals, or low-latency systems.
  • Experience with infrastructure-as-code (Terraform, CloudFormation).
  • Experience building multi-region or global systems.


Certifications:

Category:

Information Technology

Similar Jobs

More Jobs at Fidelity Investments

More Enterprise Technology Jobs

Find similar Observability Platform Engineering Technical Lead jobs: