Site Reliability Engineer II

Medallia • $103K — $150K *

McLean, VA 22101 • In-Person

Information Technology

Less than 5 years of experience

Today

Job Overview by Ladders

Qualifications

2+ years in Site Reliability Engineering, DevOps, or related roles.
Experience with Kubernetes or similar container platforms in production.
Familiarity with cloud providers like AWS, OCI, or GCP.
Proficient in Linux systems administration and troubleshooting.
Knowledge of scripting languages such as Python, Bash, or Go.
Understanding of CI/CD pipelines and Git workflows.
Familiarity with networking basics like DNS, load balancing, and routing.

Responsibilities

Collaborate with teams to enhance application reliability and scalability.
Operate and support production services on Kubernetes environments.
Troubleshoot infrastructure and application issues across the tech stack.
Automate processes to reduce manual work and improve operational efficiency.
Utilize AI tools to enhance troubleshooting and productivity.
Identify and implement improvements in operational processes through automation.
Create reusable tools and solutions to increase team efficiency.

Benefits

Comprehensive health and wellness benefits, including medical, dental, and vision.
401(k) with company contributions.
Paid parental leave and statutory leaves.
Short-term and long-term disability coverage.
Life and AD&D insurance.
Paid holidays.

Full Job Description

The Role and Team

The Site Reliability Engineering organization at Medallia brings together the infrastructure and applications that power a highly reliable global SaaS platform.

As an SRE II, you will help operate and improve the reliability, scalability, and performance of services running across Kubernetes-based environments in cloud and hybrid infrastructure. You will work closely with software engineering teams to build automation, improve operational excellence, and support production services used globally by Medallia customers.

We are looking for engineers who enjoy solving complex technical problems, automating repetitive tasks, improving system reliability, and learning modern cloud-native technologies in a fast-paced environment.

We value engineers who actively seek opportunities to improve scalability and operational efficiency through automation, AI-assisted engineering workflows, and continuous process improvement.

Please note this role participates in a rotating on-call schedule supporting production systems and services.

Responsibilities

Collaborate with software engineering teams to improve application reliability, scalability, and operational maturity.
Operate and support production services running in Kubernetes environments.
Troubleshoot and resolve infrastructure and application issues across the full technology stack.
Build automation and tooling to reduce operational overhead and eliminate manual work.
Leverage AI-assisted engineering tools and automation platforms to accelerate troubleshooting, improve productivity, and reduce operational toil.
Identify opportunities to streamline operational processes through automation, AI-enabled workflows, and self-service solutions.
Create reusable solutions, tooling, and operational improvements that increase engineering leverage across the team.
Support CI/CD and GitOps-based deployment workflows.
Develop and maintain infrastructure-as-code configurations and operational tooling.
Monitor system health, availability, and performance using observability and alerting platforms.
Participate in incident response, root cause analysis, and operational improvements.
Continuously improve reliability, deployment processes, and operational standards.

Candidates based in the Tysons vicinity will be prioritized, as this role is hybrid with 3 days per week onsite.

Qualifications

Minimum Qualifications

2+ years of experience in Site Reliability Engineering, DevOps, Systems Engineering, Cloud Operations, or related roles.
Demonstrated experience supporting production environments running on Kubernetes or other containerized platforms.
Demonstrated experience with cloud infrastructure platforms such as AWS, OCI, or GCP.
Demonstrated experience with Linux systems administration and troubleshooting.
Demonstrated experience with scripting or programming languages such as Python, Bash, or Go.
Familiarity with CI/CD pipelines and Git-based workflows.
Demonstrated understanding of networking fundamentals including DNS, load balancing, TLS/SSL, and routing concepts.
Demonstrated experience troubleshooting distributed systems and production incidents.
Ability to participate in an on-call rotation supporting production systems.
Fluency in English, both oral and written.

Preferred Qualifications

Experience with GitOps and tools such as ArgoCD.
Experience with infrastructure-as-code tools such as Terraform.
Familiarity with observability platforms such as Prometheus, Grafana, Loki, or OpenTelemetry.
Experience operating services in hybrid-cloud or multi-region environments.
Understanding of release strategies such as rolling deployments, canary releases, or blue/green deployments.
Familiarity with incident management and operational best practices.
Exposure to security and compliance concepts in production environments.
Experience using AI-assisted development, automation, or operational tooling to improve engineering productivity and service reliability.
Demonstrated passion for automation, process improvement, and operational efficiency.
Strong communication and collaboration skills.

Medallia is committed to equal pay and transparency. The annual base salary range for this position is $103,500 - $150,000. Please note that the salary range information provided is a general guideline and combines all of the distinct labor markets within the US. It is uncommon for an individual to be hired at or near the top of the range for their role, and compensation decisions depend on a variety of factors. Medallia considers factors such as (but not limited to) the scope and responsibilities of the position, the candidate's work experience and work location, education and training, key skills, internal peer equity, and external market data, as well as market and business considerations, when making compensation decisions.

Medallia also offers competitive health and wellness benefits, including but not limited to medical, dental, vision, 401(k), short-term and long-term disability, life and AD&D insurance, statutory leaves, paid parental leave, and paid holidays. Benefits and eligibility may vary by location and role.

At Medallia, we celebrate diversity and recognize the value it brings to our customers and employees.

About Medallia

Medallia is a software company that provides customer experience management solutions. The company was founded in 2001 by Borge Hald and Amy Pressman and is headquartered in San Francisco, California. Medallia's software allows businesses to collect and analyze customer feedback across multiple channels, including email, social media, and mobile. The company's clients include some of the world's largest brands, such as Hilton, Delta Air Lines, and Mercedes-Benz. Medallia went public in 2019 and is traded on the New York Stock Exchange under the ticker symbol MDLA.

Size: 2,037 employees
Market Cap: $5.3 billion
Industry: Enterprise Technology
Net Income: -$148.6 million
Founded: 2001
Revenue: $477.2 million
NYSE: MDLA

* Ladders Estimates
