General Dynamics

Site Reliability Engineer

General Dynamics$142K — $158K *
US-AnywhereRemote in United States
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's or Master's in Software Engineering, Computer Science, or related field
  • 5-8 years of experience in relevant positions
  • Experience in production SRE or DevOps environments
  • Proficient with monitoring tools (Prometheus, Grafana, etc.)
  • Strong scripting capabilities in Python or Bash
  • Familiar with containers (Docker, Kubernetes) and cloud services
  • U.S. citizenship and Department of Defense Secret security clearance required

Responsibilities

  • Define service level objectives (SLOs) and drive engineering decisions based on error budgets
  • Build and maintain monitoring, logging, and alerting infrastructures for AI services
  • Establish and lead incident management procedures for defect response
  • Conduct operational readiness reviews for AI services before they go live
  • Monitor resource consumption and forecast capacity needs for the platform
  • Automate repetitive operational tasks to eliminate toil
  • Collaborate with teams to ensure AI services are operable before deployment

Benefits

  • 100% remote work flexibility
  • 9/80 work schedule for improved work-life balance
  • Opportunity to shape and define SRE practices from the ground up
  • No previous defense industry experience required
  • Direct impact on the operational readiness of AI services
Full Job Description
Basic Qualifications
Bachelor's degree in Software Engineering, or related Science, Technology, Engineering or Mathematics field, plus a minimum of 8 years of relevant experience; or Master's degree, plus 6 years relevant experience. Responsibilities for this Position
What You'll Own
  • SLOs and reliability metrics. Define service level objectives for every AI service that goes to production. Establish error budgets and use them to drive engineering decisions — not just measure uptime.
  • Monitoring and observability. Build and maintain monitoring, logging, and alerting infrastructure for AI services. You will know when something is degrading before users do.
  • Incident response. Establish incident management procedures, lead post-incident reviews, and drive corrective actions. When something breaks, you coordinate the response and ensure it doesn't break the same way again.
  • Operational readiness reviews. Before any AI service goes live, you validate that it meets reliability, security, and operational standards. You are the gate between "it works in dev" and "it's ready for production."
  • Capacity planning and cost monitoring. Track resource consumption, forecast capacity needs, and monitor costs — tokens, compute, storage. You ensure the platform scales without surprises.
  • Toil elimination. Identify and automate repetitive operational tasks. If a human is doing something a script could do, you fix that.
What You Won't Own
  • Application development or AI model building — you ensure what they build is operable, you don't build it
  • Infrastructure provisioning — IT provides the infrastructure; you define what's needed and validate it works
  • Business process decisions or backlog prioritization
What Makes This Role Different
  • AI services have failure modes that traditional applications don't — model drift, token budget exhaustion, prompt injection, upstream data quality degradation. You will build monitoring for problems that most SRE teams have never encountered.
  • You are applying SRE principles from scratch. There is no existing SRE practice to inherit — you will define it for the platform.
  • Your operational readiness reviews directly determine whether AI services go live. You have real authority to say "not ready."
Required Qualifications
  • Bachelor’s degree in Computer Science, Software Engineering, or a related field, plus 5 years of experience; or Master’s degree plus 3 years of experience
  • Production SRE or DevOps experience — you have owned the reliability of systems that real users depended on, not just built CI/CD pipelines
  • Hands-on experience with monitoring and observability tools — Prometheus, Grafana, Datadog, ELK, CloudWatch, or similar. You have built dashboards and alerts that caught real problems.
  • Strong scripting and automation skills — Python, Bash, infrastructure-as-code (Terraform, CloudFormation, or similar)
  • Experience with containerized environments — Docker, Kubernetes, container orchestration at scale
  • Experience defining and managing SLOs, error budgets, and incident response procedures in production
  • S. citizenship required. Department of Defense Secret security clearance is required at time of hire.
Preferred Qualifications
  • Experience with AI/ML production systems — model serving, inference monitoring, token cost tracking, or similar
  • Multi-cloud experience (AWS, Azure, GCP) including cloud-native monitoring and logging services
  • Experience building operational readiness review processes or production launch checklists
  • Familiarity with Google SRE principles — you have read the book and applied the concepts, not just referenced them in interviews
  • Experience in environments where reliability has compliance or safety implications — defense, healthcare, finance, or critical infrastructure
What Sets You Apart
  • You think about failure before you think about features. Your first question about any new system is "how does this break?"
  • You automate yourself out of toil. If you're doing the same thing twice, you write a script.
  • You have said "not ready" to a team that wanted to ship, and you were right.
  • You build monitoring that tells you what's wrong, not just that something is wrong.
  • You write post-incident reviews that actually change how systems are built, not just how incidents are documented.
Details
  • Remote — 100% telework
  • 9/80 schedule
  • Defense industry experience is not required
Salary NoteThis estimate represents the typical salary range for this position based on experience and other factors (geographic location, etc.). Actual pay may vary. This job posting will remain open until the position is filled. Combined Salary RangeUSD $142,696.00 - USD $158,303.00 /Yr.

About General Dynamics

General Dynamics is involved in business aviation; land and expeditionary combat vehicles and systems, armaments, and munitions; shipbuilding and marine systems; and mission-critical information systems and technologies. General Dynamics has four main business segments. Aerospace designs, develops, manufacturers and services a comprehensive offering of advanced business-jet aircraft. Combat Systems specializes in producing, supporting and sustaining land and expeditionary combat systems for the U.S. military and its allies. Marine Systems designs, builds and supports submarines and a variety of surface ships for the U.S. Navy and commercial customers. The Information Systems and Technology group offers a breadth and depth of technology and service capabilities that support a wide range of government and commercial needs, including systems integration expertise; hardware and software products; and engineering, management and support services.

General Dynamics Careers

Join the dynamic team at General Dynamics, a leader in global defense, aerospace, and technology services. As a pivotal player in innovation and security, General Dynamics offers unparalleled job opportunities for professionals eager to advance their careers in a cutting-edge environment.

Work You’ll Do

At General Dynamics, we empower our employees to drive innovation and lead with integrity and excellence. With a commitment to professional growth and diversity, our team is at the forefront of developing solutions that make a difference in the world. Whether in engineering, cybersecurity, or project management, your work at General Dynamics will contribute to missions that matter.

Explore Career Paths

General Dynamics is not just a company; it's a community where you can build a career that aligns with your passion and skills. From internships to leadership positions, the breadth of opportunities supports your professional journey at every stage.

Innovate and Lead

Join a team where innovation is the status quo. At General Dynamics, you’ll work alongside industry experts to solve complex challenges with cutting-edge technology. Our leadership is committed to fostering a culture of growth and learning, where you can lead projects that explore new frontiers.

Professional Development

Invest in your future with General Dynamics’ robust professional development programs. Enhance your skills through targeted training, workshops, and seminars that propel your expertise and leadership capabilities forward.

Diversity and Inclusion

At General Dynamics, we believe diversity drives innovation. Our inclusive culture welcomes diverse perspectives and ideas, fostering an environment where all employees can thrive. Through diversity training and networking opportunities, we ensure every team member can contribute uniquely and significantly.

Benefits and Culture

Experience a culture that prioritizes the well-being and satisfaction of its employees. General Dynamics offers competitive benefits, including health care, retirement plans, and flexible working arrangements, to ensure a healthy work-life balance. Our team-oriented culture encourages collaboration and mutual respect, making General Dynamics a great place to work.

Join Our Team

Ready to advance your career at General Dynamics? Explore our current job opportunities and find a position that matches your skills and ambitions. We are continuously hiring talented individuals who are passionate about making a difference.

Prepare for Your Interview

Make a great first impression. Prepare for your interview at General Dynamics by researching our company values and recent projects. Tailor your resume to highlight relevant experience and skills that align with the job description.

Stay Connected

Keep up to date with the latest from General Dynamics: - **Career Growth Tips:** Gain insights from our professionals on advancing your career. - **Networking Events:** Connect with peers and leaders in your field through our networking events. - **Job Alerts:** Customize your job alerts to receive updates on new openings that fit your career preferences.

Explore General Dynamics Jobs

Discover the impact you can make at General Dynamics by joining our team. Search open positions, apply online, and take the first step towards a rewarding career filled with opportunities for growth and innovation.

SEARCH GENERAL DYNAMICS JOBS

Join General Dynamics today and be part of a team that values leadership, innovation, and diversity. See what exciting and rewarding opportunities await you in a career at General Dynamics.
Learn more about General Dynamics
Size
103,100 employees
Market Cap
$67.9 billion
Industry
Net Income
$3.1 billion
Founded
1952
5 Year Trend
+4.7%
Revenue
$37.9 billion
NASDAQ

Similar Jobs

More Jobs at General Dynamics

More Information Technology Jobs

Find similar Site Reliability Engineer jobs: