Systems Engineer L3 - HPC/HMD

Power3 Solutions

$175K — $250K *
Technical Services
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Active TS/SCI with polygraph required, with the last poly within the last 5 years.
  • Experience with system health monitoring and diagnostics for complex hardware systems.
  • Ability to interpret detailed hardware documentation and schematics.
  • Strong analytical mindset paired with effective troubleshooting skills.
  • Excellent communication skills for engaging stakeholders and vendors.
  • Experience with monitoring and diagnostics tools, especially Splunk, Chiplink, and iostat.
  • Self-motivated with a track record of managing complex technical tasks independently.

Responsibilities

  • Maintain understanding of end-to-end system architecture for diagnostics.
  • Utilize monitoring tools to identify and triage system issues effectively.
  • Develop expertise in monitoring architecture to enhance diagnostic processes.
  • Collaborate with cross-functional teams to improve system reliability and visibility.
  • Analyze system telemetry using Splunk to diagnose root causes and behaviors.
  • Guide the creation of dashboards and scripts for critical monitoring.
  • Track and resolve issues using JIRA, validating fixes through troubleshooting.

Benefits

  • 100% company-paid health, dental, and vision premiums.
  • Automatic company contribution to Health Savings Account (HSA) up to $3,900 for families.
  • Up to 7 weeks of Paid Time Off (PTO).
  • Automatic 401k investment contributions.
  • Paid 11 Federal Holidays.
  • BlueCross BlueShield health insurance.
  • Tuition and training reimbursement opportunities.
  • Access to Ravens season tickets in club level and company-paid golf events.
Full Job Description
We are looking for an experienced High-Performance Computing (HPC) Systems Engineer to support complex system design, integration, monitoring, and diagnostics by applying deep understanding of both physical and logical system architectures.

Position Description
  • Maintain a comprehensive understanding of the system's end-to-end physical and logical architecture to effectively apply hardware modeling and diagnostics (HMD) monitoring tools.
  • Leverage HMD monitoring tools to identify, narrow, and triage system issues, directing detailed problems to the appropriate diagnosticians or vendors for resolution.
  • Develop deep expertise in the HMD product and monitoring architecture to identify gaps, inefficiencies, and opportunities to enhance diagnostic effectiveness.
  • Collaborate with developers, analysts, and monitoring tool owners to propose, design, and implement improvements to monitoring solutions, increasing system reliability, and operational visibility.
  • Analyze system logs, metrics, and telemetry-primarily using Splunk-to determine root causes, understand system behavior, and identify anomalous conditions.
  • Interpret hardware and system performance data, including graphs and trends, to diagnose system behavior and inform troubleshooting activities.
  • Guide the development of Splunk dashboards, health indicators, and diagnostic scripts to monitor critical data flows, system performance, and failure signatures.
  • Review and evaluate relevant technical documentation; ask clarifying questions and build expertise in hardware design to support accurate and timely system diagnosis.
  • Provide recommendations for testing strategies and develop documentation of issue signatures to enable and accelerate diagnostics development.
  • Collaborate closely with diagnosis teams and external vendors to troubleshoot complex hardware and system-level issues.
  • Track issues through resolution using JIRA, validate fixes, and confirm that corrective actions resolve the underlying problems.

Experience
  • Demonstrated experience in one or more of the following technical domains, with a strong willingness and aptitude to expand expertise as required: System Architecture and Design, Power Systems, Printed Circuit Board (PCB) Design, Cooling Infrastructure, Signal Integrity, and System Reliability.
  • Proven experience in system health monitoring, diagnostics, and operational support for complex hardware and integrated systems.
  • Ability to read, interpret, and analyze detailed hardware documentation, including specifications, data sheets, schematics, and design artifacts.
  • Strong analytical and troubleshooting mindset, with a natural curiosity and willingness to ask probing questions to identify root causes and systemic issues.
  • Excellent communication skills, including the ability to engage vendors and internal stakeholders to extract detailed technical information related to system design, performance, and anomalies.
  • Experience with, or demonstrated ability to quickly learn, system monitoring and diagnostics tools, including but not limited to Chiplink, Splunk, Ipsci, and iostat.
  • Self-motivated and capable of completing complex technical tasks with minimal supervision while managing priorities effectively.
  • Ability to clearly convey technical findings, risks, and recommendations to both technical and non-technical audiences through written documentation and oral briefings.
  • Proven ability to work effectively within cross-functional teams, including engineering, diagnostics, operations, and vendor partners.
  • Comfortable learning and adapting to new tools, technologies, and processes as mission and system needs evolve.

Qualifications
  • An active TS/SCI with polygraph is required - Last poly must be within the last 5 years.

Employee Freedom of Choice
Our focus is on people first. We offer comprehensive and flexible compensation packages that match the best the industry has to offer and can be customized to fit your needs.

Our Benefits:
  • 100% company-paid health, dental, and vision premiums
  • Automatic company contributed Health Savings Account (HSA) up to $3,900 for families
  • Up to 7 weeks of Paid Time Off (PTO)
  • Automatic 401k Investment
  • Paid 11 Federal Holidays
  • BlueCross BlueShield Health Insurance
  • Tuition/Training Reimbursement
  • Access to Ravens season tickets in club level
  • Company-paid golf events for your time and course fees

The projected compensation range for this position is $175K - $250K. There are differentiating factors that can impact a final salary/hourly rate, including, but not limited to, Contract Wage Determination, relevant work experience, skills and competencies that align to the specified role, geographic location, education and certifications as well as Federal Government Contract Labor categories. In addition, we invest in our employees beyond just compensation. Our benefit offerings include, dependent upon position, Health Insurance, Dental and Vision Coverage, Health Savings Account (HSA), Paid Time Off, Holiday Pay, Short Term and Long Term Disability, Life Insurance, 401(k) Plan, Safe Harbor 401k Investment, Learning and Development opportunities, Referral Bonuses and Flex Time.

Power3 Solutions
Partnering with federal, state, and local organizations to bring the best talent to the right roles.

https://power3.com/
[email protected]
https://www.linkedin.com/company/power3-solutions

Similar Jobs

More Jobs at Power3 Solutions

  • Database Engineer (Oracle)
    $90K — $130K *
    Fort George G Meade, MD 20755 (Anne Arundel County)
    Technical Services
    In-Person
  • Principal Software/Data Engineer
    $120K — $150K *
    Laurel, MD 20707 (Prince Georges County)
    Aerospace & Defense
    In-Person
  • DevOps Engineer L4
    $200K — $250K *
    Annapolis Junction, MD 20701 (Howard County)
    Information Technology
    In-Person
  • Application Engineer 4 (Kovr.AI)
    $100K — $130K *
    Linthicum Heights, MD 21090 (Anne Arundel County)
    Aerospace & Defense
    In-Person
  • Agile Developer L1
    $125K — $175K *
    Annapolis Junction, MD 20701 (Howard County)
    Information Technology
    In-Person

More Technical Services Jobs

Find similar Systems Engineer L3 - HPC/HMD jobs: