Job Area:Engineering Group, Engineering Group > CPU Engineering
General Summary:We are seeking a
post-silicon CPU RAS engineer focused on
Silent Data Corruption (SDC) / Silent Data Errors (SDE) on
ARM ISA-based CPUs running on silicon platforms. This role centers on reproducing, detecting, and root-causing failures escaping, often observed in customer or field environments. You will work hands-on with
bring-up systems, validation boards, and customer platforms, closely partnering with
Customer Engineering, architecture, RTL, firmware, and post-silicon debug teams to drive root cause and mitigation.
Key Responsibilities- Develop and run C and assembly tests directly on silicon (Bare Metal/OS based) to provoke and detect silent corruption in:
- CPU pipelines, load/store, atomics, coherency, and cache interactions
- Build robust correctness oracles (redundant execution, invariants, checksums) to catch subtle wrong-answer failures.
- Analyze and root-cause customer-reported issues with symptoms such as data corruption, miscompares, or non-deterministic failures without explicit machine checks.
- Work closely with Customer Engineering to:
- Reproduce issues in lab or customer-like environments
- Triage logs, dumps, and limited telemetry
- Provide clear technical root-cause hypotheses and mitigation guidance
- Use post-silicon debug techniques:
- Performance counters, trace, JTAG, register/state dumps, targeted instrumentation
- Reduce issues to minimal repros (often assembly-only) to enable efficient handoff to RTL and architecture teams.
- Integrate high-signal SDC tests into ongoing datacenter stress and regression flows.
Required Qualifications for this Role:- Strong C, Python, ARM assembly programming skills.
- Solid understanding of computer architecture concepts, OOO execution, ARM ISA
- Experience debugging silicon failures.
- Ability to reason across hardware, firmware, OS, and workload behavior.
- Strong analytical and communication skills for customer-impacting issues.
Preferred Qualifications- Experience working directly with customers or field teams on CPU issues.
- Familiarity with RAS and reliability features (detection, containment, escalation).
- Experience with silicon bring-up, validation boards, or datacenter/enterprise CPU deployments.
Minimum Qualifications:• Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, or related field and 4+ years of Hardware Engineering, Software Engineering, Electrical Engineering, Systems Engineering, or related work experience.
OR
Master's degree in Electrical Engineering, Computer Engineering, Computer Science, or related field and 3+ years of Hardware Engineering, Software Engineering, Electrical Engineering, Systems Engineering, or related work experience.
OR
PhD in Electrical Engineering, Computer Engineering, Computer Science, or related field and 2+ years of Hardware Engineering, Software Engineering, Electrical Engineering, Systems Engineering, or related work experience.
Pay range and Other Compensation & Benefits: $167,100.00 - $250,700.00
The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Even more importantly, please note that salary is only one component of total compensation at Qualcomm. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants (employees on sales-incentive plans are not eligible for our annual bonus). In addition, our highly competitive benefits package is designed to support your success at work, at home, and at play. Your recruiter will be happy to discuss all that Qualcomm has to offer - and you can review more details about our US benefits at this link.
If you would like more information about this role, please contact Qualcomm Careers.