Amazon

SoC Platform Software Engineering Manager, Annapurna Labs Machine Learning Acceleration, AWS

Amazon$212K — $287K *
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3+ years of engineering team management experience
  • 7+ years of professional software development in C or C++
  • 4+ years of experience in software system design or architecture
  • Experience with hardware interface software across execution environments
  • Proficient in designing APIs or abstraction layers for engineering use

Responsibilities

  • Manage and mentor a team of 6 engineers, setting the technical direction
  • Own the platform abstraction layer for a unified C++ codebase across environments
  • Shape API contracts balancing stability and evolution for chip generations
  • Drive the architecture for C++ metaprogramming frameworks and test infrastructures
  • Build and maintain CI/CD and validation strategies for integration issues
  • Coordinate across multiple teams to ensure HAL readiness for new chips
  • Engage directly in technical tasks such as debugging and code reviews

Benefits

  • Health insurance (medical, dental, vision, prescription)
  • 401(k) matching
  • Paid time off and parental leave
  • Flexible Spending Accounts (FSA)
  • Adoption and Surrogacy Reimbursement
Full Job Description
One C++ codebase. Three radically different execution environments. We're looking for an engineering manager who thinks in terms of platforms, abstractions, and portable software architecture - and can lead a team that ships all three.

Our SoC HAL (Hardware Abstraction Layer) team builds the platform software layer for AWS's custom Trainium and Inferentia ML accelerator chips. The HAL is a shared library that boots, configures, and manages every hardware block on the SoC - 270+ instances per chip - and the same source tree compiles and runs on SystemVerilog DPI for chip verification, QEMU for system emulation, and Carbon OS in microcontrollers within the AWS production fleet. Your platform abstractions are what make this possible, and your APIs are the interface that 100's of engineers across verification, emulation, and production use to interact with the chip.

Tech stack: C++17, CMake, GoogleTest, Python, SystemVerilog DPI, SPI, APB/AXI bus protocols, PCIe, UCIe, HBM, PLL, custom IPs

As the SoC Platform Software Manager, you will:

- Manage, coach, and grow a team of 6 engineers - set technical direction, own hiring, and create an environment where strong engineers want to stay

- Own the platform abstraction layer that enables one C++ codebase to compile and run correctly across three target environments with fundamentally different runtime characteristics

- Shape the external API contracts that verification, emulation, and production teams build on - balancing stability for consumers against the need to evolve as new chip generations arrive

- Drive the architecture of our C++ template metaprogramming framework that generates type-safe register interfaces for every hardware block, and our BUTR (Built-in Unit Test for Registers) and HITL (Hardware-in-the-Loop) test infrastructure

- Build and maintain the CI/CD and validation strategy that catches integration issues across all three platforms before they reach customers

- Coordinate across chip architects, RTL designers, verification engineers, validation engineers, and platform software teams - you're the single point of accountability for HAL readiness on every new chip program

- Get into the weeds alongside your team - debug register-level HW/SW interactions, review code, and write code yourself when it matters

Most platform software teams target one OS or one hardware family. We target three execution environments from a single source tree - and our software must be stateless, survive live-updates on running production servers without reboots, and be correct down to individual register bits. A single abstraction leak can break chip verification, stall emulation, or misconfigure millions of servers in AWS's global fleet.

The HAL runs on an external microcontroller running embedded Linux, reaching into the chip over SPI and PCIe. It's stateless by design: the microcontroller can reboot at any time - including during customer workloads - and the HAL must resume managing the SoC by querying hardware state on-demand. Your platform layer is what makes this resilience possible while keeping the complexity invisible to consumers.

The same codebase that runs in pre-silicon simulation months before tape-out is the codebase that runs in production fleet. When the chip comes back from the fab, your team validates that pre-silicon models match real hardware behavior. For Trainium3, our HAL enabled a full ML training workload within 12 hours of first power-on: https://www.aboutamazon.com/news/aws/trainium-3-ultraserver-faster-ai-training-lower-cost

No ML background needed. Your platform software is the foundation that enables ML training across clusters of thousands of interconnected accelerators - you'll work on components like PCIe and HBM, but won't need to understand ML itself.

This role can be based in Cupertino, CA or Austin, TX. The team is split between the two sites.

BASIC QUALIFICATIONS

- 3+ years of engineering team management experience

- 7+ years of professional software development in C or C++, including systems, platform, or infrastructure software

- 4+ years of designing or architecting software systems (platform abstractions, API design, multi-target build systems)

- Experience developing software that interfaces with hardware or runs across multiple execution environments

- Experience designing APIs or abstraction layers consumed by other engineering teams

PREFERRED QUALIFICATIONS

- Experience in recruiting, hiring, mentoring/coaching and managing teams of Software Engineers to improve their skills, and make them more effective, product software engineers

- Experience building or maintaining hardware abstraction layers, board support packages, or platform software for SoC, ASIC, or embedded systems

- Experience with multi-platform or cross-compilation build systems (targeting simulation, emulation, and production from a single source tree)

- Familiarity with bus protocols (APB, AXI, PCIe) or memory subsystems (HBM, DDR)

- Experience with C++ template metaprogramming or code generation frameworks

- Experience with pre-silicon software development (simulation, emulation, or virtual platforms)

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, CA, Cupertino - 212,700.00 - 287,700.00 USD annually

USA, TX, Austin - 184,900.00 - 250,200.00 USD annually

About Amazon

Audible is a provider of spoken audio information and entertainment , on the Internet. They provide premium spoken audio content, such as audio versions of books and newspapers and radio programs, that is delivered over the Internet and played back on personal computers and hand-held electronic devices. The Audible service allows consumers to purchase and download their content from their Website, store it in digital files and play it back on personal computers and electronic devices. More than 15,000 hours of audio content are available on their Web site, including audio versions of books, periodicals and radio programs. Several manufacturers have agreed to support and promote the playback of their content on their hand-held audio-enabled electronic devices.

Amazon Careers

Joining Amazon presents an unparalleled opportunity to become part of a vibrant team pushing the boundaries of innovation and growth in the global marketplace. As a leader in e-commerce, technology, and logistics, Amazon offers a variety of job opportunities that cater to a range of skills and professional interests. Work You’ll Do At Amazon, every day is an opportunity to collaborate with the brightest minds in technology and business to redefine what’s possible. Whether you’re interested in software development, marketing, human resources, or customer service, Amazon has a position waiting for you. Transform the way the world shops and innovates with our diverse and inclusive team. Amazon is not just a company; it’s a community where you can drive real change and contribute to projects impacting millions globally. Lead with Innovation and Leadership Amazon is the perfect place to enhance your leadership and innovation skills. Our culture encourages pushing the envelope and imagining the unimaginable. Here, you will lead projects that challenge the status quo and define new industry standards. Work with a team that values diversity and is committed to creating an inclusive environment. Our leadership is focused on harnessing the collective power of unique perspectives to foster growth and innovation. Explore Amazon’s Employment Benefits Amazon’s commitment to its employees extends beyond just career growth. We offer competitive benefits, including health care, parental leave, and diversity training, ensuring that our team not only excels professionally but also enjoys well-being and security. Internship and Networking Opportunities Start your career with an Amazon internship and gain hands-on experience that matters. Our internships provide a gateway to full-time employment and an opportunity to network with professionals across various sectors of the company. Future-Proof Your Career With Amazon, your career path is filled with numerous opportunities for advancement. Our learning and development programs are designed to nurture your professional growth and keep you at the forefront of industry trends. Stay Connected Join Our Team Discover the job opportunities at Amazon that match your skills and interests. We are constantly on the lookout for passionate, curious, and innovative team players ready to make a difference. Keep Up to Date Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today—all from the people who work here. Job Alert Emails Customize your subscription to receive job alerts, the latest news, and insider tips tailored to your preferences. Explore the exciting and rewarding career opportunities that await at Amazon. Amazon is more than just a company—it’s a platform for building a promising future. Whether you’re starting or looking to advance your career, Amazon offers the resources, support, and network you need to succeed. Join us, and be a part of our continuing mission to be Earth's most customer-centric company.
Learn more about Amazon
Size
1,608 employees
Market Cap
$832.6 billion
Industry
Net Income
$21.3 billion
Founded
1994
5 Year Trend
+28.1%
Revenue
$386 billion
NASDAQ

Similar Jobs

More Jobs at Amazon

More Enterprise Technology Jobs

Find similar SoC Platform Software Engineering Manager, Annapurna Labs Machine Learning Acceleration, AWS jobs: