Senior Firmware Engineer, Edge AI / NPU Runtime

Tacit

• $150K — $200K *

San Francisco, CA 94112In-Person

Information Technology

5 - 7 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of experience in embedded firmware, embedded systems, or edge ML systems.
Strong C/C++/Rust experience on resource-constrained embedded platforms.
Experience with RTOS-based systems such as FreeRTOS, Zephyr, or ThreadX.
Experience deploying or optimizing ML inference on embedded targets like NPUs, DSPs, or edge SoCs.
Strong understanding of realtime embedded systems including low-latency data movement.
Hands-on debugging and bring-up experience across embedded hardware and firmware systems.

Responsibilities

Own deployment of ML models onto embedded targets using NPUs, DSPs, and other hardware accelerators.
Build realtime sensor-to-inference pipelines including acquisition and preprocessing.
Optimize inference latency, memory footprint, and power consumption on production hardware.
Support streaming inference patterns such as continuous sensor processing.
Maintain inference quality and timing guarantees in real-world scenarios.
Debug firmware across sensors and production hardware to ensure reliability.
Translate product goals into actionable performance targets.

Benefits

Competitive equity package
Comprehensive medical, dental, and vision insurance
Unlimited PTO
Visa sponsorship
3% 401k matching

Full Job Description

About the role

We're looking for a Senior Firmware Engineer, Edge AI / NPU Runtime to help architect, optimize, and ship next-generation neurotech hardware with production-grade on-device intelligence. You will own critical parts of the embedded AI stack, from realtime sensor acquisition through preprocessing, NPU/DSP-accelerated inference, postprocessing, telemetry, and product deployment.

This is a hands-on role for someone who wants to work close to the hardware while shaping the intelligence users experience in the product. You'll help define how models run on-device, how sensor data moves through the system, and how we meet tight latency, reliability, and power budgets in real-world use.
What you'll do

Edge AI & NPU Inference
- Own deployment of ML models onto embedded targets using NPUs, DSPs, MCUs, or other hardware accelerators.
- Integrate embedded inference runtimes, vendor NPU/DSP SDKs, and model deployment workflows into production firmware.
- Optimize inference latency, memory footprint, throughput, power consumption, and accelerator utilization on production hardware.
- Partner with ML teams on quantization, operator support, model architecture tradeoffs, calibration datasets, and accuracy/performance regressions.
Realtime Sensor-to-Inference Systems
- Build realtime sensor-to-inference pipelines, including acquisition, timestamping, synchronization, preprocessing, feature extraction, model execution, and postprocessing.
- Design low-latency data movement using DMA, interrupts, ring buffers, deterministic scheduling, and efficient memory layouts.
- Support streaming inference patterns such as sliding windows, temporal models, event-driven execution, and continuous sensor processing.
- Maintain inference quality and timing guarantees under real-world conditions such as sensor noise, clock drift, dropped samples, variable system load, and power-state transitions.
Power-Optimized Embedded Firmware
- Optimize end-to-end energy per inference across sensing, preprocessing, model execution, postprocessing, and idle time.
- Use low-power firmware techniques such as sleep states, duty cycling, subsystem power gating, clock scaling, batching/windowing, and dynamic power management.
- Profile and improve power consumption across sensors, CPU, NPU/DSP, memory, and supporting firmware infrastructure.
Product Quality & Debugging
- Bring up and debug firmware across sensors, accelerators, power systems, embedded compute, and production hardware.
- Use lab tools, traces, logs, telemetry, and instrumentation to root-cause complex embedded system issues.
- Translate product and customer experience goals into concrete latency, reliability, responsiveness, and power targets.
- Build diagnostics, validation hooks, and performance benchmarks to ensure reliable real-world edge inference behavior.

Requirements

5+ years of experience in embedded firmware, embedded systems, or edge ML systems.
Strong C/C++/Rust experience on resource-constrained embedded platforms.
Experience with RTOS-based systems such as FreeRTOS, Zephyr, ThreadX, or similar.
Experience deploying or optimizing ML inference on embedded targets, NPUs, DSPs, MCUs, or edge SoCs.
Strong understanding of realtime embedded systems, including DMA, interrupts, concurrency, memory management, and low-latency data movement.
Experience optimizing embedded systems for latency, memory footprint, throughput, and power consumption.
Hands-on debugging and bring-up experience across embedded hardware and firmware systems, with strong cross-functional communication across firmware, ML, electrical, software, and product teams.

Strong candidates may have

Experience with embedded inference runtimes, deployment toolchains, or edge AI SoCs/accelerators such as TensorFlow Lite Micro, ONNX Runtime, CMSIS-NN, Qualcomm QNN/SNPE, ARM Ethos-U/Vela, TVM, ExecuTorch, Qualcomm, ARM, Cadence/Tensilica, Syntiant, Ambiq, Nordic, NXP, ST, Hailo, Google Edge TPU, or similar.
Experience with quantized inference, fixed-point math, SIMD/DSP optimization, accelerator programming, or model conversion workflows.
Experience with streaming or time-series ML workloads such as biosignals, sensor fusion, audio, gesture recognition, keyword spotting, or other realtime inference systems.
Experience shipping battery-powered consumer electronics, wearable, neurotech, AR/VR, robotics, camera, IoT, or other embedded AI products.

Compensation Range

$150,000 - $200,000/year
Benefits

Competitive equity package
Comprehensive medical, dental, and vision insurance
Company size: 20-30 people
Unlimited PTO
Visa sponsorship
3% 401k matching

* Ladders Estimates

Similar Jobs

Sr. Firmware Engineer
$120K — $160K *
Advanced Micro Devices, Inc
San Jose, CA 95123 (Santa Clara County)
Today
ASIC Design Verification Engineering Technical Leader
$183K — $263K *
Cisco
San Jose, CA 95123 (Santa Clara County)
Reposted Yesterday
Software Dev Engineer III - AMZ9890895
$193K — $261K *
Amazon
Cupertino, CA 95014 (Santa Clara County)
Reposted 2 days ago
Senior WLAN/WIFI Networking/Software Engineer
$136K — $276K *
Hewlett Packard Enterprise Development LP
San Jose, CA 95123 (Santa Clara County)
Reposted 2 days ago
Senior/Lead Software Engineer - (Network Systems)
$140K — $210K *
Arista Networks
Santa Clara, CA 95051 (Santa Clara County)
2 days ago
Senior Wireless Software Engineer - MAC Layer - TeraWave
$197K — $276K *
Blue Origin
San Francisco, CA 94112 (San Francisco County)
2 days ago

Get Ready For Your
Next Interview

More Jobs at Tacit

Senior Firmware Engineer, Edge AI / NPU Runtime
$150K — $200K *
San Francisco, CA 94112 (San Francisco County)
Today
Information Technology
In-Person
Computational Neuroscientist
$160K — $180K *
San Francisco, CA 94112 (San Francisco County)
3 days ago
Healthcare
In-Person
Patent Agent
$180K — $200K *
San Francisco, CA 94112 (San Francisco County)
Reposted 1 week ago
Legal & Accounting
In-Person
Hardware Technical Program Manager (TPM)
$160K — $180K *
San Francisco, CA 94112 (San Francisco County)
Reposted 1 week ago
Consumer Technology
In-Person
Head of Hardware
$210K — $260K *
San Francisco, CA 94112 (San Francisco County)
Reposted 1 week ago
Consumer Technology
In-Person

More Information Technology Jobs

Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
6 days ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
6 days ago
Développeur Python (H/F)
$80K — $110K *
Extia
Montreal, QC H1A 0A1
Reposted Today
Information Security Threat Management Specialist
$95K — $144K *
Bank of America Corporation
Denver, CO 80219 (Denver County)
Today
Senior IT & Security Engineer
$100K — $130K *
MirrorWeb
Austin, TX 78745 (Travis County)
Today

Find similar Senior Firmware Engineer, Edge AI / NPU Runtime jobs:

Nationwide San Francisco, CA

Senior Firmware Engineer, Edge AI / NPU Runtime

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior Firmware Engineer, Edge AI / NPU Runtime jobs:

Get Ready For Your
Next Interview