The RoleWe are looking for an engineer to design, implement, and optimize custom ML kernels that bolster our model development stack. Your work will be deep in the system, combining hardware and software insights to optimize performance. Some example areas you might work on (not limited to):
- Design and implement custom performant ML kernels that work at scale
- Identify inefficiencies and optimize compute-intensive workloads to reduce memory bandwidth bottlenecks and improve hardware utilization
- Enable and validate low-precision arithmetic formats and contribute to related compiler or runtime stacks
If you're excited about working at the intersection of hardware and frontier AI research, we'd love to hear from you.
We offer a base salary of $350,000-$500,000 USD and a meaningful equity grant, depending on experience and background, along with competitive benefits.