Sr. Kernel Optimization Engineer

Cerebras Systems

$130K — $180K *
US-Anywhere
Remote in California, US
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's, Master's, PhD or equivalent in Computer Science, Computer Engineering, Mathematics, or related fields.
  • Understanding of hardware architecture concepts with a willingness to learn new systems.
  • Proficient in C++ and Python programming languages.
  • Knowledge of library and API development best practices.
  • Strong debugging skills in complex software stacks.

Responsibilities

  • Develop design specifications for machine learning and linear algebra kernels for the Cerebras WSE System.
  • Debug and develop optimized assembly and domain-specific language routines targeting Cerebras hardware.
  • Create high-performance kernel routines using low-level assembly and a custom C-like language.
  • Utilize mathematical models to assess performance and guide design choices.
  • Implement unit and system testing methods for kernel libraries.
  • Research ML application trends to adapt the kernel library architecture for evolving neural networks.
  • Collaborate with chip and system architects for instruction set and microarchitecture optimization.

Benefits

  • Opportunity to work with cutting-edge technology in AI and HPC.
  • Collaborative environment with a world-class team of experts.
  • Focus on innovation and pushing boundaries in computational efficiency.
  • Access to continuous learning and skill development opportunities.
Full Job Description
About The Role
As a Kernel Engineer on our team, you will work at the intersection of hardware and software, developing high-performance software for cutting-edge AI and HPC workloads. Your focus will be on implementing, optimizing, and scaling deep learning operations to fully leverage our custom, massively parallel processor architecture.
You will be part of a world-class team responsible for the design, performance tuning, and validation of foundational ML and HPC kernels. This includes building a library of parallel and distributed algorithms that maximize compute utilization and push the boundaries of training efficiency for state-of-the-art AI models. Your work will be critical to unlocking the full potential of our hardware and accelerating the pace of AI innovation.
Responsibilities
  • Develop design specifications for new machine learning and linear algebra kernels and map them to the Cerebras WSE System using various parallel programming algorithms.
  • Develop and debug a kernel library of highly optimized routines in low-level assembly and a custom C-like domain-specific language (CSL), implementing algorithms that target the Cerebras hardware system.
  • Use mathematical models and analysis to measure software performance and inform design decisions.
  • Develop and integrate unit and system testing methodologies to verify correct functionality and performance of kernel libraries.
  • Study emerging trends in machine learning applications and help evolve the kernel library architecture to address the computational challenges of state-of-the-art neural networks.
  • Interact with chip and system architects to optimize instruction sets, microarchitecture, and IO of next-generation systems.

Skills And Qualifications
  • Bachelor's, Master's, PhD or foreign equivalents in Computer Science, Computer Engineering, Mathematics, or related fields.
  • Understanding of hardware architecture concepts - must be comfortable learning the details of a new hardware architecture.
  • Skilled in C++ and Python programming languages.
  • Good knowledge of library and/or API development best practices.
  • Strong debugging skills, including experience debugging complex software stacks.

Preferred Skills And Qualifications
  • Experience in kernel development and/or testing.
  • Familiarity with parallel algorithms and distributed memory systems.
  • Experience in programming accelerators such as GPUs and FPGAs.
  • Familiarity with Machine Learning neural networks and frameworks such as TensorFlow and PyTorch.
  • Familiarity with HPC kernels and their optimization.

More Jobs at Cerebras Systems

  • ASIC Architect
    $150K — $200K *
    Sunnyvale, CA 94087 (Santa Clara County)
    Consumer Technology
    In-Person
  • Design Verification Engineer
    $190K — $230K *
    Sunnyvale, CA 94087 (Santa Clara County)
    Information Technology
    In-Person
  • Mechanical Engineer
    $180K — $200K *
    Sunnyvale, CA 94087 (Santa Clara County)
    Manufacturing & Automotive
    In-Person
  • Senior Accountant
    $100K — $130K *
    Sunnyvale, CA 94087 (Santa Clara County)
    Manufacturing & Automotive
    In-Person
  • Director, Strategic Finance - Corporate FP&A
    $150K — $200K *
    Sunnyvale, CA 94087 (Santa Clara County)
    Finance & Insurance
    In-Person
