Cerebras Systems

Generative AI Inference Solutions Architect

Cerebras Systems
$100K — $150K *
US-Anywhere
Remote
Manufacturing & Automotive
5 - 7 years of experience
Full Job Description

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.  

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About The Role

As a solutions architect for the Cerebras Inference platform, you will provide technical guidance in our sales initiatives, showcase the capabilities of our hardware and software solutions, and drive customer engagements. You will work with the fastest inference engine in the world and help our customers understand and realize its potential for both existing and completely new business applications.

We are looking for talented AI Solutions Architects with a blend of deep technical expertise, customer-facing soft skills and sales acumen. The ideal candidate will also bring a broad knowledge of various industries. 

Responsibilities

  • Lead the technical aspects of the sales process
      • Join sales calls to present the technical aspects of the Cerebras Inference solution, address customer questions, and demonstrate our value proposition. Provide in-depth explanations of our product features, focusing on the performance benefits, scalability, and optimizations that our specialized hardware enables.
      • Understand and gather customer requirements.
  • Design, scope and drive demos, trials and PoCs
      • Design demos to showcase the key advantages of our unique product.
      • Scope and drive customer trials and proof-of-concept projects, define success metrics, oversee execution, and ensure a smooth experience and customer satisfaction.
  • Own end-to-end delivery of the solution, provide technical guidance during deployment and post-sales support
      • Work closely with customers to design deployment solutions tailored to their needs.
      • Drive end-to-end delivery of the solution from the technical side.
      • Build and maintain strong customer relationships to become their go-to technical expert.
  • Provide feedback to the internal product and engineering teams
      • Collaborate with internal teams, including R&D and product management, to communicate customer feedback and drive future product improvements.

Requirements 

  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field. 
  • 5+ years in customer-facing engineering roles. 
  • Strong understanding of Generative AI model architecture, inference optimization, enterprise infrastructure and deployment challenges. 
  • Experience with specialized AI accelerators. 
  • Solid programming skills in Python and familiarity with distributed computing. 
  • Exceptional communication skills with the ability to explain complex technical concepts to both technical and non-technical audiences. 
  • Ability to work collaboratively in a fast-paced environment and adapt to changing customer needs. 
  • Ability to manage complex technical projects and deliver solutions tailored to customer needs.  
  • Strong interpersonal and communication skills, effective in collaborative and fast-paced team settings.  

Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

  1. Build a breakthrough AI platform beyond the constraints of the GPU.
  2. Publish and open source their cutting-edge AI research.
  3. Work on one of the fastest AI supercomputers in the world.
  4. Enjoy job stability with startup vitality.
  5. Enjoy a simple, non-corporate work culture that respects individual beliefs.


About Cerebras Systems

Cerebras Systems is a semiconductor company that designs and manufactures a chip for artificial intelligence (AI) applications. The chip, called the Wafer Scale Engine (WSE), is the largest computer chip in the world, measuring 8.5 inches by 8.5 inches. The WSE contains 1.2 trillion transistors and is optimized for deep learning applications. Cerebras Systems' customers include large enterprises and research institutions in the fields of healthcare, finance, and energy.
Size
200 employees

More Jobs at Cerebras Systems

  • Physical Design Engineer
    $230K — $280K *
    Sunnyvale, CA 94087 (Santa Clara County)
    Information Technology
    In-Person
  • Product Manager, Strategic Verticals
    $130K — $180K *
    San Francisco, CA 94112 (San Francisco County)
    Enterprise Technology
    In-Person
  • ASIC Architect
    $150K — $200K *
    Sunnyvale, CA 94087 (Santa Clara County)
    Consumer Technology
    In-Person
  • Design Verification Engineer
    $190K — $230K *
    Sunnyvale, CA 94087 (Santa Clara County)
    Information Technology
    In-Person
  • Mechanical Engineer
    $180K — $200K *
    Sunnyvale, CA 94087 (Santa Clara County)
    Manufacturing & Automotive
    In-Person
