Solutions Architect - US

FuriosaAI

$120K — $160K *
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 2-5 years in a customer-facing technical role related to AI infrastructure or semiconductors.
  • Current knowledge of AI/LLM developments including model releases and frameworks.
  • Hands-on familiarity with modern inference stacks such as vLLM and Triton Inference Server.
  • Experience with orchestration frameworks like LangChain and AutoGen.
  • Proficiency in Python and familiarity with DNN frameworks like TensorFlow.
  • Excellent communication skills for engaging technical and executive audiences.
  • Authorization to work in the US and ability to travel periodically.

Responsibilities

  • Lead technical enablement for US clients deploying AI models on RNGD NPU.
  • Develop proof of concepts and benchmarking studies in customer settings.
  • Act as technical advisor to sales during pre-sales and enterprise evaluations.
  • Showcase expertise in FuriosaAI products at technical forums and conferences.
  • Train customers on integration and optimization best practices after purchase.
  • Provide technical feedback from US clients to product teams in Seoul.

Benefits

  • Opportunity to work on cutting-edge AI technology solutions.
  • Engage with a diverse range of customers from startups to enterprises.
  • Access to training and development for enhancing technical expertise.
  • Work within a forward-thinking company focused on sustainable AI solutions.
Full Job Description
About the Job

FuriosaAI is looking for a Solutions Architect to bring the full potential of our powerful RNGD chips/servers to our customers by acting as the primary technical authority in AI/LLM model deployments. From running POCs to benchmarking and debugging, you will translate RNGD's powerful system to real-world deployments of customers' models, empowering customers with FuriosaAI's powerful solutions.

If you are interested in providing the technical expertise in challenging the current status-quo of AI infrastructure in real-world environments, join us in our path to a sustainable future of AI.

What You'll Do
  • Own end-to-end technical enablement for US customers deploying AI models on FuriosaAI's RNGD NPU using the Furiosa SDK
  • Develop POCs, benchmarking studies, and live debugging sessions directly in customer environments
  • Act as the technical authority to the US BD/Sales team during pre-sales and enterprise evaluations; translate deep technical capability into business value for engineering and C-suite audiences
  • Develop deep, current expertise in FuriosaAI's hardware and software stack and demonstrate it at US technical forums, AI conferences, and customer workshops
  • Onboard and train customers on integration patterns, optimization workflows, and best practices post-purchase
  • Serve as a technical feedback loop from US customers back to Seoul HQ product and engineering teams


Qualifications
  • 2-5 years in a US customer-facing technical role: Solutions Architect, Sales Engineer, Forward Deployed Engineer, or equivalent at an AI infra, cloud, or semiconductor company
  • Actively current on the AI/LLM landscape - tracking model releases, inference frameworks, and serving stack evolution in real time
  • Hands-on experience with modern inference stacks: vLLM, SGLang, TensorRT-LLM, Triton Inference Server, or similar
  • Hands-on experience with agent and orchestration frameworks: LangChain, LlamaIndex, LangGraph, AutoGen, or MCP-based tooling
  • Proficiency in Python; comfortable with DNN frameworks (PyTorch, TensorFlow)
  • Strong written and verbal communication - able to engage credibly with ML engineers at frontier labs and VP/C-suite executives
  • Authorized to work in the US; able to travel to customer sites and to Seoul HQ periodically


Preferred Qualifications
  • Prior experience at a US AI chip company, cloud silicon team, or AI infrastructure startup
  • Familiarity with NPU/GPU accelerator ecosystems, PCIe integration, and data center hardware deployment
  • Experience with inference optimization: quantization, kernel tuning, batching strategies, memory bandwidth optimization
  • Proficiency in C, C++, or Rust
  • Experience working with distributed or cross-timezone engineering teams


Contact

Similar Jobs

More Jobs at FuriosaAI

More Enterprise Technology Jobs

Find similar Solutions Architect - US jobs: