Customer Support Engineer (GPU Cluster)

Together AI

$160K — $230K *
Technical Services
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3+ years in a customer-facing technical role, 1 year in AI support or mission-critical SaaS API support
  • Strong technical foundation in AI, ML, and GPU technologies
  • Familiarity with Kubernetes, SLURM, Ansible, and HPC environments
  • Experience with compute clusters installation, configuration, and troubleshooting
  • Proficient in problem-solving and proactive issue resolution
  • Ability to collaborate across Sales, Engineering, and Product teams
  • Excellent communication skills for translating complex concepts to non-technical audiences

Responsibilities

  • Engage with customers to resolve complex technical issues related to Kubernetes GPU clusters
  • Serve as a product expert and last line of defense before escalation to Engineering
  • Collaborate with Engineering and Product teams to enhance customer satisfaction
  • Analyze support cases to influence product development and improvements
  • Document system configurations and troubleshooting guides for knowledge sharing
  • Provide flexible support coverage during holidays, nights, and weekends as needed

Benefits

  • Flexible remote work options
  • Health insurance
  • Startup equity opportunities
Full Job Description
About the role

As a Customer Support Engineer at a pioneering AI company, you'll be the first line of defense to support customers as they build out training, fine tuning, and inference solutions with Together AI. You'll dive deep into complex technical challenges, providing swift and effective solutions while serving as a product expert. As a part of the Customer Experience organization, you will collaborate closely with product and sales, driving continuous improvement of our offerings. This is an exciting opportunity for a deeply technical professional passionate about AI and customer success to make a significant impact in a fast-paced, innovative environment.
Responsibilities
  • Engage directly with customers to tackle and resolve complex technical challenges involving our cutting-edge Kubernetes GPU clusters; ensure swift and effective solutions every time.
  • Become a product expert in our GPU Cluster service, serving as the last line of technical defense before issues are escalated to Engineering and Product teams.
  • Collaborate seamlessly across Engineering, Research, and Product teams to address customer concerns; collaborate with senior leaders both internally and externally to ensure the highest levels of customer satisfaction.
  • Transform customer insights into action by identifying patterns in support cases and working with Engineering and Go-To-Market teams to drive Together's roadmap (e.g., future models to support)
  • Maintain detailed documentation of system configurations, procedures, troubleshooting guides, and FAQs to facilitate knowledge sharing with team and customers.
  • Be flexible in providing support coverage during holidays, nights and weekends as required by business needs to ensure consistent and reliable service for our customers.
Requirements
  • 3+ years of experience in a customer-facing technical role with at least 1 year in a support function in AI or supporting a mission-critical API in SaaS
  • Strong technical background, with knowledge of AI, ML, GPU technologies and their integration into high-performance computing (HPC) environments.
  • Familiarity with infrastructure services (e.g., Kubernetes, SLURM), infrastructure as code solutions (e.g., Ansible) high-performance network fabrics, NFS-based storage management, container infrastructure, and scripting and programming languages.
  • Foundational understanding in the installation, configuration, administration, troubleshooting, and securing of compute clusters.
  • Complex technical problem solving and troubleshooting, with a proactive approach to issue resolution
  • Ability to work cross-functionally with teams such as Sales, Engineering, Support, Product and Research to drive customer success.
  • Strong sense of ownership and willingness to learn new skills to ensure both team and customer success.
  • Excellent communication and interpersonal skills, with the ability to explain complex technical concepts to non-technical stakeholders.
  • Ability to operate in dynamic environments, adept at managing multiple projects, and comfortable with frequent context switching and prioritization.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000-230,000K + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Similar Jobs

More Jobs at Together AI

More Technical Services Jobs

Find similar Customer Support Engineer (GPU Cluster) jobs: