Software Engineer, Data Infrastructure & Acquisition - San Mateo, CA, USA

Speechify

$140K — $200K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • BS/MS/PhD in Computer Science or a related field
  • 5+ years of industry experience in software development
  • Proficiency with bash/Python scripting in Linux environments
  • Proficiency in Docker and Infrastructure-as-Code; experience with GCP is a plus
  • Experience with web crawlers and large-scale data processing workflows is a plus
  • Ability to handle multiple tasks and adapt to changing priorities
  • Strong communication skills, both written and verbal.

Responsibilities

  • Identify and source new audio data for the ingestion pipeline
  • Enhance and maintain cloud infrastructure for the ingestion pipeline on GCP using Terraform
  • Collaborate with Scientists to improve data cost, throughput, and quality
  • Develop a dataset roadmap with the AI Team and leadership for future product needs
  • Utilize innovative methods to build petabyte-scale, high-quality datasets
  • Contribute to the alignment of data strategy with Speechify's product vision

Benefits

  • Fast-growing environment with opportunities to shape company and product
  • Supportive entrepreneurial-minded team that values risk and intuition
  • Hands-off management approach for optimal focus and productivity
  • Make a significant impact in a transformative industry
  • Work with a friendly, laid-back culture that embraces asynchronous collaboration
  • Contribute to products that assist individuals with learning differences
  • Engagement in one of the fastest-growing technology sectors, merging AI and audio
Full Job Description
Overview

We're looking to hire for our Data side of our AI team at Speechify. This role is responsible for all aspects of data collection to support our model training operations. We are able to build high-quality datasets at petabyte-scale and low cost through a tight integration of infrastructure, engineering, and research work. We are looking for a skilled Software Engineer to join us.

What You'll Do
  • Be scrappy to find new sources of audio data and bring it into our ingestion pipeline
  • Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
  • Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.
  • Collaborate with others on the AI Team and Speechify Leadership to craft the AI Team's dataset roadmap to power Speechify's next-generation consumer and enterprise products.

An Ideal Candidate Should Have
  • BS/MS/PhD in Computer Science or a related field.
  • 5+ years of industry experience in software development.
  • Proficiency with bash/Python scripting in Linux environments
  • Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP)
  • Experience with web crawlers, large-scale data processing workflows is a plus
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong communication skills, both written and verbal.

What we offer
  • A fast-growing environment where you can help shape the company and product.
  • An entrepreneurial-minded team that supports risk, intuition, and hustle.
  • A hands-off management approach so you can focus and do your best work.
  • An opportunity to make a big impact in a transformative industry.
  • Competitive salaries, a friendly and laid-back atmosphere, and a commitment to building a great asynchronous culture.
  • Opportunity to work on a life-changing product that millions of people use.
  • Build products that directly impact and support people with learning differences like dyslexia, ADD, low vision, concussions, autism, and more.
  • Work in one of the fastest-growing sectors of tech, the intersection of artificial intelligence and audio.

Compensation: The United States base salary range for this full-time position is $140,000-$200,000 + bonus + equity depending on experience

Think you're a good fit for this job?

Tell us more about yourself and why you're interested in the role when you apply.
And don't forget to include links to your portfolio and LinkedIn.

Not looking but know someone who would make a great fit?

Refer them!

Similar Jobs

More Jobs at Speechify

More Information Technology Jobs

Find similar Software Engineer, Data Infrastructure & Acquisition - San Mateo, CA, USA jobs: