Software Engineer, Data Infrastructure

Cohere

• $90K — $130K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 4+ years of experience in data storage infrastructure
  • Proficient in Python programming language
  • Experience with Kubernetes, especially storage-related features
  • Skilled at transforming unstructured data into efficient datasets
  • Familiar with distributed data processing frameworks like Apache Beam, Spark, or Flink
  • Nice-to-have: Knowledge of analytics tools such as BigQuery, Airflow, or dbt
  • Genuine passion for AI and current research trends
  • Comfortable working in uncharted territory and innovating solutions

Responsibilities

  • Build and maintain high-performance data layers for AI training workloads
  • Address networking and performance challenges in petabyte-scale storage
  • Collaborate with top-tier researchers and engineers on advanced projects

Benefits

  • Inclusive and open work culture
  • Collaboration with a cutting-edge AI research team
  • Weekly lunch stipend and in-office meals
  • Comprehensive health and dental benefits, inclusive of mental health support
  • 100% Parental Leave top-up for up to 6 months
  • Funding for personal enrichment in arts, fitness, and workspace improvement
  • Remote-flexible work options and co-working stipend
  • Generous vacation policy, offering 6 weeks off (30 working days)
Full Job Description
Why this role?

We're building the data infrastructure behind some of the most demanding AI training workloads in the world, and we want sharp, curious people to help us do it. In this role, you'll build and maintain the high-performance data layer our Modeling teams rely on for training and evaluation jobs.

As a Software Engineer, Data Infrastructure, you will:
  • Work directly on petabyte-scale storage infrastructure, and the networking and performance challenges that come with it.
  • Collaborate daily with researchers and engineers who are some of the best in the world at what they do.

You may be a good fit if you have:
  • 4+ years of experience working on data storage infrastructure
  • Strong command of Python
  • Kubernetes experience, especially on the storage side (Persistent Volumes, CSI drivers, etc.)
  • The ability to transform unstructured data into performant datasets across diverse storage backends including S3, GCS, and POSIX
  • Experience with distributed data processing frameworks such as Apache Beam, Spark, or Flink
  • [Nice-to-have] Familiarity with modern analytics tooling such as BigQuery, Airflow, or dbt
  • Genuine excitement about AI. You follow the research, have opinions, and enjoy being in the weeds
  • Comfort operating at the edge of what's known, with a desire to build something genuinely new rather than optimize what already exists

If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!

Full-Time Employees at Cohere enjoy these Perks:

An open and inclusive culture and work environment

Work closely with a team on the cutting edge of AI research

Weekly lunch stipend, in-office lunches & snacks

🦷 Full health and dental benefits, including a separate budget to take care of your mental health

100% Parental Leave top-up for up to 6 months

Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement

🏙 Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend

6 weeks of vacation (30 working days!)

Similar Jobs

More Jobs at Cohere

More Information Technology Jobs

Find similar Software Engineer, Data Infrastructure jobs: