Software Engineer, Data Infrastructure

Cohere

• $90K — $130K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 4+ years of experience in data storage infrastructure
  • Proficient in Python programming
  • Experience with Kubernetes, particularly for storage management
  • Skilled at transforming unstructured data into usable datasets
  • Familiarity with distributed data processing frameworks like Apache Beam, Spark, or Flink
  • [Nice-to-have] Knowledge of analytics tools such as BigQuery, Airflow, or dbt
  • Passionate about AI with an active interest in recent research and trends
  • Comfortable working in innovative, undefined territories

Responsibilities

  • Build and maintain petabyte-scale data storage infrastructure
  • Address networking and performance challenges associated with large-scale data setups
  • Collaborate closely with top researchers and engineers in AI
  • Develop efficient data solutions for training and evaluation jobs
  • Transform unstructured data into high-performance datasets across various storage backends

Benefits

  • Open and inclusive workplace culture
  • Collaboration with a leading AI research team
  • Weekly lunch stipend plus in-office meals and snacks
  • Comprehensive health and dental benefits including mental health support
  • Generous parental leave policy with a 6-month top-up
  • Personal enrichment benefits for wellness and cultural activities
  • Remote-flexible work environment with co-working stipend
  • Six weeks of vacation time, totaling 30 working days
Full Job Description
Why this role?

We're building the data infrastructure behind some of the most demanding AI training workloads in the world, and we want sharp, curious people to help us do it. In this role, you'll build and maintain the high-performance data layer our Modeling teams rely on for training and evaluation jobs.

As a Software Engineer, Data Infrastructure, you will:
  • Work directly on petabyte-scale storage infrastructure, and the networking and performance challenges that come with it.
  • Collaborate daily with researchers and engineers who are some of the best in the world at what they do.

You may be a good fit if you have:
  • 4+ years of experience working on data storage infrastructure
  • Strong command of Python
  • Kubernetes experience, especially on the storage side (Persistent Volumes, CSI drivers, etc.)
  • The ability to transform unstructured data into performant datasets across diverse storage backends including S3, GCS, and POSIX
  • Experience with distributed data processing frameworks such as Apache Beam, Spark, or Flink
  • [Nice-to-have] Familiarity with modern analytics tooling such as BigQuery, Airflow, or dbt
  • Genuine excitement about AI. You follow the research, have opinions, and enjoy being in the weeds
  • Comfort operating at the edge of what's known, with a desire to build something genuinely new rather than optimize what already exists

If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!

Full-Time Employees at Cohere enjoy these Perks:

An open and inclusive culture and work environment

Work closely with a team on the cutting edge of AI research

Weekly lunch stipend, in-office lunches & snacks

🦷 Full health and dental benefits, including a separate budget to take care of your mental health

100% Parental Leave top-up for up to 6 months

Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement

🏙 Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend

6 weeks of vacation (30 working days!)

Similar Jobs

More Jobs at Cohere

More Information Technology Jobs

Find similar Software Engineer, Data Infrastructure jobs: