About the RoleWe're seeking a Senior/Staff Engineer to build and maintain the automation infrastructure that powers the development cycles of our North platform. This engineer will design and implement robust automation systems that enable engineers to efficiently test and validate changes across diverse environments and configurations. This role sits at the intersection of infrastructure and standards. You'll build the systems, frameworks, and culture that allow the rest of engineering to own quality themselves; improving and extending our testing platform by creating the infrastructure that allows engineers to write and execute tests, and enable every engineering team to ship with more confidence.
Key Responsibilities- Design and implement automation pipelines that support comprehensive testing across multiple environments with varying feature flags and realistic customer data profiles
- Create intelligent testing agents that simulate real user behavior to validate different configuration combinations
- Develop and maintain GitHub workflows and actions to automate testing, deployment, and validation processes
- Manage and optimize Helm charts for deployment consistency across environments
- Implement and maintain ArgoCD workflows for continuous deployment and environment management
- Establish best practices for testing methodologies and ensure adoption across engineering teams
- Build scalable infrastructure that supports parallel test execution across diverse configurations
- Develop infrastructure-as-code templates and configurations for reproducible test environments
- Implement containerization strategies for test environments and dependencies
- Create benchmarking frameworks to measure performance and reliability across different configurations
- Monitor and improve test coverage and reliability metrics
- Collaborate with product and engineering teams to understand testing requirements and translate them into automated solutions
- Troubleshoot and resolve complex testing infrastructure issues
Required Qualifications- 5+ years of software engineering experience with a focus on automation and testing infrastructure
- Expert proficiency in Python and TypeScript
- Extensive experience with GitHub workflows and actions
- Deep understanding of testing methodologies and best practices
- Experience building and maintaining CI/CD pipelines
- Containerization experience (Docker, Kubernetes)
- Benchmarking experience and performance testing methodologies
- Cloud platform experience (AWS, GCP, or Azure)
- Background in developer tools or platform engineering
- Ability to design and implement complex automation systems
- Strong problem-solving skills and attention to detail
Preferred Qualifications- Experience working with LLMs in production environments
- Familiarity with infrastructure-as-code principles
- Experience with container orchestration and management
- Knowledge of performance testing tools and frameworks
- Experience with monitoring and observability tools
- Background in test framework development
- Strong working knowledge of Helm charts and ArgoCD
- Infrastructure-as-code experience (Terraform, Pulumi, or similar)
What You'll BuildYou'll enhance our testing platform to allow engineers to:
- Spin up environments with specific feature flag combinations using infrastructure-as-code
- Load test configurations with realistic customer data volumes in containerized environments
- Run comprehensive test suites across multiple environment configurations with automated benchmarking
- Generate detailed performance and reliability reports across different cloud environments
- Automatically detect and flag regressions in diverse usage scenarios
- Scale testing infrastructure to accommodate our growing customer base
This is a critical role for ensuring the reliability and scalability of our North platform as we continue to grow our customer base and expand our feature set. You'll have the opportunity to shape the future of our development infrastructure and make a significant impact on product quality. (edited)
Full-Time Employees at Cohere enjoy these Perks:- A weekly lunch stipend of $75/£75 or equivalent in your local currency for lunch.
- Full health and dental benefits, including a separate budget for mental health.
- RRSP matching, 401K, Pension Scheme.
- 100% Parental Leave top-up for up to 6 months, for either parent.
- Annual enrichment benefits:
Arts & culture, fitness/wellness, quality time, and a workspace improvement credit.
Education & learning stipend for conferences, courses, and coaching.
- 6 weeks of paid vacation (30 working days!)
- Budget for traveling to other offices if you are remote, plus an annual company offsite.
How and Where We Work:- Cohere is remote-friendly. We have offices in Toronto, San Francisco, New York City, London, Paris, Montreal, and more coming soon.
- For those in the office: a daily lunch program, plenty of snacks, and regular community and social events.
- For those not near an office: a co-working benefit so you can work alongside others in your city.
- Everyone receives a $500 home office stipend to set up your workspace properly.