The Discipline Platform Engineering at MoonPay is responsible for providing a resilient, secure, production-ready platform that enables MoonPay to safely deploy applications and services in a self-serve, repeatable manner. We believe that Infrastructure Engineers should support both our product delivery and operational teams by surfacing data from our production environment and driving meaningful change based upon what we learn from it.
Current Tech Stack Tech Stack:
- Google Cloud Platform to host our services
- Postgres as our core database
- Redisfor caching
- DataDog for logging and monitoring
- ArgoCDfor continuous deployment on Kubernetes
- GitHub to manage our source code and continuous integration
- Typescript as our programming language of choice
What you'll do In the
short term we are focused on enhancing our PaaS tooling:
- Improving the maintainability and usability of our Infrastructure-as-code
- Adding self-service functionality for deploying and operating our infrastructure
- Lifecycling and maintenance of our Kubernetes clusters and GCP infrastructure
- Build and maintain dashboards, monitoring & alerting mechanisms with Datadog
In the
medium-to-long term you'll get to:
- Implement new and shiny technologies on top of Kubernetes as you see fit to ensure our tech can scale with the business.
- Develop and integrate solutions with a bias for automation in order to improve and maintain reliability across the production estate and make recovery easier.
- Design and track metrics for site uptime and performance ensuring high levels of visibility are maintained.
- Collaborate closely with all other engineering functions to provide timely feedback from our environments.
- Engage with new business units and acquisitions to understand, support and enhance their infrastructure
- Support Engineering on their journey to deliver better software, faster and more safely (think 4It9s OK to deploy on Fridays4 ).
You should apply if - You have strong systems administration skills, know the difference between a container and a virtual machine, and know your way around a Linux terminal
- You have platform engineering/SRE experience at leading startups or fast growing tech companies
- You have either had experience with some of our tech stack or are confident you can cross train and up skill quickly
- You have experience working in a regulated industry
- You are confident working with and guiding developers on monitoring and logging of complex systems at scale
- You have worked on complex projects
- You reflexively reach for AI agents to assist in researching and solving your problems
- You can work collaboratively with different teams i.e. Security, Data, Engineering
- You want to forge and own MoonPays reliability & recovery processes
- You9ve got at least a basic understanding of complex reliability structures, theories, principles, and best practices
- You have worked with JavaScript codebases and frameworks e.g Typescript, Node.JS and React