Work you'll do/Responsibilities
* Lead design and development of domain and integration APIs (e.g., RESTful APIs (Representational State Transfer), GraphQL, event-driven patterns where applicable).
* Produce and maintain API contracts (e.g., OpenAPI (OpenAPI Specification)/Swagger), versioning strategy, and consumer onboarding documentation.
* Implement security patterns: OAuth 2.0 (Open Authorization), OpenID Connect (OIDC), JSON Web Tokens (JWT), mutual Transport Layer Security (mTLS) where required, secrets management, and authorization controls.
* Build reliability and performance features: idempotency, retries/timeouts, rate limiting, caching, circuit breakers, and backward compatibility.
* Establish observability: structured logs, metrics, tracing, dashboards, and service level objectives (SLOs) aligned to business outcomes.
* Drive CI/CD (continuous integration/continuous delivery), automated testing (unit/integration/contract), and code quality practices.
* Administer Databricks workspaces: cluster policies, pools, job/workflow standards, workspace configuration, and operational runbooks.
* Own identity, access, and governance: role-based access control (RBAC), group/role management, Unity Catalog (or equivalent) permissions, auditing, and least-privilege design.
* Manage platform reliability: monitoring/alerting, incident triage, root cause analysis (RCA), and problem management to reduce recurring issues.
* Optimize performance and cost: cluster sizing/autoscaling standards, job tuning guidance, workload isolation, usage reporting, and chargeback/showback inputs if applicable.
* Support data platform hygiene: library/runtime management, upgrade planning, operational readiness for quarterly releases, and coordination with cloud/platform teams.
* Partner with data engineering to enforce best practices for Delta Lake (e.g., partitioning, file sizing, Z-ORDER where applicable) and operational quality gates.
* Communicate regularly with Engagement Managers (Directors), project team members, and representatives from various functional and / or technical teams, including escalating any matters that require additional attention and consideration from engagement management
* Independently and collaboratively lead client engagement workstreams focused on improvement, optimization, and transformation of processes including implementing leading practice workflows, addressing deficits in quality, and driving operational outcomes
Qualifications
Required
* 7+ years of software engineering experience with significant API/service development ownership.
* 3+ years administering or operating Databricks in production (workspace governance, access controls, cluster/job policies, and operational support).
* Strong experience with one backend stack (e.g., Java/Spring, .NET, Node.js, Python) and microservices/service-based architecture.
* Solid understanding of cloud security fundamentals (identity, networking basics, secrets, encryption) and production operations practices (incident/problem/change).
* Proven ability to lead technical design, mentor others, and communicate clearly with technical and non-technical stakeholders
* Bachelor's degree, preferably in Computer Science, Information Technology, Computer Engineering, or related IT discipline; or equivalent experience
* Limited immigration sponsorship may be available
* Ability to travel 10%, on average, based on the work you do and the clients and industries/sectors you serve
* Hybrid Role - 2 days on-site