3+ years of experience in data science, analytics, TPM, or FinOps roles focused on cloud infrastructure cost analysis, capacity planning, or efficiency optimization.
Proven experience building spend forecasting models and large-scale cost attribution systems.
In-depth knowledge of cloud billing systems and cost allocation methodologies.
Strong expertise in Python, SQL, and data visualization tools.
Exceptional problem-solving skills with the ability to communicate effectively, especially under ambiguous circumstances.
Responsibilities
Develop scalable cost attribution models for infrastructure spend across teams and products.
Create and maintain forecasting models for infrastructure demand to support capacity planning.
Deliver detailed reporting on infrastructure capacity and costs, analyzing forecast accuracy and variance.
Define and operationalize unit cost metrics while driving efficiency initiatives to reduce costs.
Collaborate with various cross-functional teams to analyze usage patterns and ensure cost-efficient scaling of services.
Establish efficiency standards and implement continuous improvement practices for capacity planning and resource utilization.
Build self-service dashboards to enhance visibility into cloud spend and foster a cost-aware culture.
Benefits
100% remote work with a focus on PST hours (9am-6pm PST).
Opportunity to impact infrastructure management at an organizational level.
Collaboration with cross-functional teams to influence roadmaps and budgets.
Full Job Description
Cloud Capacity & Efficiency Engineer
100% Remote: PST hours (must work 9am-6pm PST)
We are seeking a strong engineer with expertise in cloud capacity management and forecasting. This role is critical to helping us understand, optimize, and strategically manage our infrastructure spend across cloud and datacenter environments.
In this role, you will drive how efficiently we operate our multi-cloud footprint, including forecasting infrastructure demand and planning capacity, improving utilization, and reducing unit costs across compute, storage, and networking resources. This includes building robust visibility into infrastructure spend, developing accurate cost attribution across teams and workloads, modeling resource demand, and identifying opportunities to improve efficiency at scale.
The Opportunity for You:
Cloud Cost Modeling & Attribution
Build and maintain scalable cost attribution models and cost-of-revenue pipelines that accurately allocate infrastructure spend (compute, storage, networking, data transfer, etc.) across teams, products, and workloads, providing clear visibility into cost drivers.
Forecasting & Capacity Planning
Develop and own forecasting models for infrastructure demand, incorporating business growth, product roadmaps, and historical trends to support accurate budgeting and proactive capacity planning. Oversee forecasts, approve capacity requests, and ensure alignment with organizational priorities.
Capacity & Cost Reporting
Deliver comprehensive reporting on infrastructure capacity and spend, including forecast vs. actuals, requests vs. allocation, allocation efficiency, and scenario analysis. Track forecast accuracy, quality, and key variance drivers.
Unit Economics & Efficiency Optimization
Define and operationalize unit cost metrics (e.g., cost per request, cost per GB stored, cost per pipeline run) and develop workload-level unit economics. Identify inefficiencies and drive initiatives that improve utilization, reduce costs, and meet efficiency targets.
Cross-Functional Partnership
Collaborate with infrastructure, engineering (SRE/app/dev), finance, procurement, and product teams to analyze usage patterns, influence roadmaps, and ensure cost-efficient scaling of services.
Operational Excellence & Governance
Establish efficiency standards, track cost avoidance and utilization metrics against targets, and implement feedback loops to continuously improve forecasting, capacity planning, and infrastructure usage.
Data Accessibility & Cost-Aware Culture
Build self-service dashboards, automated reporting, and accessible datasets to empower teams with visibility into cloud spend, capacity, and efficiency metrics, fostering a cost-aware engineering culture.
Key Success Factors:
3+ years of experience in data science, analytics, TPM, or FinOps roles, with a focus on cloud infrastructure cost analysis, capacity planning, or efficiency optimization.
Must have experience building spend forecasting models and large-scale cost attribution systems.
Deep knowledge of cloud billing systems, cost allocation methodologies, and spend optimization levers (e.g., reserved instances, committed use discounts, rightsizing, spot/preemptible usage).
Expertise in Python, SQL, forecasting, data modeling and data visualization tools.
A strong ability to thrive in ambiguity, taking initiative to create clarity and forward progress, with highly effective communication and presentation skills.