Senior Database Reliability Engineer

Scribe$120K — $160K *
US-AnywhereRemote in San Francisco, CA
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of experience in database reliability engineering, specifically with PostgreSQL.
  • Proficient with ORMs like Django at scale, with a strong ability to analyze and improve query performance.
  • Hands-on experience in managing and running CDC pipelines, particularly with AWS DMS.
  • Skilled in monitoring and observability tools like pganalyze and CloudWatch, with knowledge of OpenTelemetry.
  • Programming skills in Python or Go, and experience with IAC tools like Terraform.

Responsibilities

  • Own and ensure the reliability and performance of databases, including schema design reviews.
  • Enhance the Django ORM for scalability, establishing standards and CI checks.
  • Manage the CDC pipeline, ensuring it operates smoothly and handles schema evolution gracefully.
  • Develop observability tools to link slow queries to user activity for performance analysis.
  • Drive multi-AZ resilience in the architecture, optimizing failover and recovery processes.
  • Create self-service dashboards for product teams to monitor their query footprints.
  • Support onboarding and knowledge transfer with documentation and training sessions.

Benefits

  • Health, dental, and vision insurance for you and your dependents.
  • Flexible paid time off and company holidays.
  • 401(k) retirement savings plan.
  • Paid parental leave for new parents.
  • Daily catered lunch provided at the San Francisco office.
  • Commuter benefits for on-site employees.
  • Home office stipend to support remote work setup.
Full Job Description
About the Role

We're hiring a Senior Database Reliability Engineer to own the reliability, performance, and scalability of Scribe's data tier. Our engineering org is doubling - which means the guardrails, automation, and standards you put in place today will carry a much larger team through the next phase of growth. This is a senior IC role with real ownership: you'll set the bar for how engineers across the company interact with our databases, not just keep the lights on.

Our stack is Django on PostgreSQL (Aurora Serverless V2), OpenSearch, Redis (ElastiCache), SQS, and RabbitMQ, with a CDC pipeline running Aurora to DMS to S3 Parquet to Snowflake. Engineers ship through the ORM, not raw SQL - which makes migration safety, index design, and query review genuinely high-stakes work.

What You'll Do
  • Own database reliability across Aurora, OpenSearch, Redis, and our CDC pipeline - including schema design reviews, migration safety (locks, backfills, concurrent index builds, NOT VALID constraints), and incident response for the data tier
  • Make the Django ORM a strength at scale: catch N+1 patterns in review, extend QuerySet conventions and physical schema standards, and build the CI checks and AGENTS.md scaffolding that encode those standards so they scale beyond any single reviewer
  • Operate and evolve the CDC pipeline from Aurora through DMS to S3 Parquet to Snowflake - including replication slot hygiene, schema evolution safety, and automated checks that catch migrations likely to break downstream consumers before they ship
  • Build and improve observability across pganalyze, CloudWatch, and Honeycomb, with Django-side instrumentation that ties slow ORM queries back to specific users, flags, and deploys
  • Drive multi-AZ resilience within our single-region architecture - Aurora writer/reader placement, failover behavior, RTO/RPO, ElastiCache and OpenSearch AZ topology, RabbitMQ survivability
  • Build self-service tooling and dashboards that give product and platform teams visibility into their own query footprint, reducing the review burden as the engineering org grows
  • Contribute to onboarding and knowledge-sharing as a large incoming class of engineers joins - write docs, run internal sessions on "what your ORM query is really doing," and feed that knowledge back into AI review tooling


What We're Looking For
  • Has deep PostgreSQL expertise in practice: reads EXPLAIN (ANALYZE, BUFFERS) fluently, understands MVCC, bloat, lock contention, and vacuum behavior, and can tune Aurora Serverless V2 for latency and throughput
  • Has worked with an ORM (Django, SQLAlchemy, ActiveRecord, or similar) at production scale - can predict the SQL a query generates, spot N+1 issues on sight, and knows when joins beat batched IN queries and when they don't
  • Has run CDC pipelines in production, ideally with AWS DMS - comfortable with logical replication, slot hygiene, schema evolution, and Parquet-based data lakes feeding Snowflake, BigQuery, or Redshift
  • Has hands-on experience with pganalyze (or Datadog DBM / pg_stat_statements pipelines), CloudWatch, and Honeycomb (or another high-cardinality tracing tool); comfortable with OpenTelemetry
  • Has worked with OpenSearch, Redis, and at least one production message broker (SQS, RabbitMQ, or Kafka) at scale
  • Writes real automation - Python, Go, or similar - and has used Terraform or comparable IaC to manage infrastructure
  • Has used AI coding and review tools in a team setting: written or maintained AGENTS.md files, configured review agents, iterated on prompts


Nice to Have
  • Event sourcing on Postgres, or experience with alternate CDC tooling (Debezium, Fivetran, Airbyte)
  • pgbouncer or RDS Proxy at scale with Django connection handling
  • Deep Honeycomb usage: SLOs, BubbleUp, Triggers, derived columns
  • Snowflake from the producer side: staging, Snowpipe, external tables on Parquet
  • Experience scaling data infrastructure through rapid engineering headcount growth
  • SOC 2 Type II, GDPR, or similar compliance work


Location

San Francisco (hybrid, 3 days per week in-office) or, Remote based permanently in PST (Pacific Standard Time).

Compensation

Salary varies by location. All full-time employees receive equity in Scribe. Final offers depend on experience and scope.

Benefits
  • Health, dental, and vision insurance for you and your dependents
  • Flexible paid time off and company holidays
  • 401(k)
  • Paid parental leave
  • Daily catered lunch (SF office)
  • Commuter benefits
  • Home office stipend


Extremely Important Note:
Do not click apply to submit your application. Instead, send a POST request to https://apply.scribe.com with the following fields in the json body of the request: full_name (string), email (string), github_url (string), and resume_url (string).

About Scribe

Scribe is a software company that provides data integration and migration solutions. The company's platform allows businesses to connect and integrate data from various sources, including cloud applications, databases, and file systems. Scribe was founded in 1996 and is headquartered in Boston, Massachusetts.
Learn more about Scribe
Size
100 employees
Industry
Founded
1996
NASDAQ

Similar Jobs

More Jobs at Scribe

  • Head of Data
    $230K — $275K *
    San Francisco, CA 94112 (San Francisco County)
    Enterprise Technology
    In-Person
  • Backend Engineer
    $140K — $165K *
    San Francisco, CA 94112 (San Francisco County)
    Information Technology
    In-Person
  • Sales Manager (SMB)
    $220K — $250K *
    San Francisco, CA 94112 (San Francisco County)
    Business Services
    In-Person
  • Senior People Ops Generalist
    $120K — $140K *
    San Francisco, CA 94112 (San Francisco County)
    Business Services
    In-Person
  • AI ABM Specialist
    $100K — $113K *
    San Francisco, CA 94112 (San Francisco County)
    Enterprise Technology
    In-Person

More Information Technology Jobs

Find similar Senior Database Reliability Engineer jobs: