Data Lakehouse Architect

SEACORP

$120K — $150K *
Enterprise Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 7+ years in data engineering or architecture roles; 3+ years in lakehouse architecture.
  • Hands-on experience with Apache Kafka for real-time data ingestion.
  • Familiarity with Apache Iceberg in analytical environments.
  • Experience with cloud object storage like Amazon S3 or CEPH.
  • Knowledge of Trino for distributed SQL queries.

Responsibilities

  • Design and document lakehouse architecture using specified technologies.
  • Define architecture for data lifecycle management and maintenance.
  • Architect ingestion frameworks for real-time and batch processing.
  • Establish secure storage patterns across different environments.
  • Define governance patterns for data access and compliance integration.
  • Collaborate with data engineers to optimize query performance.
  • Lead teams in architectural design reviews and alignment.

Benefits

  • Choice of two medical insurance programs through Blue Cross & Blue Shield.
  • Best in class Dental Insurance Plan through Delta Dental.
  • Excellent Vision Benefit for discounts and allowances.
  • 401(k) plan with up to 8% employer matching.
  • Generous Paid Time Off (PTO) program and Tuition Reimbursement.
Full Job Description
Data Lakehouse Architect.

Primary Duties and Responsibilities:

Job Summary: SEACORP is seeking a Data Lakehouse Architect to lead thedesign, implementation, and evolution of a modern, tiered data platform thatsupports scalable ingestion, storage, processing, governance, and analytics. Thisposition is in support of our SWFTS Data Strategy and Data Pipelineprogram. This role will define thetarget-state architecture for a lakehouse environment built on technologiesincluding Kafka, Apache Iceberg, Amazon S3, CEPH, and Trino, while ensuring theplatform is secure, performant, reliable, and cost-effective.

The architect will partner with engineering, platform,analytics, security, and business teams to establish architectural standards,guide implementation, and enable high-quality data products across batch andstreaming domains. The ideal candidate combines deep technical expertise indistributed data systems with strong design judgment, leadership, and theability to translate business requirements into durable platform capabilities.

Job Responsibilities Include:
  • Design and document lakehouse architecture using Kafka for streaming ingestion, Iceberg for table format and data management, S3 and/or CEPH for object storage, and Trino for distributed SQL query access.
  • Define architecture for data partitioning, compaction, schema evolution, metadata management, table maintenance, and lifecycle policies.
  • Architect data ingestion frameworks for both real-time and batch workloads, including event-driven and CDC-based integration patterns.
  • Establish scalable, resilient, and secure storage patterns across cloud and on-premises or hybrid object storage environments.
  • Define governance patterns including access control, encryption, data retention, lineage, auditability, and compliance integration.
  • Partner with data engineers to optimize query performance, file sizing, partitioning strategy, and workload concurrency in Trino and related engines.
  • Lead engineering teams and review designs, code, and deployment approaches for alignment with target architecture.


Qualifications:

Education: Bachelor's degree in Computer Science, Engineering, Information Systems, or a related technical field.

Required Experience: Required knowledge of Atlassian Tool Suite, Git, and Linux. Preferred knowledge in C++, Java, Python, Linux. Candidate should have the ability to work in a fast-paced work environment. Able to collaborate with others while being able to handle independent tasking. Ability to learn new technologies quickly.
  • 7+ years of experience in data engineering, data architecture, or platform architecture roles.3+ years of experience designing and implementing modern data lake or lakehouse architectures in production environments.
  • Hands-on experience with Apache Kafka for streaming data ingestion, event architecture, or real-time data integration.Hands-on experience with Apache Iceberg or a similar open table format in large-scale analytical environments.
  • Experience designing data platforms on object storage, including Amazon S3, CEPH, or equivalent S3-compatible storage systems.
  • Experience with Trino or similar distributed SQL query engines for interactive analytics over large datasets.
  • Strong understanding of distributed systems principles, including scalability, fault tolerance, consistency tradeoffs, and performance tuning. Experience with data modeling, schema design, partitioning strategy, and optimization for analytical workloads.
  • Experience with security architecture including role-based access control, encryption, and data governance controls.
  • Experience creating architecture documentation, technical standards, and implementation roadmaps. Strong knowledge of batch and streaming pipeline patterns, including CDC, event-driven design, and ingestion orchestration.

Desired Experience: Desired knowledge in the areas of Databases, SQL and No-SQL (Postgres, MongoDB), Apache Data Frameworks (Kafka, Spark, Iceberg, OpenMetadata, Ranger), Data Infrastructure (Ceph, S3, MinIO/Parquet, REST, Nessie, Druid), Data APIs (Trino, Metabase, MLLib, Superset). Desired knowledge in the areas of software prototyping, VS Code, Cursor IDE, and prompt engineering.
  • Master's degree in Computer Science, Data Engineering, Distributed Systems, or a related field.
  • Experience with Team Submarine, SWFTS, US Navy program offices, TI/APB cycle
  • Experience with metadata catalogs such as Hive Metastore, AWS Glue Catalog, Nessie, or Polaris.
  • Familiarity with data processing engines such as Spark, Flink, or dbt in lakehouse environments.
  • Experience implementing data quality, observability, and lineage tooling.
  • Experience supporting hybrid or multi-cloud data architectures.
  • Familiarity with Kubernetes-based deployment and platform operations.
    Experience with regulated data environments and compliance frameworks such as SOC 2, HIPAA, PCI-DSS, or FedRAMP.
  • Exceptional Qualifications: Candidates possessing knowledge in these technologies will be considered exceptional candidates including Kubernetes, RKE2, containerization, Helm, AI/ML APIs, SparkML, AI/ML Integration (LLM
  • Development Stack), DPCN training, PINN training, or agentic development integration.
  • Recognized expertise designing enterprise-scale lakehouse platforms using open standards and interoperable tooling.
  • Experience delivering software and systems for Team Submarine or SWFTS programs, including experience with the Submarine platform tactical systems.
  • Deep production experience with Kafka + Iceberg + Trino architectures, including performance optimization and operational scaling.
  • Experience building platforms that span cloud and on-premises object storage, especially S3 and CEPH in hybrid deployments.
  • Demonstrated success leading architecture for high-volume, low-latency, and mission-critical data ecosystems.
  • Ability to make principled architectural decisions regarding catalogs, table maintenance, file formats, compaction, and query federation.
  • Strong record of mentoring senior engineers and establishing architecture review processes and engineering standards.
  • Experience leading major data platform migrations from legacy warehouse, Hadoop, or tightly coupled ETL ecosystems to modern lakehouse architectures.
  • Ability to balance long-term architectural integrity with pragmatic delivery timelines and business value.

As a requirement of employment, all SEACORP employees must hold U.S. Citizenship

Location: Manassas, VA

Travel: Quarterly (approximately 4 times a year)

Clearance: Secret

Work Environment & Physical Demands: Office & Computer Laboratories - Sitting, standing, extended periods of time using a mouse and keyboard and viewing computer screens. Infrequent lifting of
Successful candidates will enjoy competitive wages and a very rich benefit program, including:
  • Medical Benefits: Choice of two medical insurance programs through Blue Cross & Blue Shield.
  • Dental Benefits: A best in class Dental Insurance Plan through Delta Dental.
  • Vision Benefits: An excellent Vision Benefit providing discounts and allowances for prescription glasses and contact lenses.
  • Retirement Benefits: A qualified 401(k) Retirement Savings Account with a generous employer matching contribution up to 8% of your eligible compensation.
  • Life Insurance Benefits: Employer paid Life and Accidental Death & Dismemberment Insurance equal to your annual salary. Supplemental coverage is available for you and qualified family members as well as Supplemental Short-Term and Long-Term Disability Insurance.
  • Additional Benefits: Ten (10) Paid Holidays per year (including 2 floating Holidays), a generous Paid Time Off (PTO) program; Tuition Reimbursement, and Referral Bonuses.

Similar Jobs

More Jobs at SEACORP

More Enterprise Technology Jobs

Find similar Data Lakehouse Architect jobs: