5+ years of experience in distributed systems, platform engineering, or infrastructure engineering, with a strong focus on event streaming platforms
Hands-on experience with Apache Kafka / Confluent Platform, including brokers, KRaft/ZooKeeper, Schema Registry, Kafka Connect, and related ecosystem components
Strong experience working with container platforms such as OpenShift (OCP) or Kubernetes, including operators, namespaces, networking, and storage integration
Proven experience designing and operating highly available, multi-node or multi-data-center distributed systems, including replication, fault tolerance, and disaster recovery strategies
Experience with automation and Infrastructure-as-Code, including CI/CD pipelines, configuration management tools, and declarative deployment models
Solid understanding of networking, DNS, load balancing, and secure service exposure in containerized environments
Strong knowledge of security practices, including TLS/mTLS, authentication, authorization (RBAC), and certificate lifecycle management in enterprise environments
Experience with observability and monitoring tooling for metrics, logging, and alerting, and with performance tuning of distributed platforms
Proven ability to troubleshoot complex platform issues, perform root cause analysis, and implement long-term solutions
Strong understanding of capacity planning, performance optimization, and resource utilization for large-scale platforms
Experience working in regulated enterprise environments, with knowledge of risk management, controls, and compliance requirements
Excellent collaboration and communication skills, with the ability to work across engineering teams, infrastructure teams, and external vendors