Job Description: In this role you will be responsible for the end-to-end installation configuration and monitoring of Apache Kafka clusters to ensure 24/7 high availability and optimal performance. You will manage complex topic partitioning strategies implement enterprise-grade security protocols (SSL/TLS SASL ACLs) and automate routine tasks using Infrastructure as Code (IaC). This position requires a strong Linux administrator who can provide 3rd-level support for producer/consumer bottlenecks and design disaster recovery strategies across multiple regions.
Key Responsibilities:
-
Cluster Lifecycle Management: Install configure and upgrade Kafka brokers Zookeeper/KRaft Connect and Schema Registry.
-
Security Orchestration: Implement and maintain robust security measures including SSL/TLS encryption Kerberos authentication and ACLs.
-
Performance Engineering: Monitor cluster health consumer lag and producer latency using Prometheus and Grafana to ensure high throughput and low latency.
-
Topic & Partition Strategy: Manage topic configurations replication factors and data retention policies to optimize storage and recovery.
-
Infrastructure Automation: Develop automation scripts for provisioning and monitoring using Ansible Terraform or Kubernetes.
-
High Availability (HA): Design and manage cross-region replication strategies to ensure broker redundancy and seamless disaster recovery.
-
3rd-Level Support: Perform root cause analysis (RCA) to resolve complex broker and connectivity issues for application teams.
-
Linux Administration: Leverage strong Linux system skills to manage the underlying platform for the Kafka ecosystem.
-
Collaborative Design: Partner with developers to optimize producer/consumer configurations and onboard new streaming applications.
Must-Have Technical Skills:
-
Kafka Mastery: In-depth knowledge of Apache Kafka and Confluent Kafka architectures.
-
Monitoring & Observability: Hands-on experience with Prometheus Grafana and JMX.
-
Automation & Scripting: Proficient in Python or Bash and experienced with Ansible Puppet or Terraform.
-
Containerization: Practical experience with Docker and Kubernetes/OpenShift.
-
System Admin: Strong Linux system administration background.
Job Description: In this role you will be responsible for the end-to-end installation configuration and monitoring of Apache Kafka clusters to ensure 24/7 high availability and optimal performance. You will manage complex topic partitioning strategies implement enterprise-grade security protocols (S...
Job Description: In this role you will be responsible for the end-to-end installation configuration and monitoring of Apache Kafka clusters to ensure 24/7 high availability and optimal performance. You will manage complex topic partitioning strategies implement enterprise-grade security protocols (SSL/TLS SASL ACLs) and automate routine tasks using Infrastructure as Code (IaC). This position requires a strong Linux administrator who can provide 3rd-level support for producer/consumer bottlenecks and design disaster recovery strategies across multiple regions.
Key Responsibilities:
-
Cluster Lifecycle Management: Install configure and upgrade Kafka brokers Zookeeper/KRaft Connect and Schema Registry.
-
Security Orchestration: Implement and maintain robust security measures including SSL/TLS encryption Kerberos authentication and ACLs.
-
Performance Engineering: Monitor cluster health consumer lag and producer latency using Prometheus and Grafana to ensure high throughput and low latency.
-
Topic & Partition Strategy: Manage topic configurations replication factors and data retention policies to optimize storage and recovery.
-
Infrastructure Automation: Develop automation scripts for provisioning and monitoring using Ansible Terraform or Kubernetes.
-
High Availability (HA): Design and manage cross-region replication strategies to ensure broker redundancy and seamless disaster recovery.
-
3rd-Level Support: Perform root cause analysis (RCA) to resolve complex broker and connectivity issues for application teams.
-
Linux Administration: Leverage strong Linux system skills to manage the underlying platform for the Kafka ecosystem.
-
Collaborative Design: Partner with developers to optimize producer/consumer configurations and onboard new streaming applications.
Must-Have Technical Skills:
-
Kafka Mastery: In-depth knowledge of Apache Kafka and Confluent Kafka architectures.
-
Monitoring & Observability: Hands-on experience with Prometheus Grafana and JMX.
-
Automation & Scripting: Proficient in Python or Bash and experienced with Ansible Puppet or Terraform.
-
Containerization: Practical experience with Docker and Kubernetes/OpenShift.
-
System Admin: Strong Linux system administration background.
View more
View less