Job Title: Lead Data Engineer
Experience: 7 Years
Location: Noida, Uttar Pradesh
Mode: Work from Office
Role Overview:
We seek an experienced Lead Data Engineer to architect, develop, and optimize on-premise data infrastructure using Apache Kafka, Spark, and Airflow. This role involves leading a data engineering team, managing real-time streaming solutions, and ensuring high-performance, scalable data pipelines.
Key Responsibilities:
Technical Leadership & Team Management:
- Lead and mentor a team of data engineers, ensuring best practices in data pipeline development and real-time processing.
- Conduct code reviews, performance tuning, and architectural optimizations.
- Collaborate with cross-functional teams to align data solutions with business goals.
Data Architecture & Pipeline Development:
- Design and manage high-throughput, scalable data pipelines using Kafka, Spark, and Airflow.
- Architect real-time streaming solutions for event-driven processing.
- Optimize data ingestion, transformation, and processing workflows for structured and unstructured data.
On-Premise Data Infrastructure & Integration:
- Deploy and manage on-premise big data solutions on Kubernetes, OpenShift, VMware, or OpenStack.
- Design and optimize ETL processes for integrating multiple data sources.
- Work with SQL, PL/SQL, and Oracle databases for efficient data storage and retrieval.
Requirements:
- 8 years of experience in data engineering and real-time data streaming.
- Expertise in Kafka, Spark, Airflow, SQL, PL/SQL, and Oracle databases.
- Strong experience with on-premise infrastructure (Kubernetes, OpenShift, VMware, OpenStack).
- Proficiency in Python or Scala for data processing.
- Excellent problem-solving, leadership, and communication skills.
openshift,spark,vmware,oracle,scala,openstack,kubernetes,sql,plsql,kafka,airflow,python,on-premise