SeniorLead Data Engineer Databricks

Not Interested
Bookmark
Report This Job

profile Job Location:

Gurgaon - India

profile Monthly Salary: Not Disclosed
Posted on: 30+ days ago
Vacancies: 1 Vacancy

Job Summary

Senior/Lead Data Engineer - Databricks

We are seeking a highly skilled and experienced Data Engineering Lead with a strong background in the retail domain and exceptional programming abilities. As a Lead you will play a pivotal role in implementing and optimizing data architecture to support our retail business operations and analytics initiatives. Your expertise in Spark programming optimization techniques and familiarity with Databricks and CI/CD practices will be instrumental in ensuring the efficient and effective management of our dataecosystem.

Imp Pointers as below skills should be in the CV

Data Platform (Data Engineers)

  • Data Lake:AWS S3 / Azure / Dell Cloud Data Lake with Delta Lake format
  • Data Warehouse:Databricks IOMETE Google BigQuery for analytical workloads
  • Batch Processing /Stream Processing:Batch/real-time processing of data pipelines
  • Database:PostgreSQL for transactional data NoSQL (ex. MongoDB Cassendra) for document storage

Responsibilities:

  1. Design and develop data models data integration processes and data pipelines to capture transform and load structured and unstructured data from various retail sources.
  2. Hands-on programming in Spark to develop and optimize data processing applications and analytics workflows.
  3. Apply optimization techniques to enhance the performance and efficiency of data processing and analytical tasks.
  4. Evaluate and implement appropriate tools and technologies including Databricks to streamline data operations and ensure scalability and reliability.
  5. Work closely with other team members to ensure data integrity consistency and accessibility across the organization.
  6. Define and enforce best practices for data governance and data management including data quality metadata management and data security.
  7. Collaborate with DevOps teams to establish and maintain CI/CD pipelines for data engineering and analytics workflows.
  8. Peer Review of team members deliverables
  9. Stay updated with the latest advancements and trends in the retail domain data architecture and programming languages to drive continuous improvement.

Requirements:

  1. At least 5 years of experience in the data engineering domain.
  2. Proven experience as a senior Data Engineer /Lead preferably within the retail industry.
  3. Strong programming skills with expertise in PySpark programming and optimization techniques.
  4. Hands-on experience with Databricks Deltalake and its components for data processing and analytics.
  5. Hands-on experience in data modelling data integration and ETL/ELT processes.
  6. Experience in working with Gitlab pipelines and an in-depth understanding of CI/CD pipeline designs.
  7. Experience with data governance data quality and metadata management.
  8. Strong analytical and problem-solving abilities with a detail-oriented mindset.
  9. Excellent communication and collaboration skills to work effectively with cross-functional teams.
  10. Ability to adapt to a fast-paced and evolving environment while managing multiple priorities.
  11. Good to have experience in at least one of the Cloud Vendor (AWS / Azure / GCP)
  12. Good to have experience with streaming technologies as well


Required Experience:

Senior IC

Senior/Lead Data Engineer - DatabricksWe are seeking a highly skilled and experienced Data Engineering Lead with a strong background in the retail domain and exceptional programming abilities. As a Lead you will play a pivotal role in implementing and optimizing data architecture to support our reta...
View more view more

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala