About the Company:
Our client is a dynamic and rapidly growing company recognized as one of India's leading kids' fashion brands. We are currently seeking a passionate Data Engineer to join their team. Operating in a fast-paced startup environment, they collaborate across various teams, including data analytics, marketing, data science, and individual product teams. If you're enthusiastic about data engineering, thrive on building scalable systems, and possess a strong background in ETL processes, we invite you to be part of their innovative and collaborative team.
Responsibilities:
- Thrive in a fast-paced startup environment, contributing to a culture of innovation and agility.
- Manage all aspects of data extraction, transfer, and load activities.
- Develop robust data pipelines to ensure data availability across platforms.
- Execute ETL processes, including data ingestion, cleaning, and curation into data warehouses, databases, or data platforms.
- Collaborate on various aspects of the AI/ML ecosystem, including data modeling and ML pipelines.
- Work closely with DevOps and senior architects to design scalable system and model architectures for real-time and batch services.
Requirements:
- 3-6 years of experience as a data engineer or data scientist, with a focus on data engineering and ETL jobs.
- Proficiency in data warehousing, data modeling, and/or data analysis concepts.
- Strong experience building and using ETL pipelines on Redshift, following industry-standard best practices (minimum 2 years).
- Ability to troubleshoot and resolve performance issues related to data ingestion, processing, and query execution on Redshift.
- Familiarity with orchestration tools such as Airflow.
- Excellent coding skills in Python and SQL.
- Hands-on experience with distributed systems like Spark.
- Experience with AWS Data and ML Technologies (AWS Glue, MWAA, Data Pipeline, EMR, Athena, Redshift, Lambda, etc.).
- Solid understanding and practical application of various data extraction techniques, such as CDC or time/batch-based extraction, and related tools (Debezium, AWS DMS, Kafka Connect, etc.) for near-real-time and batch data extraction.
Note:
Preference will be given to candidates with experience in product-based companies and e-commerce companies.
AWS, Amazon Redshift, ETL, Spark, Python, SQL, problem solving, system design, Kafka, data engineering, data modeling, Athena, data warehousing, data analysis, AWS Lambda, Glue