We are seeking a highly skilled and motivated Senior Data Engineer to play a key role in our significant data platform migration project. You will be responsible for designing developing and optimizing data pipelines and transformations as we transition from our current Airflow PySpark Athena and BigQuery environment to a modern stack built on Airflow dbt and Snowflake. This role requires a strong understanding of data warehousing principles excellent SQL and Python skills and a proven ability to deliver robust and scalable data solutions.
Your Responsibilities
Data Migration & Pipeline Development:
Design develop and implement efficient and reliable data pipelines to migrate data from PySpark/Athena/BigQuery to DBT/Snowflake.
Translate complex data requirements into actionable dbt models and transformations within Snowflake.
Build and maintain Airflow DAGs for orchestrating data ingestion transformation and loading processes.
Optimize existing data pipelines for performance scalability and cost efficiency in the new Snowflake environment.
Data Modeling & Transformation:
Develop and maintain robust data models in Snowflake using dbt adhering to best practices for data warehousing and analytics.
Write complex SQL queries for data extraction transformation and loading.
Ensure data quality accuracy and consistency throughout the migration and ongoing data operations.
Troubleshooting & Optimization:
Identify diagnose and resolve data-related issues performance bottlenecks and data discrepancies.
Proactively monitor data pipelines and systems to ensure smooth operation and data availability.
Implement performance tuning strategies within Snowflake and dbt to optimize query execution and resource utilization.
Collaboration & Documentation:
Collaborate closely with Lead Data Engineers Data Analysts and other stakeholders to understand data needs and deliver effective solutions.
Contribute to the development and maintenance of comprehensive technical documentation for data pipelines models and processes.
Participate in code reviews and contribute to the teams adherence to coding standards and best practices.
Requirements
Qualifications
Bachelors or Masters degree in Computer Science Engineering or a related quantitative field.
3 years of experience in data engineering with a focus on building and maintaining scalable data pipelines.
Solid experience with data migration projects and working with large datasets.
Strong hands-on experience with Snowflake including data loading querying and performance optimization.
Proficiency in dbt (data build tool) for data transformation and modeling.
Proven experience with Apache Airflow for scheduling and orchestrating data workflows.
Expert-level SQL skills including complex joins window functions and performance tuning.
Proficiency in Python for data manipulation scripting and automation for edge cases
Familiarity with PySpark AWS Athena and Google BigQuery (source systems).
Understanding of data warehousing concepts dimensional modeling and ELT principles.
Knowledge of building CI/CD pipelines for code deployment
Experience with version control systems (e.g. Github).
Excellent problem-solving analytical and communication skills.
Ability to work independently and as part of a collaborative team in an agile environment.
Must speak and write in English fluently; Effective communicator
Required Experience:
Senior IC
We are seeking a highly skilled and motivated Senior Data Engineer to play a key role in our significant data platform migration project. You will be responsible for designing developing and optimizing data pipelines and transformations as we transition from our current Airflow PySpark Athena and Bi...
We are seeking a highly skilled and motivated Senior Data Engineer to play a key role in our significant data platform migration project. You will be responsible for designing developing and optimizing data pipelines and transformations as we transition from our current Airflow PySpark Athena and BigQuery environment to a modern stack built on Airflow dbt and Snowflake. This role requires a strong understanding of data warehousing principles excellent SQL and Python skills and a proven ability to deliver robust and scalable data solutions.
Your Responsibilities
Data Migration & Pipeline Development:
Design develop and implement efficient and reliable data pipelines to migrate data from PySpark/Athena/BigQuery to DBT/Snowflake.
Translate complex data requirements into actionable dbt models and transformations within Snowflake.
Build and maintain Airflow DAGs for orchestrating data ingestion transformation and loading processes.
Optimize existing data pipelines for performance scalability and cost efficiency in the new Snowflake environment.
Data Modeling & Transformation:
Develop and maintain robust data models in Snowflake using dbt adhering to best practices for data warehousing and analytics.
Write complex SQL queries for data extraction transformation and loading.
Ensure data quality accuracy and consistency throughout the migration and ongoing data operations.
Troubleshooting & Optimization:
Identify diagnose and resolve data-related issues performance bottlenecks and data discrepancies.
Proactively monitor data pipelines and systems to ensure smooth operation and data availability.
Implement performance tuning strategies within Snowflake and dbt to optimize query execution and resource utilization.
Collaboration & Documentation:
Collaborate closely with Lead Data Engineers Data Analysts and other stakeholders to understand data needs and deliver effective solutions.
Contribute to the development and maintenance of comprehensive technical documentation for data pipelines models and processes.
Participate in code reviews and contribute to the teams adherence to coding standards and best practices.
Requirements
Qualifications
Bachelors or Masters degree in Computer Science Engineering or a related quantitative field.
3 years of experience in data engineering with a focus on building and maintaining scalable data pipelines.
Solid experience with data migration projects and working with large datasets.
Strong hands-on experience with Snowflake including data loading querying and performance optimization.
Proficiency in dbt (data build tool) for data transformation and modeling.
Proven experience with Apache Airflow for scheduling and orchestrating data workflows.
Expert-level SQL skills including complex joins window functions and performance tuning.
Proficiency in Python for data manipulation scripting and automation for edge cases
Familiarity with PySpark AWS Athena and Google BigQuery (source systems).
Understanding of data warehousing concepts dimensional modeling and ELT principles.
Knowledge of building CI/CD pipelines for code deployment
Experience with version control systems (e.g. Github).
Excellent problem-solving analytical and communication skills.
Ability to work independently and as part of a collaborative team in an agile environment.
Must speak and write in English fluently; Effective communicator
Required Experience:
Senior IC
View more
View less