- Create and maintain robust scalable data pipelines using GCP services like Dataflow Pub/Sub and Cloud Storage.
- Develop and optimize Extract Transform Load (or Extract Load Transform) processes to move and transform data efficiently.
- Utilize and optimize BigQuery for storing querying and analyzing large datasets.
- Automate and monitor data workflows using tools like Cloud Composer (based on Apache Airflow).
- Implement data governance security best practices and perform data quality checks to maintain accuracy and integrity.
- Work with data scientists analysts and other engineers to understand data needs and deliver solutions.
- Monitor performance of data systems troubleshoot issues and optimize pipelines for scalability and efficiency.
- Write and maintain scripts in languages like Python and SQL for data processing automation and analysis
Create and maintain robust scalable data pipelines using GCP services like Dataflow Pub/Sub and Cloud Storage. Develop and optimize Extract Transform Load (or Extract Load Transform) processes to move and transform data efficiently. Utilize and optimize BigQuery for storing querying and analyzing l...
- Create and maintain robust scalable data pipelines using GCP services like Dataflow Pub/Sub and Cloud Storage.
- Develop and optimize Extract Transform Load (or Extract Load Transform) processes to move and transform data efficiently.
- Utilize and optimize BigQuery for storing querying and analyzing large datasets.
- Automate and monitor data workflows using tools like Cloud Composer (based on Apache Airflow).
- Implement data governance security best practices and perform data quality checks to maintain accuracy and integrity.
- Work with data scientists analysts and other engineers to understand data needs and deliver solutions.
- Monitor performance of data systems troubleshoot issues and optimize pipelines for scalability and efficiency.
- Write and maintain scripts in languages like Python and SQL for data processing automation and analysis
View more
View less