Perform end-to-end testing of large-scale data pipelines using PySpark BigQuery Dataproc and SQL
Develop and execute test cases test plans and automated test scripts for data validation
Validate migration of Hadoop/Spark/Hive workloads to GCP ensuring data integrity and accuracy
Write complex SQL/BigQuery queries for batch and real-time data validation
Design and implement test automation frameworks using modern testing practices (TDD BDD)
Conduct performance and load testing using tools such as JMeter or pytest
Collaborate with development teams to improve code quality and test coverage
Participate in Agile ceremonies requirement analysis and backlog grooming
Identify and mitigate risks related to performance scalability and data quality
Support deployment monitoring and testing across dev staging and production environments
Job Description: Perform end-to-end testing of large-scale data pipelines using PySpark BigQuery Dataproc and SQL Develop and execute test cases test plans and automated test scripts for data validation Validate migration of Hadoop/Spark/Hive workloads to GCP ensuring data integrity...
Job Description:
Perform end-to-end testing of large-scale data pipelines using PySpark BigQuery Dataproc and SQL
Develop and execute test cases test plans and automated test scripts for data validation
Validate migration of Hadoop/Spark/Hive workloads to GCP ensuring data integrity and accuracy
Write complex SQL/BigQuery queries for batch and real-time data validation
Design and implement test automation frameworks using modern testing practices (TDD BDD)
Conduct performance and load testing using tools such as JMeter or pytest
Collaborate with development teams to improve code quality and test coverage
Participate in Agile ceremonies requirement analysis and backlog grooming
Identify and mitigate risks related to performance scalability and data quality
Support deployment monitoring and testing across dev staging and production environments