Role: ETL Tester (with Python Expertise)
Location: Irvine CA- Onsite
Experience: 8
Core Responsibilities
- Thoroughly test ETL processes-extract transform and load-to ensure data accuracy integrity and alignment with business requirements
- Validate transformations data reconciliation regression checks and detect missing or duplicate records
- Develop and maintain Python scripts to automate data testing workflows using libraries like pandas NumPy and frameworks such as pytest or unittest
- Write and execute complex SQL queries for comparing source and target data spot-checking data loads and slicing data for analysis
- Collaborate with ETL developers data engineers analysts and stakeholders to understand data models mappings and translate them into effective test cases
- Create test plans test cases and maintain documentation-including test strategies test scripts result logs and defect tracking
- Automate test suites and integrate into CI/CD pipelines using tools like Jenkins or Git-based workflows
- Desirable Technical Stack
- Python (scripting) plus data-handling libraries (e.g. pandas NumPy PySpark)
- SQL-especially for data validation across various database platforms (Oracle MS SQL Snowflake Hive etc.)
- Familiarity with ETL tools like Informatica Talend SSIS or cloud platforms like AWS Glue S3 Snowflake or Hadoop ecosystems
- Experience in scripting frameworks in big data environments using PySpark
- Cloud experience (AWS Azure) and orchestration tools like Airflow; knowledge of data lakes data warehousing and modeling concepts such as star/snowflake schemas
- Experience with CI/CD version control (Git) and test management tools (JIRA Jenkins Quality Center etc.)