Job Description
Strong hands-on experience with Ab Initio GDE, including core components such as Scan, Rollup, Normalize, Reformat, Join, Filter by Expression and other standard transformation components.
Proven experience designing and delivering high-volume parallel ETL solutions, including effective use of partitioning and performance optimisation techniques.
Hands-on experience working with big data platforms, including HDFS and Hive integration within Ab Initio.
Experience designing and implementing Ab Initio Plans, including parallel and concurrent execution.
Proficiency in PDL (Parameterised Data Language) to develop reusable, parameter-driven logic.
Experience with code promotion and environment migration across DEV / UAT / PROD.
Good understanding of data dependencies and lineage within complex ETL graphs and plans.
Job Responsibilities
Supporting Skills
Strong Unix/Linux scripting experience for automation, orchestration and operational support.
Solid SQL skills for data analysis, validation, reconciliation and performance tuning.
Experience working in batch-oriented ETL environments, including scheduling and monitoring (e.g. TWS or equivalent).
Working knowledge of AWS-based data platforms, including:
  Familiarity with cloud data storage and processing concepts (e.g. S3-based data lakes, Hadoop on AWS).
  Experience supporting or running ETL workloads in AWS-hosted environments.
  Awareness of cloud security, access controls and environment segregation in regulated setups.
Nice to Have
Experience supporting data migration or modernisation programmes (e.g. on-prem to Hadoop / cloud).
Exposure to financial services or banking data environments.
Experience with production support, defect analysis and performance troubleshooting in distributed data platforms.
Experience working in hybrid estates combining on-prem and cloud-hosted data platforms.
Skill Category
Data Engineering
Keyskills - Must Have
ETL
Big Data
Hive
Keyskills - Nice to Have
Unix
Linux
Additional Skills
Ab Initio GDE
Parameterised Data Language