PositionOverview
JobTitle:DataEngineer(6-MonthContract)
Department:Services
Location:Singapore
ReportingTo:Contract
Duration:6months
Tookitaki is seeking a Data Engineer (Contract) with strong expertise in Apache Spark and Cloudera (CDP) to support high-priority data initiatives for our AI-driven financial crime prevention platformsFinCense and the AFC Ecosystem. This role will contribute to building and maintaining robust data pipelines that ensure accurate scalable and production-grade data processing across real-time and batch workflows.
PositionPurpose
This role is designed to support data engineering efforts during a critical delivery phase. The engineer will work closely with platform product and services teams to enable high quality data ingestion transformation and availability across Tookitakis compliance modules. The work done in this role directly contributes to risk scoring transaction monitoring and fraud detection systems for global banks and fintech clients.
KeyResponsibilities
-BasedDataDevelopment
- DesignandoptimizebatchandstreamingpipelinesusingApacheSpark.
- DebugperformanceandmemoryissuesinSpark-basedETLprocesses.
(CDP)Handling
- LeverageHDFSHiveImpala/TrinoandHBasewithinClouderatosupportdataworkflows.
- CollaboratewithinfrateamstoensureCDPclusterreliabilityandschemaalignment.
&Monitoring
- BuildingestionpipelinesusingKafkaHiveSparkforlarge-scalefinancialdatasets.
- SupportAirflow-basedorchestrationandensureproductionSLAsaremet.
&Debugging
- WriteandoptimizeSQLqueriestovalidatedataaccuracyandingestionsuccess.
- Assistintracingpipelineissuesandexecutingbackfillsifnecessary.
-FunctionalCollaboration
- CoordinatewithdatascientistsDevOpsandserviceteamstosupportplatformreleases.
- Deliveronstrictprojecttimelinestiedtoactiveclientdeployments.
QualificationsandSkills
Education
- Bachelors/MastersinComputerScienceEngineeringorrelateddiscipline.
Experience
- 58yearsasaDataEngineerwithatleast2yearsinSpark-heavyenvironments.
- PriorexperienceworkingwithClouderaDataPlatform(CDP)inproduction.
TechnicalExpertise
- ApacheSpark(CoreSQLTuning)
- ClouderaCDP:HiveHDFSHBaseImpala/Trino
- KafkaAirflowSQL
- PythonandBashscripting
- FamiliaritywithLinux-basedenvironments
- ExposuretoAWSisaplus
SoftSkills
- Strongproblem-solvingmindset
- Abilitytothriveincontractualdelivery-drivensettings
- Clearcommunicationanddocumentationhabits
- Focusonexecutionqualityandspeed
KeyCompetencies
- DataPipelineOwnership
- BigDataArchitecture
- ExecutionAgilityinProjectTimelines
- CollaborativeImplementationMindset
- OperationalReadinessSuccessMetrics
- On-timedeliveryofassignedpipelinecomponents
- StabilityandperformanceofSparkworkflowsinUATandproduction
- Accuracyofdatavalidationandtransformationlogic
- Cross-teamsatisfactionwithdeliverablesinrolloutsprints
PositionOverviewJobTitle:DataEngineer(6-MonthContract)Department:ServicesLocation:SingaporeReportingTo:ContractDuration:6monthsTookitaki is seeking a Data Engineer (Contract) with strong expertise in Apache Spark and Cloudera (CDP) to support high-priority data initiatives for our AI-driven financia...
PositionOverview
JobTitle:DataEngineer(6-MonthContract)
Department:Services
Location:Singapore
ReportingTo:Contract
Duration:6months
Tookitaki is seeking a Data Engineer (Contract) with strong expertise in Apache Spark and Cloudera (CDP) to support high-priority data initiatives for our AI-driven financial crime prevention platformsFinCense and the AFC Ecosystem. This role will contribute to building and maintaining robust data pipelines that ensure accurate scalable and production-grade data processing across real-time and batch workflows.
PositionPurpose
This role is designed to support data engineering efforts during a critical delivery phase. The engineer will work closely with platform product and services teams to enable high quality data ingestion transformation and availability across Tookitakis compliance modules. The work done in this role directly contributes to risk scoring transaction monitoring and fraud detection systems for global banks and fintech clients.
KeyResponsibilities
-BasedDataDevelopment
- DesignandoptimizebatchandstreamingpipelinesusingApacheSpark.
- DebugperformanceandmemoryissuesinSpark-basedETLprocesses.
(CDP)Handling
- LeverageHDFSHiveImpala/TrinoandHBasewithinClouderatosupportdataworkflows.
- CollaboratewithinfrateamstoensureCDPclusterreliabilityandschemaalignment.
&Monitoring
- BuildingestionpipelinesusingKafkaHiveSparkforlarge-scalefinancialdatasets.
- SupportAirflow-basedorchestrationandensureproductionSLAsaremet.
&Debugging
- WriteandoptimizeSQLqueriestovalidatedataaccuracyandingestionsuccess.
- Assistintracingpipelineissuesandexecutingbackfillsifnecessary.
-FunctionalCollaboration
- CoordinatewithdatascientistsDevOpsandserviceteamstosupportplatformreleases.
- Deliveronstrictprojecttimelinestiedtoactiveclientdeployments.
QualificationsandSkills
Education
- Bachelors/MastersinComputerScienceEngineeringorrelateddiscipline.
Experience
- 58yearsasaDataEngineerwithatleast2yearsinSpark-heavyenvironments.
- PriorexperienceworkingwithClouderaDataPlatform(CDP)inproduction.
TechnicalExpertise
- ApacheSpark(CoreSQLTuning)
- ClouderaCDP:HiveHDFSHBaseImpala/Trino
- KafkaAirflowSQL
- PythonandBashscripting
- FamiliaritywithLinux-basedenvironments
- ExposuretoAWSisaplus
SoftSkills
- Strongproblem-solvingmindset
- Abilitytothriveincontractualdelivery-drivensettings
- Clearcommunicationanddocumentationhabits
- Focusonexecutionqualityandspeed
KeyCompetencies
- DataPipelineOwnership
- BigDataArchitecture
- ExecutionAgilityinProjectTimelines
- CollaborativeImplementationMindset
- OperationalReadinessSuccessMetrics
- On-timedeliveryofassignedpipelinecomponents
- StabilityandperformanceofSparkworkflowsinUATandproduction
- Accuracyofdatavalidationandtransformationlogic
- Cross-teamsatisfactionwithdeliverablesinrolloutsprints
View more
View less