Requirements:
- 6 years in Big Data Engineering with at least 3 years of hands-on Cloudera (CDH/CDP).
- Deep knowledge of HDFS Kudu Ozone and the Cloudera Manager API.
- Proficiency in Python or PySpark for automation and data processing.
- Excellent English communication for coordinating with KSA-based architects and stakeholders.
- Cloudera Certified Professional (CCP) or Associate is highly preferred.
Responsibilities:
- Design deploy and manage Cloudera Data Platform (CDP) Private Cloud Base environments.
- Implement enterprise-grade security using Apache Ranger (RBAC) Apache Atlas (Lineage) and Kerberos to meet Saudi national data security standards (NCA/NDMO).
- Optimize YARN resource allocation and perform deep-dive tuning for Impala Hive on Tez and Spark workloads.
- Manage high-volume data ingestion using Apache NiFi (CDF) and Kafka for real-time service monitoring.
- Automated cluster monitoring backup/recovery strategies and 24/7 high-availability maintenance.
Requirements: 6 years in Big Data Engineering with at least 3 years of hands-on Cloudera (CDH/CDP).Deep knowledge of HDFS Kudu Ozone and the Cloudera Manager API.Proficiency in Python or PySpark for automation and data processing.Excellent English communication for coordinating with KSA-based archit...
Requirements:
- 6 years in Big Data Engineering with at least 3 years of hands-on Cloudera (CDH/CDP).
- Deep knowledge of HDFS Kudu Ozone and the Cloudera Manager API.
- Proficiency in Python or PySpark for automation and data processing.
- Excellent English communication for coordinating with KSA-based architects and stakeholders.
- Cloudera Certified Professional (CCP) or Associate is highly preferred.
Responsibilities:
- Design deploy and manage Cloudera Data Platform (CDP) Private Cloud Base environments.
- Implement enterprise-grade security using Apache Ranger (RBAC) Apache Atlas (Lineage) and Kerberos to meet Saudi national data security standards (NCA/NDMO).
- Optimize YARN resource allocation and perform deep-dive tuning for Impala Hive on Tez and Spark workloads.
- Manage high-volume data ingestion using Apache NiFi (CDF) and Kafka for real-time service monitoring.
- Automated cluster monitoring backup/recovery strategies and 24/7 high-availability maintenance.
View more
View less