Hadoop Data Infrastructure Engineer | Cloud Migration, Cluster Support, Automation, Performance Optimization, Security & High Availability
Job Summary
Job Summary
Synechron is seeking a highly skilled DevOps Data Hadoop Engineer to lead the design implementation and management of enterprise-grade big data infrastructure. This role involves supporting high-availability Hadoop clusters optimizing performance automating deployment workflows and integrating new technological solutions within large-scale data environments. You will work closely with platform security and data science teams to ensure scalable reliable and secure systems that support advanced analytics and data-driven initiatives.
Software Requirements
Required: Cloudera platform (CDH/HDp) or Hadoop 2.x/3.x Terraform Ansible Git Jenkins Shell and Python scripting monitoring tools (Splunk CloudWatch New Relic ELK Stack) Linux OS network and security tools (firewalls VPNs encryption APIs)
Preferred: Spark Hive Presto Kafka HDFS Redshift AWS EMR Azure HDInsight Kubernetes GitHub Actions Prometheus Grafana
Experience level: 5 years supporting large-scale Hadoop clusters migration projects and automation in enterprise environments
Overall Responsibilities
Lead the deployment support and optimization of Hadoop clusters supporting data analytics platforms
Automate provisioning patching and configuration management of Hadoop data environments using Terraform Ansible and scripting
Support high-availability architectures monitor system health and perform capacity planning and tuning
Collaborate with data and analytics teams to enable data ingestion processing and storage workflows
Conduct root cause analysis of system incidents optimize performance and implement preventive measures
Drive automation initiatives to reduce manual operations and improve system resilience
Maintain operational documentation runbooks and compliance records for data infrastructure
Support migration activities and cloud integration (AWS Azure) supporting hybrid data environments
Ensure compliance with security data governance and enterprise standards
Technical Skills (By Category)
Programming Languages:
Essential: Shell scripting Python SQL (query optimization data validation)
Preferred: Java Scala Spark APIs for data processing tasks
Databases/Data Management:
Enterprise Hadoop-related data storage (HDFS HBase) Redshift cloud data lakes (ADLS S3)
Cloud Technologies:
Basic knowledge of AWS Azure or GCP for cloud migration automation and managed data services (preferred)
Frameworks and Libraries:
Spark Hive Presto Kafka Kafka Connect Presto Apache Ranger Knox security gateways
Development Tools & Methodologies:
Terraform Ansible Jenkins Git CI/CD pipelines Agile/Scrum DevSecOps principles
Security & Compliance:
Encryption Kerberos LDAP integration IAM policies data masking audit logging
Experience Requirements
5 years of experience supporting enterprise Hadoop clusters data lakes or big data ecosystems
Proven success in automating deployment patching and scaling of Hadoop environments
Demonstrable experience supporting high-availability clusters and performance tuning at scale
Previous involvement in cloud data migration or hybrid cloud integration projects preferred
Industry experience in banking finance telecom or healthcare sectors supporting data analytics is advantageous; extensive enterprise support experience is necessary
Day-to-Day Activities
Manage support and optimize Hadoop clusters supporting enterprise analytics workflows
Develop and maintain automation scripts and IaC to support provisioning and configuration management
Monitor system health optimize performance and troubleshoot outages or security incidents
Lead capacity planning upgrade and patch management activities supporting high availability
Support migration of data and operational workflows into cloud or hybrid environments
Collaborate with data science security and platform teams to implement best practices
Maintain documentation incident reports and operational runbooks
Conduct root cause analysis performance tuning and proactive system enhancements
Qualifications
Bachelors or Masters degree in Computer Science Data Science or a related field
5 years of experience supporting large-scale enterprise Hadoop clusters or big data platforms
Certifications such as Cloudera Certified Administrator (CCA) or AWS/Azure Data Engineer are a plus
Strong scripting and automation skills for deployment and operational management
Proven experience supporting high-availability large data environments
Excellent troubleshooting communication and documentation skills
Professional Competencies
Critical thinking and analytical problem-solving in complex data environments
Leadership and mentoring skills to guide support teams and foster best practices
Strong stakeholder management for coordinating cross-team activities
Adaptability to evolving tech environments and cloud services support
Ownership mentality for system reliability security and continuous improvement
Time management and prioritization skills in a fast-paced high-impact setting
SYNECHRONS DIVERSITY & INCLUSION STATEMENT
Diversity & Inclusion are fundamental to our culture and Synechron is proud to be an equal opportunity workplace and is an affirmative action employer. Our Diversity Equity and Inclusion (DEI) initiative Same Difference is committed to fostering an inclusive culture promoting equality diversity and an environment that is respectful to all. We strongly believe that a diverse workforce helps build stronger successful businesses as a global company. We encourage applicants from across diverse backgrounds race ethnicities religion age marital status gender sexual orientations or disabilities to apply. We empower our global workforce by offering flexible workplace arrangements mentoring internal mobility learning and development programs and more.
All employment decisions at Synechron are based on business needs job requirements and individual qualifications without regard to the applicants gender gender identity sexual orientation race ethnicity disabled or veteran status or any other characteristic protected by law.
Required Experience:
IC
About Company
Chez Synechron, nous croyons en la puissance du numérique pour transformer les entreprises en mieux. Notre cabinet de conseil mondial combine la créativité et la technologie innovante pour offrir des solutions numériques de premier plan. Les technologies progressistes et les stratégie ... View more