drjobs Staff Site Reliability Engineer - PRE

Staff Site Reliability Engineer - PRE

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Warsaw - Poland

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Hadoop/Big-Data: 

  • Sound knowledge on managing large scale Hadoop platforms including monitoring the platform debugging issues and tuning the performance of the cluster.

  • In-depth knowledge of the Hadoop ecosystem including Zookeeper HDFS Yarn HIVE SPARK Trino and Kafka.

  • Proven experience in debugging issues on both Hadoop platform and applications.

  • Familiarity with security tools such as Kerberos Ranger and active directory integrations.

  • Experience on Cloud technologies preferably AWS EMR.

  • Knowledge on Kubernetes AI MLOPS will be advantageous.

Collaboration and Teamwork:

  • Collaborate closely with L-3 teams to review new use cases and implement cluster hardening techniques ensuring the development of robust and reliable platforms.

  • Foster cross-team collaboration building and maintaining strong relationships with customer teams user communities architects and engineering teams.

  • Work jointly on key deliverables to ensure production scalability and stability.

Automation: Hands-on Experience with automations using Ansible Shell python or any programming languages. The ability to automate the manual tasks is key in this role.

Observability: knowledge on observability tools like Grafana opera Prometheus and Splunk.

Linux: understanding of Linux networking CPU memory and storage. 

Programming Languages: Knowledge of and ability to code or program in one of python Java or a widely used coding language.

Communication: Excellent interpersonal skills along with superior verbal and written communication abilities.

This position is not ideal for a Hadoop developer.

This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager.

 


Qualifications :

Basic Qualifications:


- As a Staff Site Reliability Engineer you will play a key role in maintaining and supporting Visas Data Platform ensuring the reliability and performance of critical Big Data systems.  
- You will drive innovation for our partners and clients globally by working on open-source Big Data clusters optimizing their availability efficiency and scalability.  

Education & Experience:
  - Masters degree in Math Science Engineering Computer Science Information Systems or a related field; OR
  - Bachelors degree in Math Science Engineering Computer Science Information Systems or a related field AND a minimum of five years of relevant experience; OR
  - A minimum of five years of experience working with Hadoop systems.  

Preferred Qualifications:
- Experience in Big Data SRE and Engineering across open-source platforms such as Hadoop Kafka HBase and Spark with strong troubleshooting and debugging skills.  
- Proven ability to conduct effective root cause analysis of major production incidents document findings and implement high-availability solutions for critical services.  
- Expertise in capacity planning system expansions and timely upgrades to mitigate scaling challenges while automating repetitive tasks to reduce manual effort and prevent errors.  
- Ability to fine-tune alerting and set up observability tools to proactively identify and resolve performance issues collaborating with Level-3 teams on use case reviews and cluster hardening.  
- Strong documentation skills to create standard operating procedures and platform utilization guidelines ensuring consistency and efficiency in operations.  
- Proficiency in leveraging DevOps tools and industry best practices including incident problem and change management disciplines.  
- Commitment to ensuring Hadoop platform performance meets service-level agreements with experience in security remediation automation and self-healing implementations.  
- Experience in developing automation tools and reports to streamline processes using technologies such as Shell scripting Ansible Python or other programming languages. 


Additional Information :

Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race color religion sex national origin sexual orientation gender identity disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.


Remote Work :

No


Employment Type :

Full-time

Employment Type

Full-time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.