drjobs Staff Site Reliability Engineer - PRE

Staff Site Reliability Engineer - PRE

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Warsaw - Poland

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Hadoop/BigData: 

  • Sound knowledge on managing large scale Hadoop platforms including monitoring the platform debugging issues and tuning the performance of the cluster.

  • Indepth knowledge of the Hadoop ecosystem including Zookeeper HDFS Yarn HIVE SPARK Trino and Kafka.

  • Proven experience in debugging issues on both Hadoop platform and applications.

  • Familiarity with security tools such as Kerberos Ranger and active directory integrations.

  • Experience on Cloud technologies preferably AWS EMR.

  • Knowledge on Kubernetes AI MLOPS will be advantageous.

Collaboration and Teamwork:

  • Collaborate closely with L3 teams to review new use cases and implement cluster hardening techniques ensuring the development of robust and reliable platforms.

  • Foster crossteam collaboration building and maintaining strong relationships with customer teams user communities architects and engineering teams.

  • Work jointly on key deliverables to ensure production scalability and stability.

Automation: Handson Experience with automations using Ansible Shell python or any programming languages. The ability to automate the manual tasks is key in this role.

Observability: knowledge on observability tools like Grafana opera Prometheus and Splunk.

Linux: understanding of Linux networking CPU memory and storage. 

Programming Languages: Knowledge of and ability to code or program in one of python Java or a widely used coding language.

Communication: Excellent interpersonal skills along with superior verbal and written communication abilities.

This position is not ideal for a Hadoop developer.

This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager.

 


Qualifications :

Basic Qualifications
As a Staff Site Reliability Engineer you will be part of a team that maintains and
supports Visas Data Platform and provides support for key Big data Platforms. You will be responsible for driving innovation for our partners and clients
within Visa and globally. You will work on opensource Big Data clusters
ensuring their availability performance reliability and improving operational
efficiency.
Masters degree in Math Science Engineering or Computer Science
Information Systems or related field. OR
Bachelors degree in Math Science Engineering or Computer Science
Information Systems or related field AND minimum five (5) years of
experience in a directly related field. OR
Minimum five (5) plus years working on Hadoop systems.

Preferred Qualifications
The role involves performing Big Data SRE and Engineering activities on
multiple opensource platforms such as Hadoop Kafka HBase and Spark. The
candidate should possess strong troubleshooting and debugging skills.
Other responsibilities include effective root cause analysis of major production
incidents and the development of learning documentation. The person will
identify and implement highavailability solutions for services with a single
point of failure.
The role involves planning and performing capacity expansions and upgrades
in a timely manner to avoid any scaling issues and bugs. This includes
automating repetitive tasks to reduce manual effort and prevent human errors.
The successful candidate will tune alerting and set up observability to
proactively identify issues and performance problems. They will also work
closely with Level3 teams in reviewing new use cases and cluster hardening
techniques to build robust and reliable platforms.
The role involves creating standard operating procedure documents and
guidelines on effectively managing and utilizing the platforms. The person will
leverage DevOps tools disciplines (Incident problem and change
management) and standards in daytoday operations.
The individual will ensure that the Hadoop platform can effectively meet
performance and service level agreement requirements. They will also perform
security remediation automation and selfhealing as per the requirement.
The individual will concentrate on developing automations and reports to
minimize manual effort. This can be achieved through various automation
tools such as Shell scripting Ansible or Python scripting or by using any other
programming language.
 


Additional Information :

Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race color religion sex national origin sexual orientation gender identity disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.


Remote Work :

No


Employment Type :

Fulltime

Employment Type

Full-time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.