Site Reliability Engineer (SRE/DevOps) - Engineering Productivity

Job Location

Bengaluru - India

Monthly Salary

Not Disclosed

Vacancy

1 Vacancy

Job Description

Who You'll Work With

Arista Networks is looking for a skilled professional for our Engineering Productivity (EngProd) team to help maintain and support our rapidly expanding infrastructure and internal user base. The ideal candidate can wear many hats, is versatile, and is enthusiastic about learning new technologies. As part of the software engineering team, you will work with other team members to design, build, and administer secure, scalable, and fault-tolerant tools and infrastructure in a hybrid cloud environment.

Working in the EngProd group, you will collaborate with other engineers to design, build, scale, and operate the systems used by Arista's product development teams. These systems are based on industry standards including Ansible, Artifactory, Gerrit, Jenkins, Kubernetes, Grafana, Spinnaker, MySQL, Elasticsearch, Google Cloud, Varnish, and Perforce, on third-party storage appliances, and on internal systems developed from the ground up to automate CI/CD, testing, analysis, and visualization.

What You'll Do

  • Build, deploy (safely and incrementally), and operate critical production systems with a focus on scalability, reliability, observability, performance, and security.
  • Monitor, support, and enhance developer experience across services.
  • Build automation to remove toil and efficiently operate production systems.
  • Proactively monitor, respond to, and enhance alerts, and set up automated alert handling.
  • Create and maintain the incident response runbooks.
  • Build and deploy new systems with scalability, reliability, and observability as primary requirements.
  • Triage platform and infrastructure issues and help Arista software engineers with their own triage. Engage with third-party vendor support.
  • Deploy new systems in a staged manner.
  • Write postmortem documents and build solutions to prevent incidents from recurring.
  • Plan and communicate maintenance windows on production systems.
  • Work with Arista's product development teams to identify infrastructure issues that cause bottlenecks and limitations in their workflows. Design and implement solutions to resolve them.
  • Survey and adopt best practices around infrastructure and platforms to maintain secure, scalable, and fault-tolerant systems.
  • Implement solutions to scale the systems.
  • Implement fault-tolerance and performance improvements to increase the availability of the systems.
  • Study the design of OSS systems, and enough of their implementation details, to triage issues and resolve them more effectively.

 


    Qualifications :

    Essential to have all of the following skills

    • At least a BSc in Computer Science or Engineering with 5 years of experience, an MS in Computer Science or Engineering with 5 years of experience, or equivalent work experience.
    • Knowledge of one or more of Go, Python, or shell scripting, sufficient to implement medium-complexity automation workflows.
    • Knowledge of Linux (or UNIX) from an administration and debugging perspective.
    • Hands-on experience operating software systems (infrastructure, complex applications, etc.) at scale.
    • Experience in server provisioning (especially from a storage and networking perspective).
    • Strong problem-solving and software troubleshooting skills.
    • Experience with infrastructure-as-code.

    Desirable to have one or more of the following skills

    • Experience managing databases (MariaDB, Postgres, MongoDB, etc.)
    • Experience with Docker and virtualization technologies (KVM, QEMU, Kata Containers, etc.)
    • Experience managing a monitoring stack (Prometheus, Loki, Tempo, InfluxDB, Grafana, Thanos, etc.)
    • Experience managing Elasticsearch clusters
    • Experience managing Artifactory, Docker registries, etc.
    • Experience managing CI/CD systems like ArgoCD, Spinnaker, etc.
    • Experience managing version control systems like Perforce, Gerrit, etc.
    • Experience with infrastructure-as-code frameworks like Ansible
    • Experience managing large Java applications
    • Experience in storage infrastructure management (e.g., NAS, SAN, Ceph)


    Additional Information :

    Arista stands out as an engineering-centric company. Our leaders, including founders and engineering managers, are all engineers who understand sound software engineering principles and the importance of doing things right.

    We hire globally into our diverse team. At Arista, engineers have complete ownership of their projects. Our management structure is flat and streamlined, and software engineering is led by those who understand it best. We prioritize the development and utilization of test automation tools.

    Our engineers have access to every part of the company, providing opportunities to work across various domains. Arista is headquartered in Santa Clara, California, with development offices in Australia, Canada, India, Ireland, and the US. We consider all our R&D centers equal in stature.

    Join us to shape the future of networking and be part of a culture that values invention, quality, respect, and fun.


    Remote Work :

    Yes


    Employment Type :

    Full-time

    Department / Functional Area

    Software Engineering
