drjobs SITE reliability engineer

SITE reliability engineer

Employer Active

1 Vacancy
The job posting is outdated and position may be filled
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Jobs by Experience drjobs

4-5years

Job Location drjobs

Delhi - India

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Job Description

Incorporate various software engineering aspects to develop and implement services that improve IT and support teams. Services can range from production code changes to alerting and monitoring adjustments.

  • Detect issues.
  • Automatically handle failures.
  • Prepare disaster recovery plans.
  • Keep the system up and reliable.
  • Mitigate broken systems and prevent them from causing future disruptions.

Key Responsibilities:

  • System Reliability: Ensure the availability performance and scalability of microservices
    running in production.
  • Monitoring & Alerting: Implement and manage monitoring tools (e.g. Prometheus Grafana orDatadog) to detect and address issues proactively.
  • Automation: Develop automation tools for deployment scaling and incident management to reduce manual intervention and improve efficiency.
  • Incident Response: Investigate troubleshoot and resolve production incidents promptly and lead postmortems to prevent recurrence.
  • Infrastructure Management: Manage containerized workloads using Kubernetes Docker or other orchestration platforms.
  • Performance Optimization: Identify bottlenecks in the system and implement solutions to enhance performance and reduce latency.
  • CI/CD Pipelines: Design and maintain continuous integration and deployment pipelines for seamless code delivery.
  • Capacity Planning: Conduct capacity planning to ensure systems can handle growth and scale efficiently.
  • Collaboration: Work closely with development teams to design systems that are resilient and aligned with SRE best practices.
  • Security Compliance: Implement and maintain security measures across infrastructure to protect against vulnerabilities.

Required Skills:

Experience:

  • 3 years of experience as an SRE DevOps Engineer or related role.
  • Handson experience with microservices architecture.
  • Technical Expertise: Strong programming skills in languages such as Shell Script Python NodeJs etc.
  • Proficiency in containerization technologies like Docker and orchestration tools like Kubernetes.
  • Familiarity with eventdriven architectures and message brokers (e.g. Kafka RabbitMQ).
  • Experience with cloud platforms (AWS Azure) and Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
  • Deep understanding of networking load balancing and distributed systems.
  • Proficiency with monitoring and logging tools (e.g. ELK stack or New Relic).
  • Experience with database systems including SQL and NoSQL solutions.
  • Knowledge of compliance standards such as ISO 27001 SOC 2 or GDPR.
  • Soft Skills: Strong problemsolving skills and the ability to work under pressure.
  • Excellent communication and collaboration abilities to work across teams.
  • A proactive mindset focused on continuous improvement.


SRE, DevOps Engineer, Azure

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.