SRE

TalentOla

Not Interested
Bookmark
Report This Job

profile Job Location:

Bengaluru - India

profile Monthly Salary: Not Disclosed
Posted on: 5 hours ago
Vacancies: 1 Vacancy

Job Summary

Job Overview
As an SRE Lead you will be responsible for leading a team of Site Reliability Engineers to ensure the highest level of system and infrastructure availability. You will be tasked with developing and implementing strategies for service scalability reliability and efficiency. Your role will involve troubleshooting and resolving system issues managing incident responses and working closely with cross-functional teams to improve the overall system performance. Your leadership and technical expertise will be crucial in maintaining the stability and resilience of our systems thereby ensuring a seamless experience for our users.
Responsibilities
  • Lead the SRE team in designing building and maintaining the companys large-scale complex systems.
  • Collaborate with the development team to ensure system reliability efficiency and performance.
  • Develop and implement automation strategies to improve the scalability and reliability of systems.
  • Monitor system performance troubleshoot issues and conduct root cause analysis to prevent recurrence.
Required Skills
  • Proficiency in programming languages such as Python Java or Go.
  • Strong understanding of cloud computing platforms like AWS Google Cloud or Azure.
  • Expertise in system design system management and storage systems.
  • The candidate must have a Bachelors degree in Computer Science Information Technology or a related field. A Masters degree or relevant certifications would be advantageous.
Preferred Skills
  • Familiarity with containerization technologies like Docker and Kubernetes.
  • Experience with CI/CD pipelines and tools such as Jenkins GitLab or CircleCI.
  • Knowledge of monitoring tools like Prometheus Grafana or New Relic.
  • Understanding of database technologies like SQL NoSQL or MongoDB.
  • Experience with configuration management tools like Ansible Chef or Puppet.
  • Knowledge of network protocols IP networking VPNs DNS load balancing and firewalling.
  • Familiarity with Agile methodologies and Scrum framework.
  • Experience with disaster recovery planning and execution.
  • Understanding of ITIL processes.
  • Excellent problem-solving and analytical skills.
Job Overview As an SRE Lead you will be responsible for leading a team of Site Reliability Engineers to ensure the highest level of system and infrastructure availability. You will be tasked with developing and implementing strategies for service scalability reliability and efficiency. Your role...
View more view more