Role: Site Reliability Engineer
Exp.: 3
Position Type: Contract
Location: Mumbai, Maharashtra, India
Mandatory Skills: IT Operations Management
JOB DESCRIPTION
Role Purpose
Required Skills:
Experience in system administration, application development, infrastructure development, or related areas
Experience programming in languages such as JavaScript, Python, PHP, Go, Java, or Ruby, including reading, understanding, and writing code in them
Mastery of infrastructure automation technologies (such as Terraform, CodeDeploy, Puppet, Ansible, Chef)
Expertise in container/container-fleet orchestration technologies (such as Kubernetes, OpenShift, AKS, EKS, Docker, Vagrant, etcd, ZooKeeper)
Cloud- and container-native Linux administration, build, and management skills
Key Responsibilities:
Hands-on design, analysis, development, and troubleshooting of highly distributed, large-scale production systems and event-driven, cloud-based services
Primarily Linux administration: managing a fleet of Linux and Windows VMs as part of the application solutions
Participate in pull requests for site reliability goals
Advocate IaC (Infrastructure as Code) and CaC (Configuration as Code) practices within Honeywell HCE
Ownership of reliability, uptime, system security, cost, operations, capacity, and performance analysis
Monitor and report on service level objectives for a given application's services. Work with the business, technology teams, and product owners to establish key service level indicators.
Ensure the repeatability, traceability, and transparency of our infrastructure automation
Support on-call rotations for operational duties that have not been addressed with automation
Support healthy software development practices, including complying with the chosen software development methodology (Agile or alternatives) and building standards for code reviews, work packaging, etc.
Create and maintain monitoring technologies and processes that improve visibility into our applications' performance and business metrics and keep operational workload in check.
Partner with security engineers to develop plans and automation that respond aggressively and safely to new risks and vulnerabilities.
Develop, communicate, collaborate on, and monitor standard processes to promote the long-term health and sustainability of operational development tasks.
system administration,java,python,php,ruby,kubernetes,docker