Site Reliability Engineer (FLEEM)


Job Location:

Pune - India

Monthly Salary: Not Disclosed
Posted on: 10 days ago
Vacancies: 1 Vacancy

Job Summary

Summary:

  • Implements and operates monitoring logging alerting and dashboards.
  • Works closely with developers on build & run topics.
  • Monitors and coordinates security-related topics (vulnerabilities findings) and stability-related topics (query patterns indexing).
  • First responder for production incidents.

Mandatory Skills (in order of importance):

  1. Monitoring & alerting (CloudWatch logs metrics dashboards alarms)
  2. Distributed tracing (AWS X-Ray Lambda Insights)
  3. Incident management (root-cause analysis runbook authoring)
  4. Database performance analysis (MongoDB PostgreSQL)
  5. Security operations ()
  6. AWS services (Lambda S3 SQS IAM VPC)
  7. Bash / shell scripting & automation

Advantageous Skills:

  • GitHub Actions AWS CodePipeline (CI/CD understanding)
  • Docker ECS Fargate (container health troubleshooting)
  • AWS Cost Explorer resource tagging
  • Release coordination (hotfix processes rollback procedures)
  • BMW (Integrate) platform operational knowledge
  • TypeScript (reading application code for


Required Experience:

Manager

Summary:Implements and operates monitoring logging alerting and dashboards.Works closely with developers on build & run topics.Monitors and coordinates security-related topics (vulnerabilities findings) and stability-related topics (query patterns indexing).First responder for production incidents.M...