Im Shikha Dixit Lead Recruiter at Siri Info Solutions. Ive been reviewing your background in SRE DevOps AWS Service Now CloudWatch MongoDB New Relic and your profile stands out as an excellent match for a Sr. SRE Engineer opening I have in Irving TX. This is a high-priority 5-day onsite role with one of our key clients. Given your experience Id love to discuss the details and see if this aligns with your career goals. Are you open to a quick 5-minute chat
Role: Sr. SRE Engineer (W2)
Duration: 6 months
Location: Irving TX (5-day onsite)
Job Description:
Overview:
Client is seeking a highly skilled Site Reliability Engineer (SRE) to join our Production Support team. This role is responsible for ensuring the reliability performance and stability of our production systems across AWS MongoDB and related application services. The ideal candidate has strong operational instincts deep troubleshooting skills and a passion for building resilient systems.
Responsibilities:
Production Support & Incident Management
Serve as a primary responder for production incidents ensuring rapid triage mitigation and resolution.
Lead root cause analysis (RCA) and drive long term corrective actions.
Maintain and improve incident response processes runbooks and escalation paths.
Collaborate with engineering QA and product teams to prevent recurrence of issues.
AWS Infrastructure Operations
Support and optimize AWS services such as EC2 ECS/EKS Lambda S3 CloudWatch IAM RDS and VPC networking.
Monitor system health performance and capacity across cloud environments.
Implement infrastructure best practices around reliability scalability and cost efficiency.
Assist with deployments environment configuration and CI/CD pipelines.
Database & Storage Support
Manage and troubleshoot MongoDB clusters including performance tuning replication backups and failover.
Diagnose query performance issues and collaborate with developers on schema optimization.
Ensure data integrity availability and recovery readiness.
Monitoring Observability & Alerting
Use New Relic CloudWatch and other observability tools to monitor application and infrastructure performance.
Build dashboards alerts and telemetry that provide actionable insights.
Continuously refine monitoring thresholds to reduce noise and improve signal quality
Must-Have:
Experience with on-call rotations and 24/7 production environments.
Work cross-functionally with the various teams in the organization and help establish SLOs and achieve those SLOs.
5 years of experience in SRE DevOps Production Support or similar operational roles.
Strong hands-on experience with AWS services and cloud-native architectures.
Proficiency with MongoDB administration and troubleshooting.
Experience with New Relic or similar APM/observability platforms.
Experience using additional tools like Postman Intune and Firebase Service Now Cloudwatch.
Strong understanding of Linux systems networking and distributed systems.
Solid scripting skills (Python Bash or similar).
5 years Monitoring and Alarming in all environments and familiar with tools like Mongo Charts New Relic Cloudwatch Service Now.
Proven experience managing high-severity incidents and driving RCA processes.
Familiarity with CI/CD tools (Jenkins GitHub Actions GitLab CI etc.).
Additional:
Experience with container orchestration (ECS EKS Kubernetes).
Knowledge of message queues (Kafka SQS RabbitMQ).
Exposure to microservices architectures.
Certifications such as AWS Solutions Architect AWS SysOps or MongoDB DBA.
Working experience with IoT devices and Microsoft Intune.
Hi Hope you are doing well! Im Shikha Dixit Lead Recruiter at Siri Info Solutions. Ive been reviewing your background in SRE DevOps AWS Service Now CloudWatch MongoDB New Relic and your profile stands out as an excellent match for a Sr. SRE Engineer opening I have in Irving TX. This is a high-pr...
Hi
Hope you are doing well!
Im Shikha Dixit Lead Recruiter at Siri Info Solutions. Ive been reviewing your background in SRE DevOps AWS Service Now CloudWatch MongoDB New Relic and your profile stands out as an excellent match for a Sr. SRE Engineer opening I have in Irving TX. This is a high-priority 5-day onsite role with one of our key clients. Given your experience Id love to discuss the details and see if this aligns with your career goals. Are you open to a quick 5-minute chat
Role: Sr. SRE Engineer (W2)
Duration: 6 months
Location: Irving TX (5-day onsite)
Job Description:
Overview:
Client is seeking a highly skilled Site Reliability Engineer (SRE) to join our Production Support team. This role is responsible for ensuring the reliability performance and stability of our production systems across AWS MongoDB and related application services. The ideal candidate has strong operational instincts deep troubleshooting skills and a passion for building resilient systems.
Responsibilities:
Production Support & Incident Management
Serve as a primary responder for production incidents ensuring rapid triage mitigation and resolution.
Lead root cause analysis (RCA) and drive long term corrective actions.
Maintain and improve incident response processes runbooks and escalation paths.
Collaborate with engineering QA and product teams to prevent recurrence of issues.
AWS Infrastructure Operations
Support and optimize AWS services such as EC2 ECS/EKS Lambda S3 CloudWatch IAM RDS and VPC networking.
Monitor system health performance and capacity across cloud environments.
Implement infrastructure best practices around reliability scalability and cost efficiency.
Assist with deployments environment configuration and CI/CD pipelines.
Database & Storage Support
Manage and troubleshoot MongoDB clusters including performance tuning replication backups and failover.
Diagnose query performance issues and collaborate with developers on schema optimization.
Ensure data integrity availability and recovery readiness.
Monitoring Observability & Alerting
Use New Relic CloudWatch and other observability tools to monitor application and infrastructure performance.
Build dashboards alerts and telemetry that provide actionable insights.
Continuously refine monitoring thresholds to reduce noise and improve signal quality
Must-Have:
Experience with on-call rotations and 24/7 production environments.
Work cross-functionally with the various teams in the organization and help establish SLOs and achieve those SLOs.
5 years of experience in SRE DevOps Production Support or similar operational roles.
Strong hands-on experience with AWS services and cloud-native architectures.
Proficiency with MongoDB administration and troubleshooting.
Experience with New Relic or similar APM/observability platforms.
Experience using additional tools like Postman Intune and Firebase Service Now Cloudwatch.
Strong understanding of Linux systems networking and distributed systems.
Solid scripting skills (Python Bash or similar).
5 years Monitoring and Alarming in all environments and familiar with tools like Mongo Charts New Relic Cloudwatch Service Now.
Proven experience managing high-severity incidents and driving RCA processes.
Familiarity with CI/CD tools (Jenkins GitHub Actions GitLab CI etc.).
Additional:
Experience with container orchestration (ECS EKS Kubernetes).
Knowledge of message queues (Kafka SQS RabbitMQ).
Exposure to microservices architectures.
Certifications such as AWS Solutions Architect AWS SysOps or MongoDB DBA.
Working experience with IoT devices and Microsoft Intune.