Site Reliability Engineer

Employer Active

1 Vacancy

Job Location

Atlanta, GA - USA

Monthly Salary

Not Disclosed

Job Description

Position: Site Reliability Engineer (SRE), Telecom Domain
Location: Atlanta, GA (preferred) or remote

We are seeking a highly skilled Site Reliability Engineer (SRE) to join our telecom technology team. The SRE will be responsible for ensuring the reliability, availability, scalability, and performance of mission-critical telecom platforms and services. The role blends software engineering, systems engineering, and operations, with a focus on automation, monitoring, and continuous improvement.
Key Responsibilities
Reliability & Uptime: Ensure high availability and performance of telecom network platforms, OSS/BSS applications, and customer-facing services.
Automation: Develop automation frameworks for deployment, monitoring, incident response, and capacity management.
Monitoring & Alerting: Design, implement, and maintain end-to-end observability solutions (logs, metrics, traces, alerts).
Incident Management: Lead incident response, root cause analysis, and postmortems for outages and performance degradation.
Performance Optimization: Identify bottlenecks in applications, networks, and infrastructure, and optimize them for efficiency.
Capacity Planning: Forecast resource utilization, ensure proactive scaling, and support telecom-grade SLAs.
Collaboration: Work closely with software development, DevOps, and telecom network engineering teams to improve service delivery.
Continuous Improvement: Implement SRE best practices (error budgets, SLIs, SLOs) aligned with telecom standards.
Security & Compliance: Ensure systems are hardened, compliant with telecom regulatory frameworks (e.g., TRAI, GDPR, FCC), and aligned with data privacy/security best practices.
Required Skills & Experience
Education: Bachelor's or Master's degree in Computer Science, Telecommunications, Information Technology, or a related field.
Experience: 3-7 years in SRE/DevOps/Systems Engineering roles (telecom domain experience preferred).

Technical Skills:
Strong knowledge of Linux/Unix system administration and network protocols (TCP/IP, DNS, VoIP, SIP, SS7, Diameter, 5G Core).
Hands-on experience with cloud platforms (AWS, Azure, GCP, or private telco clouds like OpenStack/VMware).
Proficiency in automation & configuration tools (Ansible, Terraform, Chef, Puppet).
CI/CD tools and pipelines (Jenkins, GitLab CI/CD, ArgoCD).
Monitoring/observability tools (Prometheus, Grafana, ELK/EFK, Splunk, OpenTelemetry, Nagios, Zabbix).
Programming/scripting in Python, Go, Shell, or Java.
Telecom OSS/BSS systems knowledge, API integrations, and microservices architecture.
Kubernetes and containerization (Docker, Helm, and service meshes such as Istio/Linkerd).
Soft Skills:
Strong problem-solving and troubleshooting skills.
Ability to handle high-pressure incident scenarios.
Effective communicator with cross-functional teams.

Note: Momento USA is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.

Employment Type

Full-time
