Site Reliability Engineer Data Center (Level 3)
Atlanta, GA - USA
Job Summary
Plano, Texas
Alpharetta, Georgia
Mississauga, ON, Canada
Required Travel: No
Open to Relocation: No
Who are we
Amdocs helps the world's leading communications and media companies deliver exceptional customer experiences through reliable, efficient, and secure operations at scale. We provide software products and services that embed intelligence into how work runs across business, IT, and network domains, delivering measurable outcomes in customer experience, network performance, cloud modernization, and revenue growth. With our talented people and more than forty years of experience running mission-critical systems around the globe, Amdocs runs billions of transactions daily. Our technology is relied on every day, connecting people worldwide and advancing a more inclusive, connected world. Together, we help those who shape the future to make it amazing. Amdocs is listed on the NASDAQ Global Select Market (NASDAQ: DOX) and reported revenue of $4.53 billion in fiscal 2025. For more information visit
At Amdocs, our mission is to empower our employees to "Live Amazing, Do Amazing" every day. We believe in creating a workplace where you not only excel professionally but also thrive personally. Through our culture of making a real impact, fostering growth, embracing flexibility, and building connections, we enable our employees to live meaningful lives while making a difference in the world.
In one sentence
We are seeking an experienced Site Reliability Engineer (SRE) to join our Data Center Engineering team at Level 3. This role requires a technically strong and operationally mature engineer who will help design, scale, and maintain the reliability of our physical and virtual data center infrastructure. As a Level 3 SRE, you will be a technical leader responsible for ensuring system uptime, optimizing capacity and performance, and contributing to long-term infrastructure resiliency.
What will your job look like
Design, implement, and maintain PostgreSQL databases, including schema design, indexing strategies, query optimization, logical/physical replication, hot standby, failover, and load balancing.
Develop and execute backup and recovery strategies, including pg_dump, pg_basebackup, WAL archiving, point-in-time recovery (PITR), and disaster recovery planning.
Monitor and optimize database performance, resource utilization, and storage growth using pg_stat_statements, EXPLAIN ANALYZE, pg_top, and Prometheus/Grafana dashboards; proactively troubleshoot performance bottlenecks.
Ensure database security through role-based access control (RBAC), audit logging with pgaudit, and compliance with regulatory standards.
Implement high availability (HA) and disaster recovery (DR) solutions using Patroni, streaming replication, synchronous/asynchronous replication, and failover orchestration.
Plan and execute database version upgrades and apply security or performance patches with minimal downtime, ensuring data integrity and performing compatibility checks.
Collaborate with application teams, BI developers, and ETL engineers to support data pipelines, optimizing queries and workflow performance.
Implement monitoring and alerting solutions using Prometheus, Grafana, Zabbix, or Nagios to track database health, query latency, and resource usage.
Manage database user accounts, roles, and privileges to enforce security policies and regulatory compliance, including sudo/OS-level permissions for critical operations.
Conduct capacity planning, workload forecasting, and index/partition tuning to handle anticipated growth and high-concurrency workloads.
Automate database maintenance tasks using Python, Bash, or Ansible scripts, including schema migrations, routine checks, and patch deployment.
Document procedures, configurations, operational runbooks, and PostgreSQL best practices for team knowledge sharing.
Mentor and guide team members on PostgreSQL internals, replication setups, and performance tuning techniques.
Evaluate and recommend new database tools, extensions (like TimescaleDB and pg_stat_statements), and best practices to improve efficiency, scalability, and resilience.
All you need is...
Bachelor's degree in Computer Engineering, Electrical Engineering, Information Technology, or a related technical field.
4-7 years of experience in database administration and operations.
Experience participating in or leading incident response and postmortem analysis processes.
Previous exposure to hybrid environments integrating on-premise data centers with public or private cloud platforms is desirable.
Experienced PostgreSQL Database Administrator managing production and non-production PostgreSQL environments.
Skilled in backup and recovery, replication, performance tuning, and high availability.
Proven ability to troubleshoot critical issues, automate DBA tasks, and ensure database reliability.
4 years of hands-on PostgreSQL administration experience.
Strong SQL and PL/pgSQL expertise; experience with database optimization and indexing.
Hands-on experience with backup, recovery, and HA solutions.
Strong proficiency in Linux environments, particularly Debian.
Proficiency in scripting for database automation.
Excellent analytical, problem-solving, and troubleshooting skills.
Strong communication skills for cross-team collaboration.
Understanding of Oracle and MySQL databases is a plus, but not mandatory.
Why you will love this job:
You will have many opportunities to grow through professional development.
We are a dynamic, multi-cultural organization that constantly innovates and empowers our employees to grow. Our people are passionate, daring, and phenomenal teammates who stand by each other with a dedication to creating a diverse, inclusive workplace!
We offer a wide range of stellar benefits, including health, dental, vision, and life insurance, as well as paid time off, sick time, and parental leave!
Required Experience:
IC
About Company
Amdocs is a leading software and services provider to communications and media companies of all sizes, accelerating the industry’s dynamic and continuous digital transformation. With a rich set of innovative solutions, long-term business relationships with 350 communications and medi ...