Deploy relevant monitoring tools to monitor CI-CD at run time
Monitor and report on a set of defined KPIs to assess platform health and service level, and act proactively when issues arise
Handle incident and change management
Work closely with software development teams
Proactively detect, debug, and resolve various CI issues
Implement solutions to prevent recurrence of detected issues
Train the development team on CI-CD usage
Support and maintain our cloud infrastructure.
Continuously monitor systems to ensure stability, reliability, proper logging, and auditability.
Be responsible for the planning, implementation, and growth of the AWS cloud infrastructure.
Work closely with engineers, test engineers, product owners, and managers to understand requirements, and to implement and adhere to best practices and standards for cloud infrastructure, networking, and security.
Measure, optimize, and tune system performance, and ensure that systems run reliably and securely and are highly available in a 24/7 production environment.
Define, implement and detail operational processes and procedures, with periodic review for efficiency and improvement.
Prioritize operational deliverables with production teams.
Coordinate the technical resolution of major incidents and production issues, and help devise hotfixes and resolutions.
Troubleshoot the system and solve problems across all platform and application domains.
Execute on improvement efforts in information security and privacy.
Continually assess the infrastructure for vulnerabilities and suggest de-risking efforts.
Stay updated with the latest cloud offerings and technologies.
Job Requirements
Technical Requirements:
Advanced experience with the CI-CD toolchain (Jenkins, Git, Gerrit, GCC, Nexus, etc.) is a must.
Advanced experience in operating system administration (Windows, Linux) is a must
Expertise in cloud technologies: AWS is a must; Google Cloud and Azure are a plus
Strong experience with AWS (EC2, EKS, ECS, S3, Route 53, ElastiCache, Elasticsearch, RDS, SQS, SNS, and others) and experience implementing standard methodologies.
Good experience in container tools (Docker is a must, LXC is a plus)
Knowledge of container orchestration tools (e.g. Kubernetes) is a plus
Experience in Performance Monitoring tools (Prometheus, Grafana)
Good knowledge of one or more scripting languages (Java, .NET, Perl, Shell, etc.).
Good understanding of Networking Concepts (DNS, HTTP/HTTPS, SSH, etc.)
Good understanding of OS Concepts (Process Management, I/O Management, Service Management, File Systems, Virtualization, Memory and Storage)
Knowledge of Infrastructure as Code concepts and tools (Chef, Ansible, Puppet, etc.) is a plus
Soft Skills
High level of autonomy and responsibility
Excellent troubleshooting and debugging skills
Strong analytical, critical thinking, root-cause analysis and problem-solving skills