SRE - Integrated Order Management

Employer active

1 vacancy

Experience

5 years

Job location

Cairo - Egypt

Monthly salary

Not disclosed

Number of vacancies

1 vacancy

Job description

Our client is looking for an SRE for Integrated Order Management (IOM) who will be responsible for
maintaining and improving the reliability, availability, performance, and scalability of the IOM platform
and the services it provides. This includes designing, implementing, and maintaining the infrastructure,
tools, and processes necessary to support platform operations. The SRE will work closely with development
and operations teams from the core IOM platform, based on SAP S/4 Order To Cash (OTC, formerly SD), as well
as cross-platform teams covering various sales order entry channels and integrations, to ensure that the
platform is scalable, secure, and highly available. The SRE will also be responsible for monitoring the
platform, identifying and resolving complex issues, and implementing improvements to prevent future
incidents. The SRE is expected to have a strong understanding of SAP S/4 OTC configuration and systems
operations, experience with highly integrated order management landscapes, DevOps practices, and cloud
infrastructure, and a focus on automating operations and resolving complex technical challenges.

Key Responsibilities:
Availability & Performance Monitoring:
  • Monitor the health, availability, and performance of product systems to ensure services meet defined service-level objectives (SLOs) and service-level agreements (SLAs).
  • Work on system reliability engineering by designing and implementing systems that are resilient, fault-tolerant, and capable of withstanding both expected and unexpected failures.
  • Proactively identify, troubleshoot, and resolve issues affecting the stability of the platform and its integrations.
  • Implement and maintain monitoring and alerting systems to provide visibility into performance, application health, and user experience.
  • Leverage observability tools (e.g., Azure Monitor, Application Insights, Power Platform Monitor, Dynatrace) to enhance real-time visibility.

Problem Management & Root Cause Analysis:
  • Lead troubleshooting efforts for complex issues (problems), diagnose root causes, and drive improvements to prevent recurrence in collaboration with Development teams and Product Manager partners. Ensure proactive management of alerts and participate in crisis management teams as required.
  • Perform root cause analyses with the relevant Development & Operations teams to identify and resolve recurring issues, documenting findings and driving systemic changes to prevent future failures. Lead post-incident reviews to document learnings and work with teams to implement long-term fixes.

Automation & Continuous Improvement:
  • Implement processes and practices that continuously improve the reliability, performance, and scalability of the system.
  • Develop and maintain automations for repetitive tasks to increase efficiency, reduce manual effort, and improve response times.
  • Collaborate with DevOps teams, ensuring that releases follow testing and quality assurance standards.
  • Ensure the platform can scale efficiently to meet growing user demand, and plan for future capacity needs.

Collaboration & Cross-Functional Coordination:
  • Collaborate with product development teams, cloud operations, integration, and other platform teams to define reliability requirements and prioritize improvement initiatives.
  • Partner with the security team to implement best practices and maintain compliance within the environment.
  • Provide training and support to team members and users on incident response protocols, monitoring tools, and troubleshooting techniques.
  • Share insights and best practices with other SREs to foster knowledge sharing and improve reliability across the organization.


Requirements

  • Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field.
  • 5 years of experience in Site Reliability Engineering, DevOps, or IT operations, with a focus on application reliability and observability.
  • 3 years of experience with SAP S/4 OTC/SD, D365 CRM, and order management interfaces within B2B or B2C, preferably in the FMCG industry, including an understanding of their architecture, configuration, and integrations.
  • Hands-on experience with systems monitoring, alerting, and observability tools, especially within the Microsoft ecosystem (e.g. xx-xx-xx).
  • Strong knowledge of cloud infrastructure, data structures, and algorithms.
  • Experience with SAP TMS and Azure DevOps CI/CD pipelines (ALM).
  • Excellent troubleshooting and problem-solving skills, with a focus on improving long-term reliability and efficiency.
  • Strong communication and collaboration skills, with the ability to work effectively across cross-functional teams. Fluent in written and verbal English.
  • Detail-oriented and capable of managing multiple priorities in a fast-paced environment.

Employment type

Full-time
