Who we are:
Fulcrum Digital is an agile and nextgeneration digital accelerating company providing digital transformation and technology services from ideation to implementation. These services have applicability across a variety of industries including banking & financial services insurance retail higher education food healthcare and manufacturing.
What you ll do:
Provide L2 support production systems like application database middleware components infrastructure and network components.
Manage production incidents endtoend within defined SLAs focusing on resolution rather than who caused it.
Interact with various stakeholders such as Release managers program leads service managers development and test leads.
Review operational readiness requirements such as monitoring and alerting log rotation and resilience of the components and report the gaps.
Provide preimplementation support with activities such as release notes review and implementation dry runs.
Protect production components by running health checks and monitoring latency and memory utilization.
Automate daytoday activities and propose changes that improve reliability.
Participate in CAB and provide feedback on change requests.
Support the DevOps team in testing the promote pipelines and suggest automation of configuration items.
Practice incident management best practices and perform RCA.
Participate in disaster recovery tests and operational acceptance tests.
Analyse the technology stack that makes up the product and optimize recovery time objective.
Work with team members spread across time zones.
Share knowledge document improvements and mentor junior resources.
It is good to have skill using Jenkins to orchestrate builds as well as link to Sonar Maven etc. to build out the CI/CD pipeline.
Support deployments of code into multiple lower environments. Supporting current processes needed with an emphasis on automating everything as soon as possible.
It is good to have skill to design Implement and enhance our deployment automation based on Chef. We need proven experience designing and implementing an overall release and deployment process.
It is good to have skill to design and implement a Git based code management strategy that will support multiple environment deployments in parallel. Experience with automation for Branch management code promotions and version management.
Engage in and improve the whole lifecycle of services from inception and design through deployment operation and refinement.
Tools:
Log Monitoring Tool Splunk
Application Monitoring tool Dynatrace
Ticketing incident/problem management tool Remedy
Devops Basics CICD Basics Overview of Git Bitbucket SonarQube Ansible/Chef
Requirements
Must Have:
- API/EG experience (Basic)
- Linux & Shell Scripting (Basic)
- ITIL / ITSM(Basic)
- PL/SQL(Basic)
- SQL(Basic)
- Troubleshooting(Basic)
- Nginx(Basic)
- Java / JEE development experience interested in operations
- EventDriven Architectures
- MQ or NATS broker or similar messaging solutions.
- Kafka
- Clientserver communication aspects sockets TLS protocol
- Understand the concept of region and AZs.
- Production Support Experience
- Deployments MTF/Prod Maintenance items (including stop/start Disaster Recoveryrelated activities etc.) CR for changes in MTF/Prod
Good To Have:
Jenkins CI/CD
Groovy Scripting/Yaml
Ansible/Chef
What you ll do: Provide L2 support production systems like applications, database, middleware components, infrastructure, and network components. Manage production incidents end-to-end within defined SLAs focusing on resolution rather than who caused it. Interact with various stakeholders such as Release managers, program leads, service managers, development and test leads. Review operational readiness requirements such as monitoring and alerting, log rotation, and resilience of the components and report the gaps. Provide pre-implementation support with activities such as release notes review and implementation dry runs. Protect production components by running health checks and monitoring latency and memory utilization. Automate day-to-day activities and propose changes that improve reliability. Participate in CAB and provide feedback on change requests. Support the DevOps team in testing the promote pipelines and suggest automation of configuration items. Practice incident management best practices and perform RCA. Participate in disaster recovery tests and operational acceptance tests. Analyse the technology stack that makes up the product and optimize recovery time objective. Work with team members spread across time zones. Share knowledge, document improvements, and mentor junior resources. It is good to have skill using Jenkins to orchestrate builds as well as link to Sonar, Maven, etc. to build out the CI/CD pipeline. Support deployments of code into multiple lower environments. Supporting current processes needed with an emphasis on automating everything as soon as possible. It is good to have skill to design, Implement, and enhance our deployment automation based on Chef. We need proven experience designing and implementing an overall release and deployment process. It is good to have skill to design and implement a Git based code management strategy that will support multiple environment deployments in parallel. Experience with automation for Branch management, code promotions, and version management. Engage in and improve the whole lifecycle of services from inception and design, through deployment, operation, and refinement. Tools: Log Monitoring Tool Splunk Application Monitoring tool Dynatrace Ticketing incident/problem management tool Remedy Dev-ops Basics - CI-CD Basics, Overview of Git, Bit-bucket, SonarQube, Ansible/Chef Must Have: API/EG experience (Basic) Linux & Shell Scripting (Basic) ITIL / ITSM(Basic) PL/SQL(Basic) SQL(Basic) Troubleshooting(Basic) Nginx(Basic) Java / JEE development experience interested in operations Event-Driven Architectures MQ or NATS broker or similar messaging solutions. Kafka Client-server communication aspects - sockets, TLS protocol Understand the concept of region and AZs. Production Support Experience Deployments MTF/Prod, Maintenance items (including stop/start, Disaster Recovery-related activities, etc.), CR for changes in MTF/Prod Good To Have: Jenkins - CI/CD Groovy Scripting/Yaml Ansible/Chef