Salary Not Disclosed
1 Vacancy
Job Responsibilities:
Alarming Architecture:
Ensure the network and services are monitored (auto-ticketed) so that tickets are created properly.
This includes alarm correlation rule definition, alarm enhancement/enrichment, etc.
Understanding of MIBs, SNMP traps, and OIDs.
Work with users to strike the right balance between creating too few and too many tickets.
Automation Development:
Design and develop automation scripts and tools using languages such as Python and frameworks like Ansible to automate network configuration, provisioning, monitoring, and management tasks.
Network Configuration and Observability Management:
Implement automated solutions for network configuration, monitoring, observability, and troubleshooting to ensure high availability, reliability, and deep insight into network health and behavior. Integrate telemetry data collection into automation workflows.
Monitoring and Observability Implementation:
Design, deploy, and maintain observability solutions, including metrics collection, log aggregation, and distributed tracing, using tools such as Prometheus, Grafana, the ELK stack, and OpenTelemetry. Enable proactive detection of anomalies, trends, and performance degradation across the network infrastructure.
Production Support:
Provide production support for alarming, automation, and AIOps platforms. Be available to respond to after-hours support requests in case of emergency.
Collaboration with Cross-Functional Teams:
Work closely with network engineering, DevOps, SRE, and system administration teams to identify automation and observability opportunities and integrate solutions into existing infrastructure and CI/CD pipelines.
Security and Compliance:
Ensure all automation and observability solutions comply with relevant security standards and best practices. Implement automated compliance checks and anomaly-based threat detection through monitoring pipelines.
Monitoring and Troubleshooting:
Use automated tools to continuously monitor network performance, collect and analyze telemetry data, and implement self-healing or alert-based solutions to maintain optimal functionality and service reliability.
Documentation and Training:
Maintain comprehensive documentation of automation and observability processes, dashboards, and metrics. Provide training to team members on observability tools, telemetry practices, and automation best practices.
Qualifications:
Bachelor's degree in Computer Science, Network Engineering, or a related field.
Minimum of eight (8) years of proven experience in network engineering with a focus on automation, observability, and orchestration.
Strong network and telecom background.
Proficiency in programming languages such as Python.
Experience with network automation tools.
Experience with monitoring/observability tools such as Grafana.
Familiarity with telemetry protocols and observability frameworks (e.g., SNMP, NetFlow, sFlow, OpenTelemetry).
Working knowledge of alarm collection systems such as Netcool, Federos, Icinga, and CA Spectrum.
Working knowledge of AIOps platforms such as GROK, BigPanda, and ServiceNow.
Strong understanding of networking protocols and architectures (e.g., TCP/IP, BGP, OSPF).
Familiarity with cloud technologies and microservices architecture.
Excellent problem-solving abilities.
Strong communication and collaboration skills.
Ability to work independently and manage multiple tasks effectively.
Preferred Qualifications:
Experience with Network OS: Familiarity with network operating systems such as JunOS or Arista EOS.
Cloud Networking: Experience in designing, implementing, and managing virtual networks within cloud platforms.
Cybersecurity Knowledge: Understanding of network security principles and practices, including automated threat detection and mitigation.
Observability Best Practices: Familiarity with distributed systems observability, SLO/SLI definitions, and real-time alerting strategies.
Cloud BC Labs Inc is a digital transformation organization aimed at creating seamless solutions for clients to effectively manage their business operations. The company specializes in Business and Management Consulting, AI/ML, Data Analytics & Visualization, Cloud Data Warehouse Migration, Snowflake Implementation, Informatica Implementation & Upgrade, Staffing Services, and Data Management Solutions.
Full-time