About Brillio:
Brillio is one of the fastest growing digital technology service providers and a partner of choice for many Fortune 1000 companies seeking to turn disruption into a competitive advantage through innovative digital renowned for its worldclass professionals referred to as Brillians distinguishes itself through their capacity to seamlessly integrate cuttingedge digital and design thinking skills with an unwavering dedication to client satisfaction.
Brillio takes pride in its status as an employer of choice consistently attracting the most exceptional and talented individuals due to its unwavering emphasis on contemporary groundbreaking technologies and exclusive digital projects. Brillios relentless commitment to providing an exceptional experience to its Brillians and nurturing their full potential consistently garners them the Great Place to Work certification year after year.
System Engineer
Responsibilities:
Design and develop robust observability solutions to monitor analyze and troubleshoot distributed systems.
Familiar with OTEL standards and tools.
Previous experience working with application teams to implement selfhealing i.e. alerting that triggers automated remediation.
Implement and configure monitoring logging tracing and alerting systems to ensure comprehensive coverage of our infrastructure and applications.
Collaborate with software engineers to instrument code for telemetry data collection and analysis.
Optimize observability tooling and processes to improve system reliability performance and scalability.
Create dashboards reports and visualizations to provide actionable insights into system health and performance.
Investigate and resolve incidents by analyzing telemetry data and identifying root causes.
Stay current with industry trends and best practices in observability and recommend improvements to our observability strategy and infrastructure.
Qualifications:
Bachelors degree in computer science Engineering or a related field (or equivalent experience).
23 years experience as an Observability Engineer or a similar role in a production environment.
Deep understanding of observability principles methodologies and tools such as Prometheus Grafana Jaeger ELK stack etc.
Proficiency in programming/scripting languages like Java Python Go or similar for automation and tooling development.
Strong knowledge of cloud computing platforms (AWS preferred) and container orchestration systems (e.g. Kubernetes).
Excellent problemsolving skills and the ability to troubleshoot complex issues in distributed systems.
Strong communication skills and the ability to collaborate effectively with crossfunctional teams.
Why should you apply for this role
As Brillio continues to gain momentum as a trusted partner for our clients in their digital transformation journey we strive to set new benchmarks for speed and value creation. The DI team at Brillio is at the forefront of leading this charge by reimagining and executing how we structure sell and deliver our services to better serve our clients.
Brillio is an equal opportunity employer to all regardless of age ancestry colour disability (mental and physical) exercising the right to family care and medical leave gender gender expression gender identity genetic information marital status medical condition military or veteran status national origin political affiliation race religious creed sex (includes pregnancy childbirth breastfeeding and related medical conditions) and sexual orientation.
#LICH1