Ready to build the future with AI?
At Genpact, we don't just keep up with technology; we set the pace. AI and digital innovation are redefining industries, and we're leading the charge. Genpact's AI Gigafactory, our industry-first accelerator, is an example of how we're scaling advanced technology solutions to help global enterprises work smarter, grow faster, and transform at scale. From large-scale models to agentic AI, our breakthrough solutions tackle companies' most complex challenges.
If you thrive in a fast-moving, innovation-driven environment, love building and deploying cutting-edge AI solutions, and want to push the boundaries of what's possible, this is your moment.
Genpact (NYSE: G) is an advanced technology services and solutions company that delivers lasting value for leading enterprises globally. Through our deep business knowledge, operational excellence, and cutting-edge solutions, we help companies across industries get ahead and stay ahead. Powered by curiosity, courage, and innovation, our teams implement data, technology, and AI to create tomorrow, today. Get to know us on LinkedIn, X, YouTube, and Facebook.
Inviting applications for the role of Lead Consultant, Monitoring Specialist
We are seeking a Monitoring & Observability Dashboard Expert who can design, build, and operate advanced monitoring dashboards that provide end-to-end visibility across infrastructure, platforms, and applications. This role is critical in enabling rapid troubleshooting, anomaly detection, and root cause analysis by correlating system-level signals with application behavior on a single pane of glass.
Responsibilities:
Dashboard Design & Visualization
- Design and maintain holistic monitoring dashboards that combine:
  - Infrastructure metrics (CPU, memory, disk, network)
  - Application metrics (latency, error rates, throughput)
  - Logs and traces
- Build role-based dashboards for Ops, SRE, and Engineering teams (L1-L3 support).
- Ensure dashboards support quick triage and drill-down during incidents.
Service Mapping & Dependency Analysis
- Implement and maintain service maps showing:
  - Application-to-application dependencies
  - Upstream/downstream service relationships
  - Infra-to-app linkage (VMs, containers, clusters)
- Enable impact analysis to identify blast radius during outages.
Anomaly Detection & Proactive Monitoring
- Configure anomaly detection for key system and application metrics.
- Reduce alert noise through threshold tuning, baselining, and correlation.
- Identify early warning signals before customer impact.
Correlation & Root Cause Analysis
- Correlate:
  - Infrastructure failures (CPU spikes, pod restarts, node issues)
  - Application errors (exceptions, failed transactions, latency spikes)
- Enable end-to-end troubleshooting workflows directly from dashboards.
- Support post-incident RCA with data-driven insights.
Alerting & Incident Support
- Design alerting strategies aligned with SLAs, SLOs, and error budgets.
- Integrate alerts with incident management tools (PagerDuty preferred).
- Act as a key partner during production incidents, reducing MTTR.
Continuous Improvement
- Partner with engineering teams to improve instrumentation, logging, and tracing.
- Recommend improvements to observability coverage as architectures evolve.
- Document dashboards, metrics, and troubleshooting playbooks.
Qualifications we seek in you!
Minimum Qualifications
BE/B Tech/MCA
Excellent written and verbal communication skills
Preferred Qualifications/Skills
Strong experience with monitoring and observability platforms such as:
- Grafana, Datadog (preferred), Dynatrace, Splunk, Prometheus, Elastic
Deep understanding of:
- Metrics, logs, and distributed tracing
- Application Performance Monitoring (APM)
- Service maps and dependency graphs
Hands-on experience correlating infrastructure and application telemetry.
Platform & Architecture Knowledge
- Solid understanding of:
  - AWS Cloud platform
  - Containers and orchestration (Docker, Kubernetes, EKS/AKS)
  - Microservices and distributed systems
- Familiarity with CI/CD and how releases impact monitoring signals.
Troubleshooting Mindset
- Proven ability to troubleshoot complex production issues using dashboards.
- Strong root-cause analysis skills across layered systems.
- Experience working in production support or SRE environments is a plus.
Experience with:
- SLO/SLA definition and error budgets
- Synthetic monitoring and user journey tracking
- Custom metric instrumentation (OpenTelemetry preferred)
Scripting or automation (Python, Bash) for monitoring enhancements.
Behavioral & Collaboration Skills
- Strong communication skills; able to explain complex system behavior visually.
- Comfortable working with Ops, SRE, and Engineering teams.
- Calm and effective during high-severity incidents.
- Documentation-first mindset.
Why join Genpact
Lead AI-first transformation: Build and scale AI solutions that redefine industries
Make an impact: Drive change for global enterprises and solve business challenges that matter
Accelerate your career: Gain hands-on experience, world-class training, mentorship, and AI certifications to advance your skills
Grow with the best: Learn from top engineers, data scientists, and AI experts in a dynamic, fast-moving workplace
Committed to ethical AI: Work in an environment where governance, transparency, and security are at the core of everything we build
Thrive in a values-driven culture: Our courage, curiosity, and incisiveness, built on a foundation of integrity and inclusion, allow your ideas to fuel progress
Come join the 140,000 coders, tech shapers, and growth makers at Genpact and take your career in the only direction that matters: Up.
Let's build tomorrow together.
Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability, or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity, customer focus, and innovation.
Furthermore, please note that Genpact does not charge fees to process job applications, and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a "starter kit," paying to apply, or purchasing equipment or training.
Required Experience:
IC
Artificial Intelligence. Real Outcomes. AI is changing big businesses, and so are we. Discover how cutting-edge AI drives unparalleled value.