APM Agent Developer

Employer Active

1 Vacancy
This job posting is outdated and the position may be filled.
Job Location

Nairobi - Kenya

Monthly Salary

Not Disclosed

Job Description

This is a remote position.

The developer will be responsible for building a lightweight, self-healing, auto-scaling, multi-platform APM (Application Performance Monitoring) agent. Responsibilities cover the following areas:

  1. Instrumentation & Data Collection:

    • Automatically instrument applications to collect transaction traces, logs, and performance metrics.

    • Capture distributed tracing across microservices.

    • Track response times, error rates, resource usage, and database query performance.

    • Collect and forward application, system, and security logs.

  2. Performance & Efficiency Optimization:

    • Implement adaptive sampling to reduce overhead.

    • Ensure asynchronous, non-blocking data collection.

    • Optimize CPU, memory, and network utilization to minimize application impact.

  3. Distributed Tracing & Database Monitoring:

    • Assign and propagate trace IDs across microservices.

    • Monitor slow queries and database calls with minimal overhead.

  4. Log Collection & Security Monitoring:

    • Collect, filter, and forward application/system logs.

    • Detect security anomalies and unusual resource usage patterns.

  5. Communication & Data Transmission:

    • Efficiently batch and compress data before sending to the APM platform.

    • Use lightweight protocols (gRPC, Protobuf, etc.) for communication.

  6. Self-Healing & Auto-Scaling Mechanisms:

    • Implement triggers for auto-scaling based on CPU, memory, and latency thresholds.

    • Enable self-healing by restarting services upon failure or excessive resource usage.

  7. Delivery Timeline & Reporting:

    • Develop the agent within 2 to 3 weeks.

    • Provide technical documentation and performance benchmarks.



Requirements

Qualifications:

  • Programming Expertise: Proficiency in languages commonly used for APM agents, such as Java, Python, Go, .NET, or C++.

  • Instrumentation & Monitoring Experience: Hands-on experience with code profiling, distributed tracing (OpenTelemetry), and application instrumentation.

  • Performance Optimization: Knowledge of efficient data collection strategies, async programming, and low-latency data transmission.

  • Logging & Security: Experience integrating with logging pipelines (ELK, Splunk, Loki) and implementing basic security anomaly detection.

  • Scalability & Resilience: Familiarity with auto-scaling, self-healing mechanisms, and cloud-native architectures.

  • APM & Observability Tools: Experience with tools like Prometheus, OpenTelemetry, Datadog, New Relic, or Dynatrace is a plus.

  • Networking & Communication Protocols: Proficiency in gRPC, Protobuf, or HTTP-based telemetry data transfer.

  • Agile & Fast-Paced Development: Ability to deliver a functional prototype within 2-3 weeks and iterate based on feedback.

  • Strong Debugging & Problem-Solving Skills: Ability to analyze performance bottlenecks and optimize agent behavior.

Employment Type

Full Time
