913 Data Engineer Senior Remote LATAM

Darwoft

Job Location:

Córdoba - Argentina

Monthly Salary: Not Disclosed
Posted on: 21 hours ago
Vacancies: 1 Vacancy

Job Summary

Data Engineer (Graph Data Systems/Neo4j/Python/Entity Resolution) Senior Remote LATAM

We are partnering with a fast-growing healthtech startup that is building AI-native infrastructure for next-generation personalized and preventative healthcare.
They are developing an all-in-one clinical data platform designed for a new era of medicine, where fragmented, longitudinal, and multi-modal health data becomes real-time, actionable insights for clinicians and care teams.

This platform unifies records, labs, wearables, diagnostics, genomics, and lifestyle inputs into a single coherent patient model. Early customers include leading functional, integrative, longevity, and concierge medicine clinics, which are helping shape the product from day one.

The mission: turn complex health data into clarity and action, laying the foundation for predictive personal health models, adaptive clinical intelligence, and digital health twins.

Senior Data Engineer (Graph-Based Data Systems)

This role is ideal for an engineer who has strong experience with graph databases and thrives on building complex, scalable data systems from the ground up. You will design the core graph architecture, data lineage systems, fuzzy matching engines, and ingestion frameworks that power the future of healthcare intelligence.

Core Responsibilities

Graph Data Architecture

  • Design and implement the company's core graph database architecture (likely Neo4j).

  • Create advanced graph models representing patients, biomarkers, protocols, events, and relationships across multi-modal data.

  • Enable high-context queries such as "find patients with similar longitudinal patterns."

Data Normalization & Entity Resolution

  • Build the entire normalization and deduplication engine.

  • Implement fuzzy matching & entity resolution for unifying messy, multi-source health data.

  • Establish rules, heuristics, confidence scores, and automated unification pipelines.
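
As a rough illustration of the kind of work this involves, the sketch below normalizes free-text labels, scores string similarity as a confidence value, and greedily groups records above a threshold. It uses only Python's standard-library `difflib`; the function names, the sample labels, and the 0.85 threshold are illustrative assumptions, not part of the client's stack.

```python
from difflib import SequenceMatcher


def normalize(label: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in label.lower())
    return " ".join(cleaned.split())


def match_confidence(a: str, b: str) -> float:
    """Similarity in [0, 1] between two labels after normalization."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def resolve(labels: list[str], threshold: float = 0.85) -> list[list[str]]:
    """Greedy clustering: attach each label to the first cluster whose
    representative (first member) matches at or above the threshold."""
    clusters: list[list[str]] = []
    for label in labels:
        for cluster in clusters:
            if match_confidence(label, cluster[0]) >= threshold:
                cluster.append(label)
                break
        else:
            # No existing cluster matched, so start a new entity.
            clusters.append([label])
    return clusters


# Hypothetical lab-test labels arriving from different sources.
labels = [
    "Vitamin D, 25-Hydroxy",
    "vitamin d 25 hydroxy",
    "Hemoglobin A1c",
    "Hemoglobin A1C ",
    "Ferritin",
]
clusters = resolve(labels)
```

A production engine would likely add blocking, field-specific comparators, and human review for borderline scores; this sketch only shows the rule-plus-confidence-score shape.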

Metadata Lineage & Provenance

  • Design a transparent lineage layer where every data point tracks:

    • source

    • timestamp

    • reliability

    • transformation path

    • confidence scoring

  • Ensure full traceability from ingestion through storage to computation.
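
One way to picture the lineage fields listed above is as an immutable provenance record attached to every value. The sketch below is a minimal Python illustration; the class names, field types, and the idea of multiplying confidence by a per-step factor are assumptions made for the example, not the client's design.

```python
from dataclasses import dataclass, replace
from datetime import datetime, timezone


@dataclass(frozen=True)
class Provenance:
    source: str                      # where the value came from
    timestamp: datetime              # when it was observed
    reliability: float               # trust in the source, 0..1
    confidence: float = 1.0          # confidence in the current value, 0..1
    transformation_path: tuple[str, ...] = ()  # ordered steps applied so far

    def after(self, step: str, confidence_factor: float = 1.0) -> "Provenance":
        """Return a new record with the step appended and confidence scaled."""
        return replace(
            self,
            confidence=self.confidence * confidence_factor,
            transformation_path=self.transformation_path + (step,),
        )


@dataclass(frozen=True)
class DataPoint:
    name: str
    value: float
    unit: str
    provenance: Provenance


# Hypothetical glucose reading converted from mg/dL to mmol/L.
raw = DataPoint(
    name="glucose",
    value=99.0,
    unit="mg/dL",
    provenance=Provenance(
        source="wearable_feed",
        timestamp=datetime(2024, 1, 1, tzinfo=timezone.utc),
        reliability=0.9,
    ),
)
converted = DataPoint(
    name=raw.name,
    value=raw.value / 18.0,  # approximate mg/dL to mmol/L factor for glucose
    unit="mmol/L",
    provenance=raw.provenance.after("unit_conversion", confidence_factor=0.5),
)
```

Because the records are frozen, each transformation yields a new provenance value, so the original reading and its history remain traceable.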

Time-Series & Longitudinal Data Systems

  • Architect storage optimized for high-frequency time-series signals (CGM, wearables, diagnostics).

  • Support long-term longitudinal analysis across patient lifecycles.

API Layer & Query Engine

  • Build APIs enabling complex graph queries and multi-relationship traversal.

  • Support both real-time and analytical access patterns.

Multi-Source Health Data Ingestion

  • Design ingestion for:

    • clinical records (EHRs)

    • labs

    • wearables

    • lifestyle/behavioral inputs

    • continuous monitoring devices

    • advanced diagnostics and genomics

  • Unify this data into a structured, high-quality canonical model.

Required Skills

Must-Haves

  • Deep hands-on experience with graph databases (Neo4j or similar).

  • Strong graph-based and relational data modeling.

  • Experience with fuzzy matching & entity resolution algorithms.

  • Experience in metadata lineage and provenance systems.

  • Time-series data architecture and longitudinal data design.

  • API design for complex multi-hop queries.

  • Strong Python and SQL.

  • Solid system design, especially around scalability and performance.

Nice-to-Haves

  • Knowledge graphs or semantic data modeling.

  • Experience building full data platforms end to end.

  • Experience normalizing messy, multi-source data.

  • Kafka, Pub/Sub, or other streaming frameworks.

  • Exposure to healthcare data: FHIR, HL7.

  • Experience with ML-driven feature extraction or enrichment.

What You Can Expect

  • 100% remote

  • Full contractor engagement

  • High-impact role with ownership from day one

  • Work alongside a world-class founding team in healthtech and AI

  • Build foundational infrastructure for the next decade of healthcare

  • Fast-moving, product-driven environment

Ideal Candidate

  • Thrives in early-stage, high-ownership environments.

  • Loves building complex systems from scratch.

  • Enjoys deeply technical challenges with real-world impact.

  • Believes in the power of data to transform healthcare.

What Darwoft Offers

  • Contractor agreement with payment in USD
  • 100% remote work
  • Argentina's public holidays
  • English classes
  • Referral program
  • Access to learning platforms

Explore this and other opportunities at:

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala