Sr. Data Engineer

Abacus Insights

Job Location:

Pune - India

Monthly Salary: Not Disclosed
Posted on: 22 hours ago
Vacancies: 1 Vacancy

Job Summary

About Us

Abacus Insights is transforming how data works for health plans. Our mission is simple: make healthcare data usable so the people responsible for care and cost decisions can act faster, with confidence.
We help health plans break down data silos to create a single, trusted data foundation. That foundation powers better decisions so plans can improve outcomes, reduce waste, and deliver better experiences for members and providers alike.

Backed by $100M from top investors, we're tackling big challenges in an industry that's ready for change. Our platform enables GenAI use cases by delivering clean, connected, and reliable healthcare data that can support automation, prioritization, and decision workflows, and it's why we are leading the way.

Our innovation begins with people. We are bold, curious, and collaborative, because the best ideas come from working together. Ready to make an impact? Join us and let's build the future together.

About the role

We are seeking an accomplished Data Engineer to join our dynamic and rapidly expanding Tech Ops division. With significant projected growth, this is an opportunity to drive meaningful technical impact. In this role, you will work directly with customers, data vendors, and internal engineering teams to design, implement, and optimize complex data integration solutions within a modern, large-scale cloud environment.

You will leverage advanced skills in distributed computing, data architecture, and cloud-native engineering to enable scalable, resilient, and high-performance data ingestion and transformation pipelines. As a trusted technical advisor, you will guide customers in adopting Abacus's core data management platform and ensure high-quality, compliant data operations across the lifecycle.

Your day to day

  • Architect, design, and implement high-volume batch and real-time data pipelines using PySpark, SparkSQL, Databricks Workflows, and distributed processing frameworks.
  • Build end-to-end ingestion frameworks integrating with Databricks, Snowflake, AWS services (S3, SQS, Lambda), and vendor data APIs, ensuring data quality, lineage, and schema evolution.
  • Develop data modeling frameworks, including star/snowflake schemas and optimization techniques for analytical workloads on cloud data warehouses.
  • Lead technical solution design for health plan clients, creating highly available, fault-tolerant architectures across multi-account AWS environments.
  • Translate complex business requirements into detailed technical specifications, engineering artifacts, and reusable components.
  • Implement security automation, including RBAC, encryption at rest/in transit, PHI handling, tokenization, auditing, and compliance with HIPAA and SOC 2 frameworks.
  • Establish and enforce data engineering best practices such as CI/CD for data pipelines, code versioning, automated testing, orchestration, logging, and observability patterns.
  • Conduct performance profiling and optimize compute costs, cluster configurations, partitions, indexing, and caching strategies across Databricks and Snowflake environments.
  • Produce high-quality technical documentation, including runbooks, architecture diagrams, and operational standards.
  • Mentor junior engineers through technical reviews, coaching, and training sessions for both internal teams and clients.

What you bring to the team

  • Bachelor's degree in Computer Science, Computer Engineering, or a closely related technical field.
  • 5 years of hands-on experience as a Data Engineer working with large-scale distributed data processing systems in modern cloud environments.
  • Working knowledge of U.S. healthcare data domains, including claims, eligibility, and provider datasets, and experience applying this knowledge to complex ingestion and transformation workflows.
  • Strong ability to communicate complex technical concepts clearly across both technical and non-technical stakeholders.
  • Expert-level proficiency in Python, SQL, and PySpark, including developing distributed data transformations and performance-optimized queries.
  • Demonstrated experience designing, building, and operating production-grade ETL/ELT pipelines using Databricks, Airflow, or similar orchestration and workflow automation tools.
  • Proven experience architecting or operating large-scale data platforms using dbt, Kafka, Delta Lake, and event-driven/streaming architectures within a cloud-native data services or platform engineering environment, requiring specialized knowledge of distributed systems, scalable data pipelines, and cloud-scale data processing.
  • Experience working with structured and semi-structured data formats such as Parquet, ORC, JSON, and Avro, including schema evolution and optimization techniques.
  • Strong working knowledge of AWS data ecosystem components (including S3, SQS, Lambda, Glue, and IAM) or equivalent cloud technologies supporting high-volume data engineering workloads.
  • Proficiency with Terraform, infrastructure-as-code methodologies, and modern CI/CD pipelines (e.g., GitLab) supporting automated deployment and versioning of data systems.
  • Deep expertise in SQL and compute optimization strategies, including Z-Ordering, clustering, partitioning, pruning, and caching for large-scale analytical and operational workloads.
  • Hands-on experience with major cloud data warehouse platforms such as Snowflake (preferred), BigQuery, or Redshift, including performance tuning and data modeling for analytical environments.

What we would like to see, but is not required:

  • Experience in large-scale healthcare or payer data environments.

Our Commitment as an Equal Opportunity Employer

As a mission-led technology company helping to drive better healthcare outcomes, Abacus Insights believes that the best innovation and value we can bring to our customers comes from diverse ideas, thoughts, experiences, and perspectives. Therefore, we dedicate resources to building diverse teams and providing equal employment opportunities to all applicants. Abacus prohibits discrimination and harassment regarding race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.

At the heart of who we are is a commitment to continuously and intentionally building an inclusive culture, one that empowers every team member across the globe to do their best work and bring their authentic selves. We carry that same commitment into our hiring process, aiming to create an interview experience where you feel comfortable and confident showcasing your strengths. If there's anything we can do to support that, big or small, please let us know.


Required Experience:

Senior IC

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala

About Company

Abacus Insights simplifies healthcare data with best-in-class data management solutions that improve data quality and drive valuable insights. We provide our customers with an intelligent platform that unlocks the value of data and removes the burden of maintaining legacy data managem ...