Lead Data Platform Engineer

Exactera

Job Location:

San Diego, CA - USA

Monthly Salary: Not Disclosed
Posted on: 17 hours ago
Vacancies: 1 Vacancy

Job Summary

Exactera has offices in New York City; Tarrytown, NY; San Diego, CA; London; and Argentina.

The Role

As Lead Data Platform Engineer, you'll architect and implement our centralized data platform on Databricks. You'll establish governance patterns using Unity Catalog, optimize for cost and performance at scale, and enable our existing Data Engineers to build confidently on the platform. This is a data infrastructure role focused on pipelines, storage, governance, and platform operations.

The Business Challenge

We operate multiple product lines (Transfer Pricing, R&D Services, RoyaltyStat, Provisioning), each with distinct databases containing enterprise financial data: journal entries, general ledgers, and financial statements. Our immediate challenge is migrating multi-terabyte datasets from legacy systems to a unified Databricks lakehouse while establishing governance patterns that enable multi-product operations at scale.

What You'll Build

  • Data Structuring: Design data models and implement unified schemas across multiple disparate product lines.
  • Unity Catalog Architecture: Design and implement a multi-catalog governance strategy supporting data isolation, cross-product data sharing, and comprehensive lineage tracking across our product portfolio.
  • Delta Lake Optimization: Establish patterns for Z-ordering, compaction, and liquid clustering at multi-TB scale. Define table structures, partitioning strategies, and retention policies that balance query performance with storage costs.
  • ETL Pipeline Framework: Build declarative pipeline patterns using Delta Live Tables. Create orchestration workflows for ingesting data from internal sources such as SQL databases and S3.
  • Third-Party Integrations: Integrate with third-party data sources such as ERP systems (NetSuite, etc.) and external data providers (S&P, etc.) with automated ingestion, robust error handling, and monitoring.
  • Platform Operations: Implement cost monitoring and optimization strategies, establish data quality frameworks, and create self-service patterns that enable Data Engineers to work independently while maintaining governance standards.

Business Problems You'll Solve

  • Key Legacy Product Migrations: Lead the architecture for migrating multi-terabyte datasets from legacy systems to Databricks, establishing patterns that will be reused across multiple product lines.
  • Multi-Product Data Architecture: Design Unity Catalog structures enabling secure data separation between product lines while allowing controlled cross-product analytics where appropriate.
  • Cost-Efficient Scale: Build infrastructure that scales efficiently through intelligent caching, query optimization, and compute management strategies that avoid linear cost growth.
  • Platform Reliability: Establish monitoring, alerting, and data quality validation, ensuring the platform operates reliably as a foundation for both analytics and AI workloads.

Required Experience

Databricks Expertise (Required)

  • Unity Catalog: Production experience with multi-catalog governance, metastore design, and lineage tracking.
  • Data Structuring: Experience designing and building unified schemas across multiple disparate product lines.
  • Delta Lake: Expert-level experience with Z-ordering, compaction, liquid clustering, and performance tuning at multi-TB scale.
  • Delta Live Tables: Strong hands-on experience building declarative ETL pipelines, including change data capture and expectations/constraints.
  • Databricks Workflows: Experience with job orchestration, scheduling, and operational monitoring.
  • Business Intelligence: Experience enabling company-wide analytics and reporting with modern business intelligence tools and maintaining source-of-truth data and metrics.
  • PySpark & Databricks SQL: Strong proficiency for code review, performance tuning, and query optimization.

Core Platform Engineering

  • 5-8 years in data engineering or data platform roles, with 3 years of hands-on Databricks experience
  • Track record of leading at least one significant platform build or migration project
  • AWS experience (S3, IAM, VPC) with the ability to collaborate on infrastructure decisions
  • Infrastructure-as-code experience (Terraform preferred)

Technical Leadership

  • Demonstrated ability to architect data platforms from first principles and defend technical decisions
  • Strong written and verbal communication: document architecture decisions and present to both technical and business stakeholders

Preferred But Not Required

  • Experience with financial data, accounting systems (NetSuite), or enterprise ERP platforms
  • Background building platforms that serve AI/ML workloads (experience preparing data for downstream ML consumption, RAG and retrieval, and LLMs)
  • Understanding of advanced intelligence concepts such as relationship surfacing with knowledge graphs
  • Familiarity with data governance frameworks and compliance requirements for regulated industries

What We Offer:
(The following only applies to US-based positions)

  • A collaborative team culture with opportunities for career development.
  • Ample opportunities to be recognized, build valuable skills, and grow your career.
  • Generous vacation policy, including paid parental leave.
  • Comprehensive health plans with FSA and HSA options.
  • 401(k) retirement plan.
  • Life and disability insurance coverage.
  • Supplemental benefits like a dependent care savings plan, pet insurance, will preparation, and an employee assistance program.

Required Experience:

IC

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala

About Company

Exactera offers expert tax and transfer pricing compliance services delivered by experts and powered by AI.
