Data Architect with Lakehouse
Job Location:
Iselin, NJ - USA
Monthly Salary:
Not Disclosed
Posted on:
22 hours ago
Vacancies:
1 Vacancy
Job Summary
Role : Data Architect
Location : Iselin NJ (Hybrid)
Type of Hire: C2C
Responsibilities:
- Define and maintain reference architectures (Lakehouse CDC streaming) and domain data models (conceptual logical physical).
- Create and enforce data standards: naming conventions data types modeling practices semantic definitions (aligned to business glossaries).
- Establish metadata operating model: ownership stewardship processes for Catalog Glossary Data Dictionary and Data Lineage.
- Integrate lineage capture across pipelines (ETL/ELT/streaming) BI layers and ML workflows.
- Architect cross-platform data flows across Databricks Oracle/SQL Server Snowflake and metadata tools.
- Define IAM models: RBAC/ABAC SSO/federation SCIM provisioning; directory-driven entitlements and periodic access reviews.
- Define catalog strategy (e.g. Unity Catalog/Purview/Collibra/Alation) and integrate with CI/CD for automated registration and lineage.
- Design reusable pipeline frameworks with configuration-driven IO logging metrics retry/error handling and data quality checks.
Skills:
- Data Modeling: Dimensional (star/snowflake) 3NF Data Vault business glossary-to-model mapping SCD types time-series/event modeling.
- Metadata & Governance: Practical use and integration of Data Catalogs Lineage
- Oracle/SQL Server (data modeling migration/CDC patterns).
- Snowflake (roles warehouses performance tuning tasks/streams dynamic tables).
- Databricks/Spark (SQL/PySpark Structured Streaming Delta Lake; Unity Catalog).
- Security & Compliance: IAM/RBAC masking tokenization encryption; PCI/AML/KYC/GDPR/DPDP awareness.
- Integration & Orchestration: Databricks Workflows Airflow/ADF API integrations with catalog tools; schema registry
- Exceptional interpersonal and collaboration skills within a team environment