English-Speaking Data Engineer Required (for Candidates Living in Japan)

Job Location: Tokyo, Japan

Monthly Salary: Not Disclosed
Posted on: 20 hours ago
Vacancies: 1

Job Summary

Job Description

National-scale AI & robotics project in Japan, in a global research environment!

Job Title

Data Engineer

Company Overview

This project focuses on building large-scale datasets and data platforms to support next-generation AI and robotics models. The team works with real-world humanoid robot data and develops open, reusable data assets for global researchers and engineers.

Backed by a major national initiative, the project brings together experts from academia and industry to advance AI and robotics through high-quality data, scalable infrastructure, and open collaboration.

Your Role and Responsibilities

Design and implement large-scale data pipelines covering the full lifecycle of datasets, including collection, processing, curation, and publishing.

Design, build, and maintain data schemas, storage systems, and query interfaces for efficient dataset access.

Collaborate closely with AI and robotics researchers to understand evolving data requirements.

Build and scale distributed data-processing pipelines capable of handling large multimodal datasets (e.g., images, depth data, point clouds).

Define data-quality metrics and implement monitoring and feedback loops to ensure continuous improvement.

Manage metadata, lineage, and data governance for reliable and reproducible data usage.

Experience and Qualifications

Master's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

5 years of professional experience in data engineering or data platform development.

Proven experience delivering production-grade distributed data systems.

3 years of experience building and operating large-scale ETL/ELT pipelines using Spark, Flink, Ray, or similar frameworks.

Hands-on experience with workflow orchestration tools such as Airflow, Kedro, or Dagster.

Experience optimizing high-volume data workloads (10 TB per day).

Additional Preferred Qualifications

Experience with lakehouse architectures such as Delta Lake, Apache Iceberg, or Apache Hudi.

Experience with query engines and catalogs such as Trino, Athena, Databricks SQL, or similar tools.

Strong understanding of schema evolution, data versioning, and cost-performance optimization.

Experience building bronze/silver/gold data layers using dbt or equivalent tools.

Experience defining and enforcing data quality SLAs.

Familiarity with data quality, lineage, and governance tools (e.g., Great Expectations, DataHub, OpenMetadata).

Experience working with terabyte- or petabyte-scale datasets.

Business-level English proficiency; Japanese proficiency is a plus.

Good Reasons to Join

Work on a rare large-scale AI and robotics data project.

Build data platforms that directly support advanced AI research and real-world applications.

Collaborate with researchers and engineers from diverse backgrounds.

Gain visibility through impactful technical contributions and research output.

Contribute to a project with strong social and technological significance.

Work Location

Tokyo (primarily on-site)

Salary

Negotiable
