Career Opportunity for a Data Scientist (Creative Vision) in Japan!
Data Scientist (Creative Vision)
Company Overview
Japan-based advanced AI company focused on building next-generation intelligent systems. Backed by a strong global technology group it works at the intersection of cutting-edge research and real-world application developing scalable AI solutions with long-term impact.
Your Role and Responsibilities
Design and operate large-scale multi-modal data pipelines (ingestion deduplication filtering versioning)
Build data APIs and high-throughput loaders (streaming caching sampling)
Develop and manage captioning and annotation workflows including multilingual support
Oversee annotators gold sets QA metrics and quality dashboards
Curate and verify datasets using CLIP/VLM-assisted captioning
Perform data quality control (duplicate detection clustering policy filtering such as NSFW/PII)
Balance datasets across domains and regions; evaluate dense captions and synthetic data
Conduct data ablation studies and create internal research reports
Collaborate with research and product teams; define reusable schemas and SLAs
Experience and Qualifications
Experience with large-scale data infrastructure and multi-modal datasets
Strong background in data processing curation and annotation workflows
Knowledge of data quality evaluation policy filtering and safety considerations
Research-oriented mindset with cross-functional collaboration experience
Additional Preferred Qualifications
Familiarity with CLIP metrics aesthetic/safety evaluation and test set management
Knowledge of data governance licensing deletion workflows and NSFW tracing
Good Reasons to Join
Full remote work possible within Japan
Work Location
Tokyo Japan
Details will be provided during the meeting.
Required Experience:
IC