In this role you will be responsible for the quality and accessibility of the multimodal data (including image video text audio sensor data metadata etc.) generated from various data collections processing pipelines and annotations. Youll design and implement systematic processes automated pipelines and collaborate with data collection data processing and ML & product engineers to create high quality data support data-driven product and AIML development and ensure data compliance with security and privacy regulations.
- 6 years of industry experience architecting and developing scalable and reliable software pipeline and platforms for validation analytics and curation on the multimodal data (including image video text audio sensor data etc.)
- B.S. in Computer Science and/or an equivalent engineering field
- Proficiency with programming languages Python Java SQL or equivalent
- Proficiency with data pipeline modeling database and query tools like Dagster PostgreSQL MangoDB Trino or equivalent
- Experience with vision data processing tools like FFmpeg GStreamer OpenCV or equivalent
- Able to rotate on-call for mission-critical operations and applications
- Passion for data quality and curation code elegance clear documentation operational excellence attention to details and delivering outstanding user experiences
- Excellent communication skills with ability to confidently express the benefits and constraints of technology solutions to cross-functional technical and non-technical teams
- Experience in building Cloud Data Warehouses in Snowflake Redshift BigQuery or analogous architectures
- Experience with the practical application of data warehousing concepts methodologies and frameworks
- Experience with AI/ML frameworks like TensorFlow or PyTorch
- Experience with machine learning algorithms for data curation and annotation
- Experience in managing a team
- Experience with data collection and/or annotation operations
Required Experience:
Senior IC
In this role you will be responsible for the quality and accessibility of the multimodal data (including image video text audio sensor data metadata etc.) generated from various data collections processing pipelines and annotations. Youll design and implement systematic processes automated pipelines...
In this role you will be responsible for the quality and accessibility of the multimodal data (including image video text audio sensor data metadata etc.) generated from various data collections processing pipelines and annotations. Youll design and implement systematic processes automated pipelines and collaborate with data collection data processing and ML & product engineers to create high quality data support data-driven product and AIML development and ensure data compliance with security and privacy regulations.
- 6 years of industry experience architecting and developing scalable and reliable software pipeline and platforms for validation analytics and curation on the multimodal data (including image video text audio sensor data etc.)
- B.S. in Computer Science and/or an equivalent engineering field
- Proficiency with programming languages Python Java SQL or equivalent
- Proficiency with data pipeline modeling database and query tools like Dagster PostgreSQL MangoDB Trino or equivalent
- Experience with vision data processing tools like FFmpeg GStreamer OpenCV or equivalent
- Able to rotate on-call for mission-critical operations and applications
- Passion for data quality and curation code elegance clear documentation operational excellence attention to details and delivering outstanding user experiences
- Excellent communication skills with ability to confidently express the benefits and constraints of technology solutions to cross-functional technical and non-technical teams
- Experience in building Cloud Data Warehouses in Snowflake Redshift BigQuery or analogous architectures
- Experience with the practical application of data warehousing concepts methodologies and frameworks
- Experience with AI/ML frameworks like TensorFlow or PyTorch
- Experience with machine learning algorithms for data curation and annotation
- Experience in managing a team
- Experience with data collection and/or annotation operations
Required Experience:
Senior IC
View more
View less