Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailAbout Karya:
Why was Karya on the cover of the Time Magazine highlighted by Satya Nadella and chosen by Google as its partner for Project Vaani
In part because Karya is on a mission to provide AI enabled earning and learning opportunities to economically underserved communities thereby building a pathway out of poverty for them. Karya achieves this while also delivering high quality timely and price competitive data to its clients.
Karyas workers make at least 20 times the Indian minimum wage and through our oneofakind digital work platform we have delivered over 40 million digital tasks and are poised to positively impact over 100 thousand workers by the end of the year. In the coming years our goal is to rapidly scale our impact by bringing economic opportunities to millions of underserved users in India.
We are looking for a Language Data Manager to join our data team that manages and oversees the companys language datasets. This role will be crucial in ensuring the proper collection organization formatting and storage of linguistic data. In addition to dataset management the role will also involve significant analysis of the language data to support our research activities. The ideal candidate will have experience with language data management including data processing and annotation and experience with scripting tools.
Key Responsibilities:
Manage and maintain the companys language datasets ensuring they are accurate wellstructured and accessible to relevant teams.
Oversee the collection annotation and processing of linguistic data for various projects.
Develop and implement best practices for data processing cleaning and formatting specific to language datasets.
Work closely with language experts and technical teams to ensure linguistic data meets the necessary quality standards.
Support the development of tools and workflows that streamline language data preparation and analysis.
Provide regular updates and reports on the status and quality of language datasets to stakeholders.
Assist in the creation of metadata and documentation for datasets to ensure they are welldocumented and reusable.
MustHave Skills & Qualifications:
Experience in managing and processing language datasets including working with largescale text or speech corpora.
Proficiency in programming languages like Python R or similar for data processing.
Familiarity with linguistic annotation standards and techniques.
Experience with tools and platforms commonly used for language data management (e.g. linguistic annotation tools NLP libraries).
Strong problemsolving skills and attention to detail.
Excellent communication skills and the ability to collaborate with crossfunctional teams.
Bachelors degree in Linguistics Computer Science Data Science or a related field.
NicetoHave Skills:
Experience working with speech or textbased datasets for NLP and AI applications.
Familiarity with machine learning models and datasets used in language technologies.
Experience with cloudbased platforms and tools for data management.
People matter at Karya and these are some of the perks and benefits we created for our team:
Qualified applicants will receive consideration without regard to their race colour religion sex sexual orientation gender identity and disability.
Karya invites all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search to apply for a career opportunity reach out at.
Required Experience:
Manager
Full Time