Description
We are looking for an experienced, detail-oriented individual with demonstrated native-level language expertise in Armenian (Eastern and Classical), Korean (Hanja and Hangul), and/or German (Latin and Gothic); deep knowledge of historical genealogical documents; and a demonstrated understanding of guidelines and principles for creating training data for machine learning systems. This person will help us build high-quality training data, review the work of others, and mentor them to do likewise. This person's work will help make historical records available in FamilySearch's automation and machine learning platform.
Responsibilities
As a Machine Learning Historical Records Linguist II in the Records Product Group at FamilySearch, you will:
- Exercise demonstrated expertise in paleography, linguistics, technology, and historical records to build machine learning training datasets for historical genealogical documents in many languages.
- Review the work of peers and provide mentoring and timely feedback to ensure datasets are created accurately, model the truth of the underlying artifact, and follow project guidelines.
- Assist in updating instructional materials and training others.
- Use paleography skills to accurately decipher historical documents.
- Annotate language data with linguistic information to build natural language processing (NLP) datasets.
- Model the way people naturally read historical documents by creating hierarchies of relationships between areas of text.
- Precisely map the layout of historical documents.
- Curate large amounts of data.
- Review datasets for errors and provide corrections in a timely manner.
- Perform other data modeling activities and duties as assigned.
- Enable FamilySearch's automation efforts by meeting aggressive deadlines, accomplishing work assignments with consistently high output quality and accuracy, and helping others to do likewise.
Qualifications
Education:
BA/BS in Linguistics, Family History, Instructional Design, or other bachelor's degree with related or equivalent experience required.
Experience:
2 years of relevant or related experience, or equivalent experience
Skills & Abilities:
Native-level fluency in at least one of the following:
Armenian: experience with Eastern and Classical Armenian
Korean: experience with old Hanja characters and Hangul characters
German: experience with Gothic and Latin scripts
Experience with additional languages a plus
Business-level fluency in English
Demonstrated paleography skills to accurately decipher historical documents
Demonstrated linguistic skills to build NLP datasets
Demonstrated understanding of guidelines and best practices for creating different types of machine learning datasets from historical genealogical documents
Demonstrated ability to mentor and train others
Demonstrated ability to update instructional materials in a manner that is grammatically correct, concise, accurate, and easy to understand
Experience working with historical documents
Strong technical and analytical aptitude with a passion for data, efficiency, and accuracy
Independent worker who is self-motivated, dependable, detail-oriented, responsible, self-disciplined, and a team player with a record of timely delivery of requests
Willingness to support several projects at one time and to accept reprioritization as necessary in a fast-paced, constantly evolving environment
Comfortable handling a high volume of work on a daily basis
High proficiency in Microsoft Office tools, including Word, PowerPoint, and Excel
Ability to quickly grasp technical concepts