hatch I.T. is partnering with Babel Street to find an Image & Computer Vision AI Engineer.Please see details below:
About the Role
As an Engineer on the Image & Computer Vision AI team you will play a hands-on role in developing and deploying computer vision capabilities that support Babel Streets intelligence applications. You will build systems that extract analyze and reason over visual dataenabling facial matching object and scene understanding geolocation and location inference from imagery and multimodal intelligence workflows.
This role is execution-focused and suited for engineers with strong foundations in computer vision image processing and machine learning who want to apply their skills to real-world mission-driven problems. You will work closely with AI Product and Engineering teams to deliver reliable scalable and cost-efficient vision capabilities including integration with multimodal LLM systems that allow users to search and reason over images using natural language.
This is a hybrid role to be based out of either their Reston VA/Washington DC office or their Somerville MA office.
About the Company
Babel Street is the trusted technology partner for the worlds most advanced identity intelligence and risk operations. They deliver advanced AI and data analytics solutions providing unmatched analysis-ready data regardless of language proactive risk identification 360-degree insights high-speed automation and seamless integration into existing systems. Babel Street empowers government and commercial organizations to transform high-stakes identity and risk operations into a strategic advantage. The actionable insights we deliver safeguard lives and protect critical assets around the world.Babel Street is headquartered inReston Virginia withregionaloffices in BostonMAand Cleveland OH andinternationaloffices inAustralia Canada Israel Japan and the U.K.
Role Focus:
This role spans three practical execution areas:
Computer Vision & Image Analytics
You will implement andoperateimage analytics pipelines that support facial matching object detection scene understanding and image similarity. This includes image preprocessing feature extraction model inference evaluation and performance optimization to meet mission-grade accuracy and latency requirements.
Geospatial & Location Inference from Imagery
You will contribute to capabilities that infer location context or environmental attributes from imageryleveragingvisual cues metadata and learned representations. This includes supporting image-based geolocation landmark recognition and contextual scene analysis used in intelligence workflows.
Multi-Modal AI & Image Search
You will support multimodal AI systems that combine vision models with LLMs embeddings and retrieval pipelines to enable natural-language search and reasoning over images and image collections. You will help integrate visual understanding into broader intelligence applications and workflows.
What you will do:
- Build andmaintaincomputer vision pipelines for image ingestion preprocessing inference and evaluation.
- Implement facial matching and identity-related vision workflowsin accordance withaccuracy safety and compliance requirements.
- Develop and support object detection image similarity and scene understanding models.
- Contribute to image-based geolocation and location inference capabilities using visual features and contextual signals.
- Support multimodal AI workflows that combine image embeddings with LLM-based search and reasoning.
- Write clean maintainable Python code and contribute to production services and APIs.
- Assistwith model evaluation bias testing and accuracy monitoring for vision systems.
- Optimizeinference pipelines for performance scalability and cost efficiency (GPU usage batching modelselection).
- Collaborate with Product and Engineering teams to integrate vision capabilities into user-facing intelligence applications.
What you will bring:
- 3 years of experience in computer vision image processing or applied machine learning.
- Hands-on experience with computer vision models and techniques (e.g. CNNs transformers for vision feature embeddings).
- Experience building or integrating image analytics such as facial recognition object detection or image similarity.
- Strong programming skills inPython; experience with common CV/ML libraries (PyTorch TensorFlow OpenCV etc.).
- Solid understanding of machine learning fundamentals model evaluation and performance tradeoffs.
- Experience working with large image datasets andproductionML pipelines.
- Ability to work collaboratively in a fast-moving mission-driven engineering environment.
Preferred
- Experience withfacial matching or biometric systemsin regulated or high-stakesenvironments.
- Experience withimage-based geolocationor scene/location inference.
- Familiarity withmultimodal AI systems including combining vision models with LLMs or natural-language search.
Education:
- Bachelors degree in Computer Science Engineering Data Science ora relatedtechnical fieldrequired.
Advanced degree is a plus but notrequired.
We may use artificial intelligence (AI) tools to support parts of the hiring process such as reviewing applications analyzing resumes or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed please contact us.
Required Experience:
IC
hatch I.T. is partnering with Babel Street to find an Image & Computer Vision AI Engineer.Please see details below:About the RoleAs an Engineer on the Image & Computer Vision AI team you will play a hands-on role in developing and deploying computer vision capabilities that support Babel Streets int...
hatch I.T. is partnering with Babel Street to find an Image & Computer Vision AI Engineer.Please see details below:
About the Role
As an Engineer on the Image & Computer Vision AI team you will play a hands-on role in developing and deploying computer vision capabilities that support Babel Streets intelligence applications. You will build systems that extract analyze and reason over visual dataenabling facial matching object and scene understanding geolocation and location inference from imagery and multimodal intelligence workflows.
This role is execution-focused and suited for engineers with strong foundations in computer vision image processing and machine learning who want to apply their skills to real-world mission-driven problems. You will work closely with AI Product and Engineering teams to deliver reliable scalable and cost-efficient vision capabilities including integration with multimodal LLM systems that allow users to search and reason over images using natural language.
This is a hybrid role to be based out of either their Reston VA/Washington DC office or their Somerville MA office.
About the Company
Babel Street is the trusted technology partner for the worlds most advanced identity intelligence and risk operations. They deliver advanced AI and data analytics solutions providing unmatched analysis-ready data regardless of language proactive risk identification 360-degree insights high-speed automation and seamless integration into existing systems. Babel Street empowers government and commercial organizations to transform high-stakes identity and risk operations into a strategic advantage. The actionable insights we deliver safeguard lives and protect critical assets around the world.Babel Street is headquartered inReston Virginia withregionaloffices in BostonMAand Cleveland OH andinternationaloffices inAustralia Canada Israel Japan and the U.K.
Role Focus:
This role spans three practical execution areas:
Computer Vision & Image Analytics
You will implement andoperateimage analytics pipelines that support facial matching object detection scene understanding and image similarity. This includes image preprocessing feature extraction model inference evaluation and performance optimization to meet mission-grade accuracy and latency requirements.
Geospatial & Location Inference from Imagery
You will contribute to capabilities that infer location context or environmental attributes from imageryleveragingvisual cues metadata and learned representations. This includes supporting image-based geolocation landmark recognition and contextual scene analysis used in intelligence workflows.
Multi-Modal AI & Image Search
You will support multimodal AI systems that combine vision models with LLMs embeddings and retrieval pipelines to enable natural-language search and reasoning over images and image collections. You will help integrate visual understanding into broader intelligence applications and workflows.
What you will do:
- Build andmaintaincomputer vision pipelines for image ingestion preprocessing inference and evaluation.
- Implement facial matching and identity-related vision workflowsin accordance withaccuracy safety and compliance requirements.
- Develop and support object detection image similarity and scene understanding models.
- Contribute to image-based geolocation and location inference capabilities using visual features and contextual signals.
- Support multimodal AI workflows that combine image embeddings with LLM-based search and reasoning.
- Write clean maintainable Python code and contribute to production services and APIs.
- Assistwith model evaluation bias testing and accuracy monitoring for vision systems.
- Optimizeinference pipelines for performance scalability and cost efficiency (GPU usage batching modelselection).
- Collaborate with Product and Engineering teams to integrate vision capabilities into user-facing intelligence applications.
What you will bring:
- 3 years of experience in computer vision image processing or applied machine learning.
- Hands-on experience with computer vision models and techniques (e.g. CNNs transformers for vision feature embeddings).
- Experience building or integrating image analytics such as facial recognition object detection or image similarity.
- Strong programming skills inPython; experience with common CV/ML libraries (PyTorch TensorFlow OpenCV etc.).
- Solid understanding of machine learning fundamentals model evaluation and performance tradeoffs.
- Experience working with large image datasets andproductionML pipelines.
- Ability to work collaboratively in a fast-moving mission-driven engineering environment.
Preferred
- Experience withfacial matching or biometric systemsin regulated or high-stakesenvironments.
- Experience withimage-based geolocationor scene/location inference.
- Familiarity withmultimodal AI systems including combining vision models with LLMs or natural-language search.
Education:
- Bachelors degree in Computer Science Engineering Data Science ora relatedtechnical fieldrequired.
Advanced degree is a plus but notrequired.
We may use artificial intelligence (AI) tools to support parts of the hiring process such as reviewing applications analyzing resumes or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed please contact us.
Required Experience:
IC
View more
View less