Senior Inference Engineer AI I
Job Summary
Thomson Reuters is seeking a Senior Inference Engineer AI. This person will collaborate with platform teams to enhance capacity forecasting for AI workloads and work with Product Data Science Architecture and Enterprise AI teams to onboard new research models into production.
About the Role
As a Senior Inference Engineer AI responsibilities include/you will:
- Within Platform Engineering and Enterprise AI Services an AI Inference Engineer is responsible for productionizing optimizing and scaling AI and LLM workloads that power TRs AI driven products.
- This role ensures that our trained modelsfrom classical ML to generative AIrun efficiently across TRs multi cloud footprint (AWS Azure GCP OCI) meet strict enterprise reliability requirements and integrate seamlessly with our data backbone (Snowflake OpenSearch vector search API managed model routing).
- The successful candidate will help build the next generation of TRs AI infrastructure working alongside cloud engineering data engineering product teams and AI Services.
- Optimize LLMs and ML models for high-performance inference using techniques such as quantization pruning distillation and hardware specific tuning
- Deploy and scale inference workloads on GPUs across AWS Azure GCP and internal Kubernetes clusters ensuring predictable performance during peak traffic hours especially during business hours
- Implement routing and failover strategies for OpenAI/Anthropic/Vertex AI traffic
- Integrate models into production grade APIs supporting TR products and enterprise workflows.
- Develop highly optimized environment and eliminate performance bottlenecks to reduce latency.
- Collaborate with Platform Engineering teams (Landing Zones Network Storage Compute AI) to ensure inference workloads align with TRs cloud native patterns (AWS Azure GCP OCI)
- Build and optimize containerized inference pipelines using Kubernetes for large-scale distributed workloads
- Ensure compliance with TRs AI standards for deployment monitoring governance and drift detection
- Profile inference performance identify GPU/CPU bottlenecks and optimize compute utilization across heterogeneous hardware
- Implement observability and health monitoring for inference pipelines ensuring reliability of enterprise AI services
- Collaborates closely with AI engineers to invent new quantization techniques improve numerical precision and explore nonstandard architectures and support the scale out of AI infrastructure during critical releases and global product rollouts
- Partner with Cloud Engineers (Azure AWS GCP) to develop guardrails and automation that support inference workloads
About You
You are a potential fit for the role Senior Inference Engineer AI if your background includes:
- 5 years of relevant experience
- Strong understanding of ML/LLM fundamentals and inference optimization techniques.
- Hands-on experience with GPU programming (CUDA preferred) inference runtimes (TensorRT ONNX
- Runtime) and deep learning frameworks (PyTorch/TensorFlow)
- Proficiency in Python and at least one systems language (C strongly preferred for performance
- critical inference paths)
- Experience deploying AI workloads to AWS/GCP/Azure and Kubernetes
- Familiarity with vector search systems (OpenSearch vectors) and retrieval augmented generation pipelines
- Knowledge of distributed systems microservices CI/CD and cloud native architecture
#LI-MW1
Whats in it For You
Hybrid Work Model: Weve adopted a flexible hybrid working environment (2-3 days a week in the office depending on the role) for our office-based roles while delivering a seamless experience that is digitally and physically connected.
Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities whether caring for family giving back to the community or finding time to refresh and reset. This builds upon our flexible work arrangements including work from anywhere for up to 8 weeks per year empowering employees to achieve a better work-life balance.
Career Development and Growth: By fostering a culture of continuous learning and skill development we prepare our talent to tackle tomorrows challenges and deliver real-world solutions. Our Grow My Way programming and skills-first approach ensures you have the tools and knowledge to grow lead and thrive in an AI-enabled future.
Industry Competitive Benefits: We offer comprehensive benefit plans to include flexible vacation two company-wide Mental Health Days off access to the Headspace app retirement savings tuition reimbursement employee incentive programs and resources for mental physical and financial wellbeing.
Culture: Globally recognized award-winning reputation for inclusion and belonging flexibility work-life balance and more. We live by our values: Obsess over our Customers Compete to Win Challenge (Y)our Thinking Act Fast / Learn Fast and Stronger Together.
Social Impact: Make an impact in your community with our Social Impact Institute. We offer employees two paid volunteer days off annually and opportunities to get involved with pro-bono consulting projects and Environmental Social and Governance (ESG) initiatives.
Making a Real-World Impact:We are one of the few companies globally that helps its customers pursue justice truth and transparency. Together with the professionals and institutions we serve we help uphold the rule of law turn the wheels of commerce catch bad actors report the facts and provide trusted unbiased information to people all over the world.
Our use of AI within the recruitment process Thomson Reuters utilizes Artificial Intelligence (AI) to support parts of our global recruitment process. Unless you opt-out our AI system will assess the information provided by you and compare it to the requirements listed for the role and present the result to our recruitment personnel for further review. The AI system acts as a supporting tool but there is always a human making the decision if you will be considered for the role
Thomson Reuters complies with local laws that require upfront disclosure of the expected pay range for a position. The base compensation range varies across locations. For Ontario Canada the base compensation range for this role is $100000 CAD - $145000 CAD. Base pay is positioned within the range based on several factors including an individuals knowledge skills and experience with consideration given to internal equity. Base pay is one part of a comprehensive Total Reward program which also includes flexible and supportive benefits and other wellbeing programs. This role may also be eligible for an Annual Bonus based on a combination of enterprise and individual performance.About Us
Thomson Reuters informs the way forward by bringing together the trusted content and technology that people and organizations need to make the right decisions. We serve professionals across legal tax accounting compliance government and media. Our products combine highly specialized software and insights to empower professionals with the data intelligence and solutions needed to make informed decisions and to help institutions in their pursuit of justice truth and transparency. Reuters part of Thomson Reuters is a world leading provider of trusted journalism and news.
We are powered by the talents of 26000 employees across more than 70 countries where everyone has a chance to contribute and grow professionally in flexible work environments. At a time when objectivity accuracy fairness and transparency are under attack we consider it our duty to pursue them. Sound exciting Join us and help shape the industries that move society forward.
As a global business we rely on the unique backgrounds perspectives and experiences of all employees to deliver on our business goals. To ensure we can do that we seek talented qualified employees in all our operations around the world regardless of race color sex/gender including pregnancy gender identity and expression national origin religion sexual orientation disability age marital status citizen status veteran status or any other protected classification under applicable law. Thomson Reuters is proud to be an Equal Employment Opportunity Employer providing a drug-free workplace.
We also make reasonable accommodations for qualified individuals with disabilities and for sincerely held religious beliefs in accordance with applicable law. More information on requesting an accommodation here.
Learn more on how to protect yourself from fraudulent job postings here.
Required Experience:
Senior IC
About Company
Thomson Reuters CoCounsel is AI technology built by industry experts, backed by authoritative content and equipped with best-in-class security.