Staff AI Engineer: Foundational AI Services
We are looking for a visionary Staff AI Engineer to architect and scale our core AI intelligence this role you wont just be building features; you will be building the AI Operating System for our companythe foundational services that empower every product team to deploy production-grade GenAI.
The Mission
As a Staff Engineer you will own the end-to-end lifecycle of our internal AI infrastructure moving beyond simple wrappers to create high-performance resilient and autonomous systems.
Key Responsibilities
- LLM Fine-Tuning & Distillation: Lead domain-specific model optimization using PEFT (LoRA/QLoRA) and knowledge distillation to balance cost latency and reasoning capability.
- Architect Foundational RAG: Build next-gen Retrieval-Augmented Generation pipelines using hybrid search cross-encoders and self-correcting retrieval loops.
- Agentic Orchestration: Design and deploy multi-agent systems using frameworks like LangGraph or CrewAI enabling autonomous task planning and tool-use (Function Calling).
- Enterprise-Grade Evaluation: Build LLM-as-a-judge frameworks and robust eval pipelines to measure hallucination rates groundedness and safety.
- Inference Optimization: Implement high-throughput low-latency serving strategies including quantization speculative decoding and prompt caching.
Why Join Us
You will be the technical lead for a mission-critical team setting the standard for how AI is built and scaled. This is a high-impact role where your architecture will directly influence the intelligence of our entire ecosystem.
Qualifications :
To be successful in this role you have:
- Typically requires 8 years of overall software engineering experience.
- Core AI: Expert-level mastery of Transformers attention mechanisms and the latest frontier models (GPT-4o Claude 3.5 Llama 3).
- The Stack: Deep experience with vector databases (Pinecone Weaviate Milvus) orchestration layers (LangChain LlamaIndex) and MLOps tools.
- Software Craft: You are a Staff-level coder in Python/Rust who understands distributed systems concurrency and API design.
- Modern Buzz: You live and breathe Chain-of-Thought (CoT) DSPy GraphRAG and Semantic Caching.
Additional Information :
Work Personas
We approach our distributed world of work with flexibility and trust. Work personas (flexible remote or required in office) are categories that are assigned to ServiceNow employees depending on the nature of their work and their assigned work location. Learn more here. To determine eligibility for a work persona ServiceNow may confirm the distance between your primary residence and the closest ServiceNow office using a third-party service.
Equal Opportunity Employer
ServiceNow is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race color creed religion sex sexual orientation national origin or nationality ancestry age disability gender identity or expression marital status veteran status or any other category protected by addition all qualified applicants with arrest or conviction records will be considered for employment in accordance with legal requirements.
Accommodations
We strive to create an accessible and inclusive experience for all candidates. If you require a reasonable accommodation to complete any part of the application process or are unable to use this online application and need an alternative method to apply please contact for assistance.
Export Control Regulations
For positions requiring access to controlled technology subject to export control regulations including the U.S. Export Administration Regulations (EAR) ServiceNow may be required to obtain export control approval from government authorities for certain individuals. All employment is contingent upon ServiceNow obtaining any export license or other approval that may be required by relevant export control authorities.
From Fortune. 2025 Fortune Media IP Limited. All rights reserved. Used under license.
Remote Work :
No
Employment Type :
Full-time
Staff AI Engineer: Foundational AI ServicesWe are looking for a visionary Staff AI Engineer to architect and scale our core AI intelligence this role you wont just be building features; you will be building the AI Operating System for our companythe foundational services that empower every product ...
Staff AI Engineer: Foundational AI Services
We are looking for a visionary Staff AI Engineer to architect and scale our core AI intelligence this role you wont just be building features; you will be building the AI Operating System for our companythe foundational services that empower every product team to deploy production-grade GenAI.
The Mission
As a Staff Engineer you will own the end-to-end lifecycle of our internal AI infrastructure moving beyond simple wrappers to create high-performance resilient and autonomous systems.
Key Responsibilities
- LLM Fine-Tuning & Distillation: Lead domain-specific model optimization using PEFT (LoRA/QLoRA) and knowledge distillation to balance cost latency and reasoning capability.
- Architect Foundational RAG: Build next-gen Retrieval-Augmented Generation pipelines using hybrid search cross-encoders and self-correcting retrieval loops.
- Agentic Orchestration: Design and deploy multi-agent systems using frameworks like LangGraph or CrewAI enabling autonomous task planning and tool-use (Function Calling).
- Enterprise-Grade Evaluation: Build LLM-as-a-judge frameworks and robust eval pipelines to measure hallucination rates groundedness and safety.
- Inference Optimization: Implement high-throughput low-latency serving strategies including quantization speculative decoding and prompt caching.
Why Join Us
You will be the technical lead for a mission-critical team setting the standard for how AI is built and scaled. This is a high-impact role where your architecture will directly influence the intelligence of our entire ecosystem.
Qualifications :
To be successful in this role you have:
- Typically requires 8 years of overall software engineering experience.
- Core AI: Expert-level mastery of Transformers attention mechanisms and the latest frontier models (GPT-4o Claude 3.5 Llama 3).
- The Stack: Deep experience with vector databases (Pinecone Weaviate Milvus) orchestration layers (LangChain LlamaIndex) and MLOps tools.
- Software Craft: You are a Staff-level coder in Python/Rust who understands distributed systems concurrency and API design.
- Modern Buzz: You live and breathe Chain-of-Thought (CoT) DSPy GraphRAG and Semantic Caching.
Additional Information :
Work Personas
We approach our distributed world of work with flexibility and trust. Work personas (flexible remote or required in office) are categories that are assigned to ServiceNow employees depending on the nature of their work and their assigned work location. Learn more here. To determine eligibility for a work persona ServiceNow may confirm the distance between your primary residence and the closest ServiceNow office using a third-party service.
Equal Opportunity Employer
ServiceNow is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race color creed religion sex sexual orientation national origin or nationality ancestry age disability gender identity or expression marital status veteran status or any other category protected by addition all qualified applicants with arrest or conviction records will be considered for employment in accordance with legal requirements.
Accommodations
We strive to create an accessible and inclusive experience for all candidates. If you require a reasonable accommodation to complete any part of the application process or are unable to use this online application and need an alternative method to apply please contact for assistance.
Export Control Regulations
For positions requiring access to controlled technology subject to export control regulations including the U.S. Export Administration Regulations (EAR) ServiceNow may be required to obtain export control approval from government authorities for certain individuals. All employment is contingent upon ServiceNow obtaining any export license or other approval that may be required by relevant export control authorities.
From Fortune. 2025 Fortune Media IP Limited. All rights reserved. Used under license.
Remote Work :
No
Employment Type :
Full-time
View more
View less