This is a remote position.
Vertex is looking for Applied AI Engineers who can take large language models from concept to production and integrate them into real products.
We are building a curated pool of senior AI engineers for upcoming roles with partner/ client companies. Selection into the pool is based on experience technical depth and demonstrated production impact.
Responsibilities
- Design and implement LLM-powered features for real-world applications
- Build and maintain RAG pipelines prompt workflows and evaluation loops
- Integrate AI systems with APIs backend services and data stores
- Optimize models for latency cost and reliability in production
- Collaborate with product and engineering teams to ship AI features
Requirements
Requirements
- Strong Python experience and backend engineering fundamentals
- Hands-on experience deploying LLMs or GenAI systems in production
- Experience with embeddings vector databases and retrieval systems
- Ability to reason about trade-offs between models cost and performance
- Clear evidence of shipped AI features used by real users
Benefits
For this role its selection into the Vertex Talent pool based on experience technical depth and demonstrated production impact.
- An energised upbeat environment that strongly fosters employee work-life balance.
- A work culture that rewards goal-oriented professionals who enjoy meeting challenges head-on.
- Amazing personal growth experience
- Working with a motivated and talented team.
- More importantly an opportunity to meaningfully contribute to bringing cutting-edge Tech solutions to life
Required Skills:
Production RAG Architecture (Chunking Retrieval & Re-ranking) Advanced Python & Backend Engineering (FastAPI Asyncio) Vector Database Management (Pinecone Weaviate Milvus) LLM Evaluation Frameworks (Eval Loops & Accuracy Metrics) Model Optimization (Latency Cost & Reliability) Orchestration Frameworks (LangChain LlamaIndex or DSPy) System Integration (REST APIs Data Stores & Microservices) Prompt Engineering & Workflow Design (Structured Outputs/JSON) AI Observability & Monitoring (Tracing & Debugging) Model Selection & Trade-off Analysis
Required Education:
BSc.
This is a remote position. Vertex is looking for Applied AI Engineers who can take large language models from concept to production and integrate them into real products. We are building a curated pool of senior AI engineers for upcoming roles with partner/ client companies. Selection into the p...
This is a remote position.
Vertex is looking for Applied AI Engineers who can take large language models from concept to production and integrate them into real products.
We are building a curated pool of senior AI engineers for upcoming roles with partner/ client companies. Selection into the pool is based on experience technical depth and demonstrated production impact.
Responsibilities
- Design and implement LLM-powered features for real-world applications
- Build and maintain RAG pipelines prompt workflows and evaluation loops
- Integrate AI systems with APIs backend services and data stores
- Optimize models for latency cost and reliability in production
- Collaborate with product and engineering teams to ship AI features
Requirements
Requirements
- Strong Python experience and backend engineering fundamentals
- Hands-on experience deploying LLMs or GenAI systems in production
- Experience with embeddings vector databases and retrieval systems
- Ability to reason about trade-offs between models cost and performance
- Clear evidence of shipped AI features used by real users
Benefits
For this role its selection into the Vertex Talent pool based on experience technical depth and demonstrated production impact.
- An energised upbeat environment that strongly fosters employee work-life balance.
- A work culture that rewards goal-oriented professionals who enjoy meeting challenges head-on.
- Amazing personal growth experience
- Working with a motivated and talented team.
- More importantly an opportunity to meaningfully contribute to bringing cutting-edge Tech solutions to life
Required Skills:
Production RAG Architecture (Chunking Retrieval & Re-ranking) Advanced Python & Backend Engineering (FastAPI Asyncio) Vector Database Management (Pinecone Weaviate Milvus) LLM Evaluation Frameworks (Eval Loops & Accuracy Metrics) Model Optimization (Latency Cost & Reliability) Orchestration Frameworks (LangChain LlamaIndex or DSPy) System Integration (REST APIs Data Stores & Microservices) Prompt Engineering & Workflow Design (Structured Outputs/JSON) AI Observability & Monitoring (Tracing & Debugging) Model Selection & Trade-off Analysis
Required Education:
BSc.
View more
View less