AI Systems Architect

Cloudious LLC

Job Location:

San Francisco, CA - USA

Monthly Salary: Not Disclosed

Posted on: 30+ days ago

Vacancies: 1 Vacancy

Job Summary

Job Title: AI Systems Architect

Location: Dallas, Charlotte, San Francisco Bay Area

Type of Hire: C2C

Job Summary

We are seeking an experienced AI Systems Architect to design, build, and scale high-performance distributed AI systems. The ideal candidate will have deep expertise in GenAI, LLMs, and cloud-native architectures, along with hands-on experience building enterprise-scale AI/ML platforms and agent-based systems.

Must-Have Skills

Strong experience in designing and implementing high-performance, large-scale distributed systems
Proven experience in implementing and deploying AI/ML platforms at scale
Expertise in building agent-based architectures, evaluation frameworks, and prompt/context engineering
Knowledge of MCP (Model Context Protocol) servers
Hands-on experience in LLM inference optimization, including batching and caching strategies
Strong experience with Kubernetes and cloud infrastructure (AWS/Azure/GCP)
Proficiency in at least one programming language (Python, Java, Go, etc.)
Expertise in designing agent data stacks and retrieval systems, including:
Vector databases
Hybrid search
Data freshness strategies
Memory systems
Graph reasoning
BM25 and advanced retrieval techniques

Key Responsibilities

Architect and deliver scalable, high-performance distributed systems
Design and deploy AI/ML and GenAI platforms at enterprise scale
Build and manage agent-based architectures, including:
Prompt and context engineering
MCP servers
Evaluation frameworks
Optimize LLM inference pipelines for latency, throughput, and efficiency
Design and implement agent data and retrieval systems (vector DBs, hybrid search, memory, graph-based reasoning)
Lead Kubernetes-based cloud-native deployments
Provide technical leadership, architecture governance, and hands-on mentoring to engineering teams

Nice to Have

Experience with RAG (Retrieval-Augmented Generation) frameworks
Familiarity with multi-agent systems and orchestration frameworks
Exposure to real-time data pipelines and streaming architectures
