Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailNot Disclosed
Salary Not Disclosed
1 Vacancy
Location: hybrid from Milton Keynes (min 3 days/month in office)
Team: Engineers across UK US India and Philippines
Reports to: CTO (US)
Works closely with: Head of Engineering (India)
Were a scrappy well-funded (4.5 million seed closed) AI startup turning raw customer feedback into real-time insight for businesses that care about CX. Our 2025 roadmap is ambitious: break apart our monolith into microservices double our AI-driven workflows and harden infrastructure for 100 traffic. Everyone ships. Everyone is on-call. Bureaucracy is nil. Velocity is high.
Define service boundaries and lead the transition to a microservices architecture. Implement REST SQS communication between services containerized via ECS Fargate. Youll design services that scale not snowball.
Build features using OpenAI APIs today and pave the path for tomorrow: think private model deployment vector DBs prompt orchestration frameworks and usage monitoring. Youll lead the evolution toward multi-model support caching layers and Bedrock/RAG-native infrastructure.
Design training/inference workflows. Spin up model-serving infra on AWS (Bedrock SageMaker or container-based). Help make our AI systems observable secure and cost-efficient. Youll apply DevOps instincts to support LLM-powered production systems at scale.
Were not wrapping GPT were building composable infrastructure for experimentation scale and optional self-hosting. Youll help define boundaries between orchestration and inference expose tracing and prompt history and make systems that our team can iterate on without chaos.
Define and enforce robust testing strategies: unit integration and load. Youll design systems that are testable by default with clear mocks interface contracts and fast CI.
Youll own delivery for complex features breaking them down into sensible milestones identifying hidden risks and clearly communicating tradeoffs. We dont over-spec; we trust senior engineers to lead the build and help shape the spec.
Design systems with first-class telemetry: structured logs metrics traces and alerting. Youll help us make LLM behavior debuggable and traceable from token usage to prompt mutation.
We handle sensitive customer data. Youll make security a first-class concern in system designPII handling IAM design secrets management rate limiting and GDPR-readiness.
Youll work primarily in soon!) using MongoDB (Mongoose) Redis and job schedulers (cron/EventBridge).
Help keep our infrastructure cloud-agnostic. Were AWS-first today but use modular Terraform so we can pivot to GCP for customer workloads if needed.
Lead code reviews pair with engineers and mentor the team. Youll help reinforce best practices without slowing things down. Youll also know when to lean on AI tools (and when not to).
Full Time