At Pythian we are experts in strategic database and analytics services driving digital transformation and operational excellence. Pythian a multinational company was founded in 1997 and started by ensuring the reliability and performance of missioncritical databases. We quickly earned a reputation for solving tough data challenges. We were there when the industry moved from onpremises to cloud environments and as enterprises sought more from their data we expanded our competencies to include advanced analytics.
Today we empower organizations to embrace transformation and leverage advanced technologies including AI to stay competitive. We deliver innovative solutions that meet each clients data goals and have built strong partnerships with Google Cloud AWS Microsoft Oracle SAP and Snowflake. The powerful combination of our extensive expertise in data and cloud and our ability to keep on top of the latest bleeding edge technologies make us the perfect partner to help mid and largesized businesses transform to stay ahead in todays rapidly changing digital economy.
Why you
As a Site Reliability Consultant you will serve as both a technology leader and trusted advisor to our customers while mentoring teammates in cuttingedge tools and approaches. Your projects will focus on infrastructure design and modernization automation of CI/CD pipelines and building out intelligent monitoring and observability systemsspanning Linux Cloud container orchestration and other opensource technologies. Youll become our resident expert for Gitbased source code management artifact repository solutions and Kubernetes in both cloud (e.g. AWS EKS) and onprem environments.
If this is you and you wonder what it would be like to work at Pythian reach out to us and find out!
Intrigued to see what a life is like at Pythian Check out #pythianlife on LinkedIn and follow @loveyourdata on Instagram!
Not the right job for you Check out what other great jobs Pythian has open around the world! Pythian Careers
What you will you be doing:
Operate & Maintain
Administer and optimize platforms such as GitLab (CI/CD pipelines runners) and artifact repository solutions (e.g. JFrog Artifactory).
Maintain and troubleshoot Kubernetes clusterseither in the cloud (AWS EKS) or onprem distributionswith a focus on availability performance and security.
Automation & CI/CD
Champion infrastructure as code using tools like Terraform (or CloudFormation) building repeatable processes for provisioning and updating clusters repos and associated services.
Implement or improve CI/CD pipelines to reduce manual toil and ensure quick reliable deployments across multiple environments.
Monitoring & Incident Response
Design and configure observability solutions (e.g. Prometheus Dynatrace Grafana) to proactively detect and address issues in container orchestration environments code repositories and artifact repositories.
Participate in an oncall rotation troubleshooting incidents at all tiers (from firstcontact resolution to escalation) and driving continuous improvement based on Root Cause Analysis.
Architectural Guidance & Roadmaps
Collaborate with clients to shape infrastructure strategies around container orchestration secure CI/CD and DevSecOps best practices.
Provide leadership and technical direction on automating repetitive administrative tasks enforcing security policies (RBAC TLS container scanning) and adopting GitOps workflows.
Documentation & Mentorship
Create and maintain design documents runbooks and operational playbooks for container platforms CI/CD pipelines and code management services.
Mentor fellow consultants and client stakeholders on Kubernetes infrastructure automation and advanced CI/CD usage to enhance knowledge across the organization.
Process Management
Plan and coordinate maintenance activities ensuring minimal downtime and clear communication with stakeholders.
Provide ITILoriented support (Incident Change Problem Management) and champion continuous improvement of operational processes and service reliability.
What we need from you:
Kubernetes & Containerization
Must have strong experience with container orchestration (Kubernetes Docker) in cloud (AWS EKS) or onprem distributions.
Familiarity with related ecosystem tools (Helm Operators GitOps etc..
AWS & Cloud Expertise
Handson experience using AWS (VPC EC2 EKS IAM S3 etc. including provisioning with IaC tools like Terraform (or AWS CloudFormation).
AWS certifications (Solutions Architect DevOps Engineer) are a plus.
CI/CD & Source Code Management
Experience setting up GitLab or similar platforms (GitHub Bitbucket) for CI/CD pipelines managing runners and integrating code scanning.
Familiarity with artifact repository solutions (e.g. JFrog Artifactory) including repository creation access controls and automation of artifact flows.
DevOps & Automation
Track record of infrastructure automation using Terraform Ansible Puppet or Chef to reduce manual intervention and ensure repeatable deployments.
Strong scripting skills (Bash Python Go etc. to automate system tasks and streamline operational workflows.
Monitoring & Observability
Experience with modern monitoring stacks (Prometheus Dynatrace Grafana ELK/EFK) for analyzing logs metrics and traces.
Proven ability to design alerts dashboards and runbooks that enable rapid firstcontact resolution.
Linux & Networking
Solid understanding of Linuxbased systems performance tuning and troubleshooting.
Network fundamentals (TCP/IP load balancers DNS NTP etc. and ability to diagnose connectivity or performance issues in complex distributed environments.
Security & Compliance
Familiarity with container security best practices (RBAC TLS vulnerability scanning) and how to apply them at scale.
Understanding of compliance frameworks (HIPAA PCI etc. and data privacy constraints a plus.
Soft Skills & Collaboration
Adept at communicating technical concepts to both engineering and nontechnical stakeholders.
Ability to mentor junior team members champion DevOps culture and contribute to an inclusive knowledgesharing environment.
Education & Experience
Bachelors Degree in Computer Science Information Systems or equivalent experience.
Several years of progressive DevOps or SRE experience managing largescale systems in a production environment.
AI/Automation Tooling
Experience or strong interest in leveraging AIbased services or scripts for operational efficiency and faster issue resolution is highly desirable.
Love your work/life balance: Flexibly work remotely from your home theres no daily travel requirement to an office! All you need is a stable internet connection.
Love your coworkers: Collaborate with some of the best and brightest in the industry!
Love your development: Hone your skills or learn new ones with our substantial training allowance; participate in professional development days attend training become certified whatever you like!
Love your workspace: We give you all the equipment you need to work from home including a laptop with your choice of OS and an annual budget to personalize your work environment!
Love yourself: Pythian cares about the health and wellbeing of our team. You will have an annual wellness budget to make yourself a priority (use it on gym memberships massages fitness and more). Additionally you will receive a generous amount of paid vacation and sick days as well as a day off to volunteer for your favorite charity.
Disclaimer
The successful applicant will need to fulfill the requirements necessary to obtain a background check.
Accommodations are available upon request for candidates taking part in any aspect of the selection process.
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.