At Visa we are passionate about making a difference. We lead the way in disrupting fraud from multiple vectors. In this role you will be joining an exciting innovative business new to the Visa family.
At Featurespace we strive to be the worlds best software company at protecting our clients and their customers from fraud attacks. We do that with personality heart and professionalism cultivating an innovative fun and positive team atmosphere where everybody can contribute to solving our clients problems in new innovative ways. We are always seeking to be the best at what we do and make our customers smile.
The Opportunity
In your role as Senior Site Reliability Engineer you will help us achieve our goals and deliver success on behalf of our customers by operating Featurespaces world leading product ARIC Risk Hub as a robust cloud-based SaaS solution. Continuously improving our SaaS offerings features and robustness and building new services and integrations from scratch.
At Featurespace you will participate in designing developing deploying monitoring supporting documenting and troubleshooting our SaaS solution. You will endeavor to make our SaaS solution extremely robust scalable measurable repeatable and cost-effective. You will work with cloud networking storage compute security containerisation orchestration disaster recovery the ARIC Risk Hub application and many other areas as needed. You will collaborate closely with the Cloud Operations team the wider organisation external vendors and customers.
We are looking for an SRE that also has the engineering ability to build/deploy/support a service around a Windows-hosted application.
Responsibilities
- Design and operate production infrastructure for a Windows/IIS-based applications.
- Build and maintain deployment pipelines and configuration management for Windows workloads
- Create tooling and automation around the deployment of a customer-specific Windows-based SaaS product
- Ensure high availability reliability and scalability of Windows services.
- Integrate observability tooling (metrics logs traces) into IIS-hosted services
- Harden Windows infrastructure for security compliance and operational best practices
- Lead incident response for Windows-related systems
- Contribute to internal documentation and deployment guides
- Deploying maintaining monitoring and upgrading production deployments of ARIC Risk Hub SaaS and third-party integrated services
- Building software and systems to manage platform infrastructure and applications
- Continually evaluating and improving our technology and processes to increase quality decrease costs and improve time-to-market
- Periodically testing the service with predictable and unpredictable failures
- Providing 2nd-line operational support for our SaaS customers
- Gathering data and generating reports on the service performance
- Developing and documenting internal processes
- Working with engineering/data science to drive and develop new and improved ARIC Risk Hub capabilities
This is a hybrid position. There is an expectation of 3 days in the office per week.
Qualifications :
Required experience:
- Strong experience running Windows Server in production
- Bachelors Masters or higher qualification in Computer Science or a related field
- In-depth knowledge of IIS PowerShell and Windows internals
- Proven ability to build infrastructure-as-code and CI/CD for Windows environments
- Comfort wrapping a Windows software product with the surrounding infrastructure services automation and observability required to run it as a SaaS offering.
- Hands-on experience administering cloud infrastructure or building cloud-native applications (preferably on AWS)
- Comfortable using AWS EC2
- Proficiency with command-line tools and shell scripting
- Experience with infrastructure as code and configuration management
- Proficiency in one or more programming languages (e.g. Python)
- Solid understanding of networking fundamentals (DNS routing firewalls)
- Experience with version control tools such as Git
- Familiarity with CI/CD pipelines and tools
- Proficient in setting up and managing monitoring metrics and alerting systems
- Experience operating production-grade services at scale
Great to have:
- Experience with tools such as: Terraform SaltStack MongoDB Elasticsearch Kafka Prometheus Grafana or HashiCorp Vault
- Experience with securing applications services and data including authentication authorization TLS and encryption
- Exposure to Kubernetes (administering deploying or developing apps on K8s clusters)
- Understanding of compliance and system hardening in regulated environments (e.g. HIPAA PCI-DSS SOC 2)
Additional Information :
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race color religion sex national origin sexual orientation gender identity disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.
Remote Work :
No
Employment Type :
Full-time