Software Development Engineer - Product Reliability Engineering

Visa


Job Location:

Austin, TX - USA

Monthly Salary: Not Disclosed
Posted on: 2 days ago
Vacancies: 1 Vacancy

Job Summary

Every time someone taps, swipes, or clicks to pay, Visa infrastructure makes it happen in milliseconds across 200 countries. As a Software Development Engineer on the Product Reliability Engineering (PRE) team, you won't just watch those systems run; you'll be one of the engineers building, automating, and evolving them.

PRE is not a traditional ops team. We are a software engineering organization that treats infrastructure as code, reliability as a product, and automation as a strategic advantage. You'll write Python, build agentic AI tools, manage data platforms, and contribute to the distributed systems that process billions of real-time transactions. From day one you are an engineer, and from day one your work matters.

If you are endlessly curious about how large-scale systems stay resilient, obsess over elegant automation, and want to launch your career at the intersection of AI, infrastructure, and global financial technology, this role was built for you.
 

Build Automation That Scales

Design and ship end-to-end automation for deployment pipelines, infrastructure provisioning, and release orchestration: code that runs millions of times so engineers never have to repeat themselves.

Write clean, production-grade Python (and Go or Bash where it counts) to eliminate toil, reduce manual intervention, and make systems self-managing.

Develop modular frameworks for release scheduling, validation, rollback, and reporting that integrate across the full software delivery lifecycle.

Manage & Evolve Data Platforms

Support the build, deployment, and operations of relational database systems, contributing to schema design, architecture decisions, and solution engineering for critical payment data infrastructure.

Gain exposure to real-time event streaming architectures that support payment processing at scale.

Perform database health operations, including patching, upgrades, backups, and recovery, to maintain the availability and integrity of tier-1 production databases.

Optimize query performance through index tuning, execution plan analysis, and replication monitoring, targeting metrics like query execution time, CPU usage, and replication latency.

Automate database tasks and configuration management using tools like Ansible and Liquibase, and contribute to CI/CD pipelines that safely govern schema changes through TEST and PROD environments.

Build predictive and reactive monitoring dashboards for database anomalies, surfacing health signals before they become incidents.

Ship Agentic AI & ML-Powered Tools

Build GenAI-powered engineering assistants that automate deployment orchestration, release governance, and environment lifecycle management.

Integrate LLMs into observability, incident response, and developer support workflows, transforming reactive operations into proactive, AI-driven intelligence.

Contribute to prompt engineering, model fine-tuning, and agentic automation initiatives that position PRE as one of the most AI-forward reliability organizations in financial technology.

Own Observability & Platform Health

Build dashboards, alerts, and metrics using Prometheus, Grafana, Splunk, or ELK that give engineers real-time clarity on complex, globally distributed systems.

Analyze system performance and availability data and turn insights into infrastructure improvements that prevent incidents before they occur.

Contribute to self-healing and auto-scaling capabilities that keep critical payment infrastructure resilient without human intervention.

Engineer for Reliability & Security

Ensure infrastructure and data platforms meet security and compliance standards across cloud-native deployments supporting global financial services at scale.

Support zero-downtime deployment strategies and high-availability architectures that Visa's partners and billions of cardholders depend on around the clock.

Participate in threat modeling, vulnerability remediation, and audit readiness activities as part of a team that treats security as a first-class engineering concern.

Collaborate, Learn & Grow Fast

Embed within Agile squads, working alongside senior engineers, product managers, and global PRE peers across sprint planning, reviews, and release discussions.

Document runbooks, SOPs, and engineering guides that make the team smarter, faster, and more autonomous over time.

Participate in on-call rotations (with robust support structures and mentorship) to build the incident response instincts that distinguish great reliability engineers.


Qualifications:

Basic Qualifications:

  • Bachelor's degree OR 3 years of relevant work experience

Preferred Qualifications: 

Bachelor's degree in Computer Science, Software Engineering, or a related technical field ( graduates preferred; December 2025 graduates welcome).

Solid foundations in data structures, algorithms, and systems design: you can reason about complexity, tradeoffs, and failure modes.

Proficiency in Python and comfort writing scripts or tools in at least one additional language (Go, Java, or Bash).

Foundational understanding of relational databases (RDBMS): SQL, data modeling, query optimization, and database connectivity troubleshooting.

Familiarity with Linux/Unix environments and meaningful command-line fluency.

Exposure to cloud platforms (AWS, GCP, or Azure) and a conceptual understanding of containerization (Docker, Kubernetes).

Understanding of CI/CD principles and how modern software delivery pipelines are structured and maintained.

Genuine curiosity about GenAI platforms and agentic systems (OpenAI, Anthropic Claude, LangChain, or similar): hands-on exposure is a plus, but intellectual interest is a must.
 

Bonus Points:

Hands-on experience with infrastructure-as-code tools: Terraform, Ansible, or Pulumi, even from coursework, a capstone, or an internship.

Experience with database CI/CD tooling, particularly Liquibase, for schema change management across environments.

Experience with observability tooling: Prometheus, Grafana, Splunk, ELK, or Datadog.

Database administration exposure: backup/recovery procedures, performance tuning, index management, or replication monitoring.

Familiarity with Git workflows and modern DevOps toolchains (Jenkins, GitHub Actions, ArgoCD).

Academic or project experience with ML frameworks: scikit-learn, PyTorch, or LangChain / LangGraph.

Understanding of networking fundamentals: DNS, load balancing, service mesh, or TCP/IP.

A GitHub profile, personal project, hackathon entry, or open-source contribution that shows us how you think and build.


Additional Information:

Work Hours: Vary depending on the needs of the department.

Travel Requirements: This position requires travel 5-10% of the time.

Mental/Physical Requirements: This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, and frequently operate standard office equipment, such as telephones and computers.

Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.

Visa will consider for employment qualified applicants with criminal histories in a manner consistent with applicable local law, including the requirements of Article 49 of the San Francisco Police Code.

U.S. APPLICANTS ONLY: The estimated salary range for a new hire into this position is 88,000 to 136,900 USD per year, which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors, which may include knowledge, skills, and experience. In addition, this position may be eligible for bonus and equity. Visa has a comprehensive benefits package for which this position may be eligible that includes Medical, Dental, Vision, 401(k), FSA/HSA, Life Insurance, Paid Time Off, and Wellness Program.


Remote Work:

No


Employment Type:

Full-time

About Company

Visa (NYSE: V) is a world leader in digital payments, facilitating transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories. Our purpose is to uplift everyone, everywhere by being the best way to pay and b ...