Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailCrossteam teamwork build and maintain relationships with the customer teams the user community architects and engineering teams jointly work on key deliverables ensuring production scalability and stability
Effective Root cause analysis of major production incidents and developing learning documentation .
Plan and perform capacity expansion and upgrades in timely manner avoiding any scaling issues and bugs.
Automation of repetitive tasks to reduce manual effort and avoid Human errors.
Tune alerting and setup observability to proactively identify the issues and performance problems.
Work closely with L3 teams in reviewing new use cases cluster hardening techniques for building a robust and reliable platforms.
leverage Devops tools disciplines( Incident problem and change management) and standards in day to operations.
Core Skills (Some combination of:)
10 years of experience with modern middleware technologies. These might include (Tomcat Apache Springboot SQS JBoss IBM MQ IBM DataPower Hazelcast Flink Connect Direct SSL)
Understanding of Linux/Unix systems networking cloud platforms (AWS Azure GCP) containerization (Kubernetes Docker) and infrastructureascode tools (Terraform Ansible).
Proficiency with monitoring tools (Prometheus Grafana Datadog etc.) logging systems (ELK stack Splunk) and tracing tools (Jaeger Zipkin).
Proven track record of automating complex tasks and processes to improve efficiency and reliability using Python Go Java or similar.
Technical Areas Youll Grow In:
Cloud & System Architecture: Design scalable resilient systems across hybrid cloud platforms (AWS GCP Azure)
AI/ML Operations: Support and optimize ML model deployment pipelines and monitoring systems
Observability & Performance: Master advanced monitoring tracing and performance optimization techniques
Automation & Intelligence: Build smart alerting systems and automated remediation workflows
Distributed Systems: Design and maintain globally distributed payment processing systems
What Makes You Thrive:
Youre energized by solving complex problems
You believe in automation over manual processes
You enjoy mentoring others and sharing knowledge
Youre comfortable with ambiguity and rapid change
You value building reliable systems over quick fixes
This is a hybrid position. Expectation of days in office will be confirmed by your Hiring Manager.
Qualifications :
Basic Qualifications
8 years of relevant work experience with a Bachelors Degree or at least 5 years of experience with an Advanced Degree (e.g. Masters MBA JD MD) or 2 years of work experience with a PhD OR 11 years of relevant work experience.
Preferred Qualifications
9 or more years of relevant work experience with a Bachelor Degree or 7 or more relevant years of experience with an Advanced Degree (e.g. Masters MBA JD MD) or 3 or more years of experience with a PhD
Hands on experience working as a Payment System SRE in managing cross platforms.
Excellent Python or Java programming skills for automation requirement for repetitive devops tasks
Person will be responsible to perform SRE and Engineering activities Payment platforms
Understanding of Linux networking CPU memory and storage and resilient system design.
Knowledge on Java and Python is must.
Excellent interpersonal verbal and written communication skills are mandatory for this role.
Hands on experience and knowledge on GenAI LLMs AIOps Copilot is a plus.
Additional Information :
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race color religion sex national origin sexual orientation gender identity disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.
Remote Work :
No
Employment Type :
Fulltime
Full-time