Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose – to uplift everyone, everywhere by being the best way to pay and be paid.
Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa.
We're looking for a curious and innovative Site Reliability Engineer to join our team at Visa. Here, you'll be part of a diverse group of problem-solvers who ensure billions of transactions flow seamlessly across the world's largest payment network.
What You'll Do
• Design for Reliability: Architect and implement solutions that keep Visa's application services running with Always On availability.
• Automate Everything: Create intelligent automation for monitoring, deployment, and incident response.
• Lead Investigations: Use your detective skills to solve complex technical puzzles and prevent future incidents.
• Drive Innovation: Contribute to our evolution from traditional applications to cloud-native solutions.
• Collaborate Globally: Work with talented engineers across the world to build, support, and deploy application services.
Why You'll Love It
• Real Impact: Your code will help process millions of transactions, enabling commerce worldwide.
• Growth Opportunities: Regular learning sessions, mentorship programs, and exposure to cutting-edge technology.
• Work-Life Integration: Hybrid work model (2-3 days in office) with flexible scheduling.
• Inclusive Culture: Join a team that actively promotes diverse perspectives and collaborative problem-solving.
Your Experience & Skills
We encourage you to apply even if you don't meet every requirement. We value potential and enthusiasm over perfection.
Core Skills (Some combination of):
• Programming Languages: Extensive knowledge and experience with C++, Java, Python, JavaScript, HTML, CSS. Proficient in writing efficient, maintainable, and scalable code across different programming paradigms.
• Application Technologies: Extensive experience with modern application technologies including Tomcat, Apache, Spring Boot, SQS, JBoss, IBM MQ, IBM DataPower, Hazelcast, Flink, Connect:Direct, and SSL. Skilled in designing and deploying high-performance applications using these technologies.
• Systems and Networking: Strong understanding of Linux/Unix systems and networking. Experienced in leveraging cloud platforms such as AWS, Azure, and GCP for scalable and resilient infrastructure solutions.
• Containerization and DevOps: Proficient in containerization technologies like Kubernetes and Docker. Skilled in using infrastructure-as-code tools such as Terraform and Ansible to automate deployment and management of complex systems.
• Monitoring and Logging: Expertise in monitoring and logging systems including Prometheus, Grafana, Datadog, and the ELK stack.
• Automation and Scripting: Proven track record of automating complex tasks and processes to improve efficiency and reliability using programming languages such as Python, Go, and Java. Adept at writing scripts and developing tools to streamline operations and reduce manual intervention.
Technical Areas You'll Grow In:
• Cloud & System Architecture: Design scalable, resilient systems across hybrid cloud platforms (AWS, GCP, Azure).
• AI/ML Operations: Support and optimize ML model deployment pipelines and monitoring systems.
• Observability & Performance: Master advanced monitoring, tracing, and performance optimization techniques.
• Automation & Intelligence: Build smart alerting systems and automated remediation workflows.
• Distributed Systems: Design and maintain globally distributed payment processing systems.
What Makes You Thrive:
• You're energized by solving complex problems.
• You believe in automation over manual processes.
• You enjoy mentoring others and sharing knowledge.
• You're comfortable with ambiguity and rapid change.
• You value building reliable systems over quick fixes.
Job Responsibilities
• Build in-depth expertise on the 24x7 systems of Visa that support our merchants.
• Resolve support issues within the SLAs defined for each issue priority, and fix bugs.
• Collaborate with the development and product team to improve the overall development process.
• Build and test next-generation monitoring tools to reduce mean-time-to-detect issues.
• Automate production support activities to enhance efficiency and productivity.
• Participate in post-release monitoring and validation. Collaborate with the DEV team to ensure no “release issues” occur in PROD.
• Create application and infrastructure performance plans/models for a highly scalable, low-latency, highly-available, and high-throughput payment processing system.
• Monitor application performance and usage, resolve performance bottlenecks, and build an application capacity model.
• Contribute to capacity planning and disaster recovery exercises.
• Support the triage and troubleshooting of performance degradation incidents in the production environment.
Required Skills
• Hands-on experience in Unix/Linux OS environments, operating system internals, file systems, disk/storage, and networking protocols.
• Working knowledge of VMs and containers.
• Expert knowledge and strong experience using tools such as LoadRunner/JMeter.
• Understanding of APM solutions like AppDynamics, Dynatrace, OPNET, etc.
• Coding knowledge in programming languages such as Java or Python, and in scripting languages.
• Experience defining performance strategy and reporting performance baselines.
• Ability to work independently and be a team player.
• Professional work experience in highly scalable web services.
• Exposure to containerized micro-services architecture and stacks.
Preferred Skills
• Understanding of Disaster Recovery methodologies.
• Experience working with Agile teams.
• Knowledge of monitoring tools like Splunk/Keynote/Grafana.
• A Bachelor's degree in Computer Science or Engineering; a Master's degree is a plus.
• Experience working in fast-paced 24x7 environments.
• Excellent oral and written communication skills.
• Knowledge of GenAI, ChatGPT, and LLMs is a plus.
The Team Culture
You'll join a collaborative team that:
• Celebrates diverse perspectives and approaches to problem-solving.
• Values teaching and learning from each other.
• Promotes work-life balance and sustainable on-call rotations.
• Encourages innovation and experimentation.
• Champions personal growth and career development.
Impact & Growth
In this role, you'll:
• Shape the reliability standards for global payment systems.
• Mentor and be mentored by talented engineers.
• Drive automation and observability initiatives.
• Influence architectural decisions.
• Build systems that process billions in transactions.
This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager.
Basic Qualifications
• 5+ years of relevant work experience with a Bachelor’s Degree, or at least 2 years of work experience with an Advanced degree (e.g. Masters, MBA, JD, MD), or 0 years of work experience with a PhD, OR 8+ years of relevant work experience.
Preferred Qualifications
• 7 to 12 years of work experience with a Bachelor’s Degree or more than 7 years of work experience with an Advanced Degree (e.g., Masters, MBA, JD, MD).
• Bachelor's or Master's degree in Computer Science or related field, or equivalent experience.
• We value hands-on experience and continuous learning over specific degrees.
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.