PA2025Q3JB090 Site Reliability Engineer (SRE), Cloud Incident Response

Bangkok, Thailand

SS&C Technologies

Leading cloud-based provider of financial services technology solutions. SS&C Technologies owns and maintains the best financial technology in the industry

As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

Job Description

Overall job purpose:

Be part of a global team that ensures the performance, scalability, and reliability of critical cloud-based applications. As part of the Global Investor and Distribution Solutions (GIDS) Platform Services team, you’ll play a key role in keeping our systems running smoothly and efficiently—while helping shape the future of our platform.

What You’ll Do:

  • Collaborate with global teams as part of a follow-the-sun support model.
  • Respond to, troubleshoot, and resolve Level 2 application incidents.
  • Ensure critical applications are effectively monitored using tools like Prometheus and Grafana.
  • Create and maintain dashboards and alerts to enhance visibility into application health.
  • Define, implement, and track key SRE metrics (SLOs, SLIs, error budgets).
  • Partner with development teams to improve application reliability and resilience.
  • Analyze incident trends and recommend improvements to reduce recurrence.
  • Automate repetitive support tasks to improve efficiency.
  • Participate in post-incident reviews and drive reliability initiatives.

Qualifications:

Minimum Qualifications

  • Bachelor’s degree in Computer Science, Computer Engineering, IT, or related field.
  • 5+ years of experience for senior roles; fresh graduates welcome for junior roles.
  • Proficiency in one or more programming languages, preferably Java, JavaScript, or Python.
  • Proven ability to troubleshoot complex systems.
  • Skilled in debugging, code optimization, and automation.
  • Experience with relational databases and data analysis.

Highly Preferred

  • Experience working in Site Reliability Engineer (SRE) roles or incident response environments.
  • Hands-on experience with cloud infrastructure, preferably AWS.
  • Familiarity with observability tools such as Grafana, ELK Stack, or similar.
  • Experience deploying and managing applications on Kubernetes platforms.
  • Strong skills in analyzing and troubleshooting issues in large-scale, distributed systems.

Why You Will Love It Here!

  • Hybrid Work Model and Business Casual Dress Code, including jeans
  • Central location: 6 minutes’ walk from Phrom Phong BTS or 10 minutes’ walk from Sukhumvit MRT
  • Your Future: Retirement Program, Professional Development Reimbursement  
  • Work/Life Balance: Flexible Personal/Vacation Time Off, Sick Leave, Paid Holidays, Business Leave, Maternity Leave, Ordination Leave
  • Your Wellbeing: Medical, Dental, Vision, Life Insurance, Annual Health Check Up, Employee Assistance Program, Parental Leave, Well-Stocked Pantry and Provident Fund Contribution
  • Diversity & Inclusion: Committed to Welcoming, Celebrating and Thriving on Diversity
  • Training: Hands-On, Team-Customized, including SS&C University
  • Paid further education opportunities for employees who are eligible
  • Extra Perks: Bonus Scheme, SS&C Stock(s) Allocation for employees who are eligible
  • Welfare Committee: Discounts on fitness clubs, travel and more!

Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.

Region: Asia/Pacific
Country: Thailand
