Lead Engineer, Site Reliability Engineering
Tasks
- Conduct incident reviews and root cause analysis
- Conduct proof of concept testing
- Create testing and validation plans for environment builds
- Define SLIs and SLOs with development teams
- Develop remediation and mitigation strategies
- Drive observability improvements and unified dashboards
- Evaluate vendor upgrade roadmaps
- Implement automation and self-healing
- Lead infrastructure health monitoring and alerting
- Lead networking reliability training and knowledge sharing
- Perform capacity analysis and forecasting
- Reduce mean time to detect and mean time to mitigate
- Run disaster recovery exercises
Perks/Benefits
Skills/Tech-stack
Alerting | Ansible | Automation | Capacity Planning | Capacity forecasting | Cause analysis | Chef | Disaster Recovery | Dynatrace | EFK | ELK | Grafana | Incident Management | Infrastructure as Code | JSON | Monitoring | Netscout | Observability | OpenTelemetry | Packet Analysis | Packet capturing | Performance Tuning | Prometheus | Reliability Engineering | Root Cause Analysis | Root cause | Site Reliability | Site Reliability Engineering | SolarWinds | Splunk | TCPDump | Terraform | Wireshark | YAML | “as-code”
Education
N/A
Related jobs
-
DevSecOps Engineer - A26187 SGD 70K-100KAWS | AWS Lambda | AWS WAF | Access Management | Amazon AuroraEmployee wellness program | Fun working environment | Growth opportunities | Learning and development opportunitiesMid-level Contract Full TimeSingapore, Singapore, Singapore18h ago
-
App-ID | Cause analysis | Change Management | Configuration backup | DNSSenior-level Full TimeSingapore, Singapore1d ago
-
IT Security Officer SGD 96K-118KAgile | Ansible | Application Security | Application Security Testing | Automated securitySenior-level Full TimeSingapore1d ago
-
Platform & Security Engineering Lead SGD 148K-180KAWS | AmazonEKS | CloudFormation | CloudTrail | DevSecOpsSenior-level Full TimeSingapore1d ago
-
Cloud Infrastructure Engineer / DevOps Engineer SGD 60K-63KAmazon Web Services | Automation | CI/CD | Cloud Security | Cloud platformMid-level Full TimeSingapore, Singapore, Singapore1d ago
-
AWS | Access Management | Ansible | Azure | BackupSenior-level Full TimeCAA-Changi Airport Terminal 2, Singapore1d ago
-
Active Directory | Alerting | Change Management | Domain Controller | File ServerMid-level Full TimeSingapore2d ago
-
Mid-level Full TimeSingapore, Singapore4d ago
-
Asset hardening | Azure | Business impact | Business impact assessment | By DesignSenior-level Full TimeSingapore, Singapore4d ago
-
DevSecOps Engineer SGD 95K-120KAWS Bedrock | AWS CloudFormation | AWS ECS | AWS WAF | Amazon EKSFlexible work practices | Paid learning opportunities | Self-development timeMid-level Full TimeSingapore, SG4d ago
-
Security Engineer SGD 60K-92KAccess Management | BeyondTrust | Cause analysis | Content Disarm Reconstruction | CyberArk24x7 on-call supportMid-level Full TimeSG Ensign Kallang Place, L8 (Left …4d ago
-
Senior-level Full TimeSingapore5d ago
-
API Gateway | API Security | Akamai | Akamai WAF | Application FirewallMid-level Contract Full Time TemporaryLTA HSO B6 02, Singapore5d ago
-
M02 - DevSecOps Engineer SGD 54K-84K.NET | Automation | Azure | Azure Pipelines | Azure environmentsMid-level Full TimeSingapore6d ago
-
Security Engineer -CT-FNC240612 003/01 SGD 60K-92KChange Management | Configuration backups | IP Networking | Incident Management | LinuxMid-level Contract Full TimeSingapore, Singapore, Singapore6d ago
-
Mid-level Full TimeSingapore, Singapore6d ago
-
Infra Security Engineer SGD 60K-96KAnsible | Cause analysis | Elastic Stack | Error budget | GrafanaMid-level Full TimeSingapore, Singapore6d ago
-
Lead Virtualisation Engineer, SRE SGD 160K-222KAnsible | Artificial Intelligence | Automation | Cause analysis | ChefSenior-level Full TimeSingapore6d ago
-
AWS | Alert triage | Automated Baseline Log Review | Azure | Cause analysisMid-level Full TimeTemasek Polytechnic, Singapore6d ago
-
Activity monitoring | App-ID | Cause analysis | Change Management | DNSSenior-level Full TimeSingapore, Singapore7d ago
-
Agile | Automation | Compliance | Cybersecurity | DashboardsMid-level Contract Full TimeMAS: MAS Building, Singapore7d ago
-
Automation | Cloud services | Cyber Threat | Cyber Threat Detection | Digital forensics24/7 SOC environment | Standby DutyMid-level Contract Full TimeMAS: MAS Building, Singapore7d ago
-
.NET | Automated Monitoring | C# | CI/CD | Chaos EngineeringSenior-level Contract Full TimeMAS: MAS Building, Singapore7d ago
-
Database Administrator (Contract) SGD 88K-88KAWS | Always On | Backup and Recovery | Database Automation | Database monitoringMid-level Contract Full TimeMAS: MAS Building, Singapore7d ago
-
API Integration | Asset Management | Attack surface | Attack surface management | AutomationSenior-level Contract Full TimeSingapore, Singapore, Singapore7d ago