Site Reliability Engineer
Tasks
- Automate operations tasks
- Build and improve CI/CD pipelines
- Conduct reliability design reviews
- Define and maintain SLIs/SLOs
- Design and operate containerized workloads
- Drive platform improvements
- Implement canary deployments and blue/green strategies
- Implement observability systems
- Integrate resilience patterns
- Lead incident response and postmortems
- Lead technical discussions on reliability
- Mentor junior engineers
- Monitor SLIs/SLOs
- Optimize cloud performance and costs
- Participate in architecture discussions
Perks/Benefits
- N/A
Skills/Tech-stack
Automation | CD pipelines | CI/CD | CI/CD pipelines | Cloud Agnostic | Cloud-agnostic certification | Containerization | Distributed Systems | ELK | Go | Grafana | IaC | Jaeger | Kubernetes | Monitoring | Observability | OpenTelemetry | Prometheus | Python | Resilience Engineering | Terraform
Education
N/A
Regions
Countries
States
Related jobs
-
APIs | Azure | Azure Functions | Azure Redis | Azure Redis CacheRemote workSenior-level Full TimeRemote but local to Bogotá, Colombia R1d ago
-
API Management | AWS | Azure | Cloud infrastructure | Data SecurityCareer growth opportunities | Flexible hours | Remote workMid-level Full TimeColombia - Remote R9d ago
-
API Development | Blockchain | Crypto Libraries | Cryptography | Distributed SystemsFlexible schedule | Medical insurance | Remote workSenior-level Full TimeLatin America#LATAM, Remote, Colombia R15d ago