SRE Observability SLO Engineer
Tasks
- Build synthetic monitoring plans
- Build telemetry stack
- Conduct observability health reviews
- Correlate FinOps cost metrics with reliability
- Create synthetic checks
- Define SLIs and SLOs
- Design golden signals dashboards
- Implement error budget burn rate alerting
- Implement symptom-based alerting
- Implement telemetry standards
- Improve MTTR and MTTD
- Maintain SLO dashboards and reports
- Tune alert routing and escalation
Skills/Tech-stack
AWS | AWS X-Ray | Amazon EKS | Ansible | Bash | CloudWatch | CloudWatch Logs | CloudWatch Logs Insights | CloudWatch Synthetics | Datadog | Datadog Query Language | Distributed tracing | ELK | Grafana | Helm | Kube-State-Metrics | Kubernetes | New Relic | Node Exporter | OpenTelemetry | PromQL | Prometheus | Python | Splunk | Structured Logging | Terraform