SRE Observability SLO Engineer
Tasks
- Build SLO burn-rate alerts and dashboards
- Build synthetic checks for API health, UI flows, and integrations
- Build telemetry standards for metrics, logs, and distributed traces
- Conduct observability health reviews to improve MTTD and MTTR
- Correlate infrastructure costs with reliability data
- Define SLIs and SLOs
- Define data retention and telemetry cost controls
- Design golden signals dashboards
- Design synthetic monitoring plan
- Drive observability improvement cycle
- Facilitate SLO review cycle
- Implement Kubernetes metrics collection
- Implement symptom-based alerting and alert deduplication
- Publish observability runbook library
- Set alert routing and escalation policies
- Set synthetic monitor thresholds and incident detection integration
- Translate SLOs into customer SLAs
Skills/Tech-stack
AWS CloudWatch | AWS CloudWatch Synthetics | Amazon EKS | Ansible | Bash | Datadog | Distributed tracing | ELK | Elasticsearch | Error budget | Grafana | Kubernetes | Linux | OpenTelemetry | PromQL | Prometheus | Python | Rancher | SLI | SLO | Splunk | Structured Logging | Synthetic Monitoring | Terraform