Director, Infrastructure & Site Reliability Engineering
MXN 580K-1050K (estimate) Executive-level Full Time
Tasks
- Align engineering with DevOps and agile
- Architect observability solutions
- Build and scale SRE organization
- Collaborate with security and audit teams
- Communicate technical strategies to executives
- Define SLOs, SLIs, and error budgets
- Define SRE strategy roadmaps
- Drive automation resilience and continuous improvement
- Ensure disaster recovery and high availability
- Ensure infrastructure compliance with security baselines
- Establish performance metrics and development plans
- Implement Infrastructure-as-Code automation
- Influence enterprise architecture decisions
- Lead incident management and root cause analysis
- Lead infrastructure modernization
- Optimize alerting and telemetry
- Oversee VMware clusters and Oracle Linux environments
- Partner with application network and storage teams
- Reduce operational toil with self healing systems
Perks/Benefits
- N/A
Skills/Tech-stack
Agile | Alerting | Ansible | Automation | Cause analysis | Chef | DevOps | Disaster Recovery | Dynatrace | ESXi | Error Budgets | Grafana | High Availability | Incident Management | Infrastructure as Code | Jenkins | Linux | Observability | Oracle Linux | PowerCLI | Prometheus | Python | Reliability Engineering | Root Cause Analysis | Root cause | Service Level | Service level indicators | Service-Level Objectives | Site Reliability | Site Reliability Engineering | Splunk | Telemetry | VMware | “as-code”
Education
N/A
Related jobs
- No jobs found.