SRE Platform Engineer
Tasks
- Automate cluster upgrades and OS patching
- Build monitoring dashboards with Prometheus and Grafana
- Create and maintain Terraform and Ansible modules
- Enforce Kubernetes resource quotas and limit ranges
- Harden compute, networking, and storage layers
- Implement policy-as-code guardrails
- Lead root cause analysis for platform outages
- Manage ingress and service mesh architecture
- Provide Tier 3 Kubernetes troubleshooting and escalation
- Provision hardened EKS Kubernetes clusters
- Right-size containerized workloads for performance and cost
- Run smoke, load, and disaster recovery testing
- Standardize runbooks and operating processes
Skills/Tech-stack
AWS | Amazon EKS | Ansible | Argo CD | Certificate management | Datadog | Disaster Recovery | Dynatrace | Encryption | FinOps | Flux | Go | Grafana | Incident Management | Kubernetes | Load Balancing | Networking | Policy-as-Code | Prometheus | Python | Resource Quotas | Root Cause Analysis | Service Mesh | Splunk | Terraform | VPC