SRE Platform Engineer
Tasks
- Automate cluster upgrades and OS patching
- Conduct disaster recovery exercises
- Define and enforce Kubernetes resource governance
- Design and deploy hardened EKS clusters
- Implement observability with Prometheus and Grafana
- Implement policy-as-code guardrails
- Lead platform smoke and load testing
- Manage ingress strategy and service mesh architecture
- Perform incident response and root cause analysis
- Provision cloud infrastructure using Infrastructure as Code
- Right-size containerized workloads
- Serve as the tier 3 escalation point for Kubernetes issues
- Standardize runbooks and operating processes
- Troubleshoot pod failures, memory leaks, and network partitions
Skills/Tech-stack
Amazon ALB | Amazon EC2 | Amazon EKS | Amazon MSK | Amazon RDS | Amazon S3 | Ansible | ArgoCD | Certificate management | Datadog | Disaster Recovery | Dynatrace | Encryption | Flux | Go | Grafana | Ingress | Kubernetes | Kubernetes Resource Quotas | Load Balancing | Pod Priority Classes | Pod Resource Limit Ranges | Policy-as-Code | Prometheus | Python | Root Cause Analysis | Routing | Service Mesh | Splunk | Terraform | VPC