Lead Cloud Infrastructure Engineer / Site Reliability Engineer (SRE)
Tasks
- Build and automate data and model pipelines
- Collaborate with engineering teams on reliability and security
- Design, deploy, and scale AI/ML/LLM infrastructure in the cloud
- Ensure cloud infrastructure stability, performance, and security
- Implement monitoring, observability, and reliability best practices
- Lead incident response during on-call rotations
- Manage and optimize Kubernetes environments for AI services
- Own infrastructure end to end and lead scaling, deployments, and automation
- Perform performance tuning and cost optimization
Skills/Tech-stack
AWS | AWS Lambda | ArgoCD | Azure | Azure Functions | Bash | CI/CD | Capacity Planning | CloudFormation | Docker | ELK | Flux | GCP | GitOps | Go | Grafana | Helm | Incident Response | Istio | Kubernetes | Linkerd | OpenTelemetry | PowerShell | Prometheus | Python | SLA | SLI | SLO | Serverless | Service Mesh | Terraform