Lead Cloud Infrastructure Engineer / Site Reliability Engineer (SRE)**
Tasks
- Build automate data and model pipelines
- Design deploy and scale cloud AI ML LLM infrastructure
- Ensure cloud platform reliability
- Implement monitoring, observability, and reliability
- Lead incident response and performance tuning
- Manage infrastructure availability latency and performance
- Optimize Kubernetes environments
- Own infrastructure end to end for scaling and deployments
- Perform 24x7 on call and cost optimization
Perks/Benefits
- N/A
Skills/Tech-stack
AWS | AWS Lambda | Amazon EKS | Argo CD | Argo Workflows | Azure | Azure Functions | Azure Kubernetes | Azure Kubernetes Service | Bash | CI/CD | CloudFormation | Docker | EFK | ELK | Flux CD | GitHub Actions | GitLab CI | GitOps | Go | Google Cloud | Google Kubernetes | Google Kubernetes Engine | Grafana | Helm | Istio | Kubernetes | Kubernetes Engine | Kubernetes Service | Langfuse | Linkerd | Machine Learning | Model Inference | OpenShift | OpenTelemetry | Powershell | Prometheus | Python | RAG | Rancher | SLA | SLI | SLO | Serverless | Service Mesh | Terraform
Education
Related jobs
-
Staff Security Engineer - Product Security USD 230K-275KAI Risk Management Framework | Access Control | Application Security | CI/CD | Cloud SecurityHybrid work | Medical, dental, and vision insurance | Paid time offSenior-level Full TimeSouth San Francisco, California, USA7h ago
-
Application Security Engineer USD 100K-215K800-53 | AWS | Application Security Testing | Azure | CI/CDIn person five days per weekMid-level Full TimeTysons Corner, VIRGINIA, United States12h ago
-
Operations Engineer USD 86K-176KData Feeds | Grafana | Incident Management | Kibana | Nagios24 7 operations environment | Rotating shift scheduleSenior-level Full TimeAnnapolis Junction, MD13h ago
-
AWS | Azure | Big Data | Cloud infrastructure | EncryptionDental insurance | Health insurance | In-office hybrid schedule | Relocation assistance | Vision insuranceSenior-level Full TimeTysons13h ago
-
Software Security Engineer USD 103K-166KAmazon Web Services | Artificial Intelligence | Automation | Cloud Computing | Cloud platformEmployee stock purchase plan | Flexible paid time off | Growth and development fund | Home office support | Parental leaveSenior-level Full TimeRemote, Canada; Remote, US R14h ago
-
Sr. Embedded Detection Analyst USD 140K-207KAI tools | Alert Correlation | Cause analysis | Data Analysis | Detection engineeringSenior-level Full TimeRemote - USA R14h ago
-
Senior Cybersecurity Engineer, Advanced Security USD 145K-204KAPI Security | AWS | Azure | BGP | BGP RoutingSenior-level Full TimeRemote, United States R14h ago
-
[8PP] Senior Cloud Security Engineer USD 123K-188KAWS | Access Control | Application Security | Azure | CSPMSenior-level Full TimeGuadalupe, San José Province, Costa Rica14h ago
-
VMWare Enterprise Engineer USD 90K-132KAria Operations | GitLab | Kubernetes | NSX | Omnissa Horizon24/7 on-call support | Documentation supportMid-level Full TimeHampton, VA14h ago
-
Cloud Systems Administrator, Senior (Job 1331) USD 164K-174KAPI Integrations | ARM Templates | Access Management | Amazon CloudWatch | Amazon Relational Database Service401k matching | Dental insurance | E-learning access | Education assistance | Flexible spending accountsSenior-level Full TimeBethesda, Maryland15h ago
-
Sr. Software Development Engineer - Control Plane, Reliability, Backend (Flexibility on level) USD 112K-160KAWS | Ansible | Backpressure | C++ | CI/CDHybrid workSenior-level Full TimeSan Jose, California, USA15h ago
-
System Administrator - Digital Media & Technology MXN 310K-310KAccess Management | Apple iOS | Automation | Bash | Cloud MigrationPaid time off | Remote work | Work autonomyMid-level Full TimeMexico City R15h ago
-
System Administrator - Digital Media & Technology USD 148K-203KBash | Cloud Migration | ESXi | GitOps | Google WorkspacePaid time off | Remote work | Work autonomy | Work with top companiesMid-level Full TimeLatAm R15h ago
-
Software Engineer USD 131K-229KAWS Batch | AWS Cloud | AWS Cloud Development Kit | AWS IAM | AWS Lambda401k employer match | Employer-covered health insurance | Employer-covered life and disability insurance | Paid government holidays | Paid time offSenior-level Full TimeChantilly, VA15h ago
-
Senior Software Engineer (C++), Intelligence Systems USD 166K-220KC# | C++ | Containerization | Distributed Systems | Edge Computing401k matching | Caregiver leave | Commuter benefits | Dental benefits | Generous time offSenior-level Full TimeReston, Virginia, United States16h ago
-
Principal Systems Engineer USD 140K-140KActive Directory | Amazon Web Services | Backup and Disaster Recovery | Bash | Cloud platformSenior-level Full TimeSaint George, Utah, United States17h ago
-
Application Security Manager CAD 150KApplication Security | Authentication Protocols | Azure | Azure Security | Azure deploymentSenior-level Full TimeCanada - Remote R17h ago
-
Senior Reverse Engineer USD 130K-265KDynamic analysis | Ghidra | IDA Pro | Indicators of compromise | Malware analysisSenior-level Full TimeSan Antonio, TX17h ago
-
IT Systems Administrator USD 85K-100KAD Connect | Azure | Azure AD | Azure AD Connect | Backup and RecoveryMid-level Full TimeTroy, MI, United States17h ago
-
Security Engineer, Product Security USD 106K-212KAWS | Anti-abuse | Application Security | Azure | Cloud SecurityContract extension possibility | Remote workMid-level Full TimeWoodinville, Washington, United States17h ago
-
Senior Software Engineer (Infrastructure and DevOps) USD 166K-220KAzure DevOps | Bash | C++ | CI/CD | Compliance Automation401k match | Commuter benefits | Dental insurance | Disability insurance | Health insuranceSenior-level Full TimeReston, Virginia, United States18h ago
-
Cloud System Architect 2 - Terraform/AWS/Ansible/DevOps USD 130K-270KAWS | Ansible | DevOps | Eucalyptus | Kubernetes401k contribution | Accidental death and dismemberment insurance | Dental insurance | Health Savings Account contribution | Life insuranceSenior-level Full TimeAnnapolis Junction, MD18h ago
-
AWS | AWS CloudFormation | Access Management | Ansible | Docker401k matching | Dental insurance | Disability insurance | Health insurance | Life insuranceSenior-level Full TimeBoston, Massachusetts, United States18h ago
-
AWS | Ansible | Azure | CloudFormation | DockerDental benefits | Generous time off | Healthcare benefits | Life and disability insurance | Mental health resourcesSenior-level Full TimeWashington, District of Columbia, United States18h ago
-
Access Management | Amazon Web Services | Ansible | Cloud Security | CloudFormationHealthcare benefits | Professional development reimbursement | Relocation assistance | Time offSenior-level Full TimeSeattle, Washington, United States18h ago