Site Reliability Engineer (SRE)

Tel Aviv-Jaffa, Tel Aviv District, IL

Apply now Apply later

Description

Dream is a pioneering AI cybersecurity company delivering revolutionary defense through artificial intelligence. Our proprietary AI platform creates a unified security system safeguarding assets against existing and emerging generative cyber threats. Dream's advanced AI automates discovery, calculates risks, performs real-time threat detection, and plans an automated response. With a core focus on the ""unknowns,"" our AI transforms data into clear threat narratives and actionable defense strategies. 

Dream's AI cybersecurity platform represents a paradigm shift in cyber defense, employing a novel, multi-layered approach across all organizational networks in real-time. At the core of our solution is Dream's proprietary Cyber Language Model, a groundbreaking innovation that provides real-time, contextualized intelligence for comprehensive, actionable insights into any cyber-related query or threat scenario. 

We are seeking an experienced Senior Site Reliability Engineer to join our SRE team as part of our Platform Engineering group. This role involves taking ownership of monitoring, deploying, and ensuring the reliability of production-grade modern SaaS platforms across Cloud and On-Premise environments. 

 

Responsibilities

  • Lead initiatives to enhance product reliability and system readiness. 
  • Design and implement sophisticated monitoring solutions to ensure high availability and performance of our production platform. 
  • Oversee and refine the entire product reliability pipeline. 
  • Proactively troubleshoot and resolve issues across production environments. 
  • Champion an "Everything as Code" approach using a wide range of technologies including Ansible, Terraform, Helm, Python and more. 
  • Develop advanced tools for automation, deployment, monitoring, and operations. 
  • Exhibit excellent communication and interpersonal skills to effectively collaborate within the team and across departments. 
  • Promoting best practices in reliability and system operations. 


Requirements

None

Skills

  • At least 4-5 years of experience as a DevOps or Site Reliability Engineer. 
  • In-depth knowledge of microservices architectures and technologies such as Kubernetes. 
  • Comprehensive understanding of cloud & on-prem environments and hybrid solutions. 
  • Proficiency with one or more major cloud providers. (AWS experience is an advantage) 
  • Advanced experience with CI/CD technologies including Jenkins, GitHub Actions, and ArgoCD. 
  • Proficient coding and scripting capabilities in Python, Bash, or similar languages. 
  • Strong team player with proven ability to lead and inspire. 


Advantages:

  • Prior experience with endpoint security products (agents, sensors, collectors). 
  • Background in working with AI components (training, inference, serving). 
  • Tech Stack: AWS, Kubernetes, EKS, RKE2, ECS, SageMaker, Jenkins, GitHub, Terraform, Python, Ansible, Docker + Compose, ArgoCD, MongoDB, RabbitMQ, Redis, Go, Neo4J, AI, and more. 


Apply now Apply later

* Salary range is an estimate based on our InfoSec / Cybersecurity Salary Index 💰

Job stats:  0  0  0

Tags: Ansible Artificial Intelligence Automation AWS Bash CI/CD Cloud Cyber defense DevOps Docker Endpoint security GitHub Helm Jenkins Kubernetes Microservices MongoDB Monitoring Neo4j Python RabbitMQ Redis SaaS Scripting Terraform Threat detection

Region: Middle East
Country: Israel

More jobs like this

Explore more career opportunities

Find even more open roles below ordered by popularity of job title or skills/products/technologies used.