SRE Team Leader
Tel Aviv, IL
Claroty
Claroty secures the Extended Internet of Things (XIoT) to achieve unmatched visibility, protection, and threat detection across all cyber-physical systems – OT, IoT, BMS, IoMT and more – in your environment.Description
We’re growing and looking to hire SRE Team Leader who embodies our core values: People First, Customer Obsession, Strive for Excellence, and Integrity.
About Claroty:
Claroty is on a mission to secure cyber-physical systems across industrial, healthcare, commercial and public sector environments: the Extended Internet of Things (XIoT). The Claroty Platform integrates with customers’ existing infrastructure to provide a full range of controls for visibility, exposure management, network protection, threat detection, and secure access. Our solutions are deployed by over 1,000 organizations at thousands of sites across all seven continents.
Claroty is headquartered in New York City, with employees across the Americas, Europe, Asia-Pacific, and Tel Aviv. The company is widely recognized as the industry leader in cyber-physical systems protection, with backing from the world’s largest investment firms and industrial automation vendors, as well as recognition from KLAS Research as Best in KLAS for Healthcare IoT Security, the Deloitte Technology Fast 500, the Forbes Cloud 100, and the Fortune Cyber 60.
About the Engineering team
The Claroty Engineering (R&D) Department is a group of talented engineers with specialties including BE, FE, DevOps, QA and automation, who come from a variety of backgrounds and organizations with strong experience and skills in software development and cybersecurity.
Our engineers use the most state-of-the-art technologies available to build our products – from Kafka to K8s, Spark and latest React/Angular, AWS lambdas & Argo workflows and many others – ensuring the fastest and highest-quality delivery to our customers.
We are solving some of the most complex technical challenges in the industry today – anything from OS level in-depth activity, networking traffic analysis, big-data analysis, multi-tenancy architecture and limited resources design and implementation to cope with high performance requirements, and sophisticated UX concepts.
Overview
The SRE and NOC Team Leader is tasked with leading, coordinating, and overseeing the management of the Production Cloud environment and infrastructure. This role includes ensuring efficient, seamless rollouts, high system performance, and quick response times when disruptions occur. This professional works collaboratively on the SRE and NOC side to balance rapid technology rollouts and upgrades with reliability and dependability.
The SRE and NOC Team Leader role is a strategic and critical position tasked with leading, coordinating, and overseeing the management of two international squads:
The Site Reliability Engineering (SRE) team is based primarily in Israel and the US, and the 24/7 Network Operations Center (NOC) squad will be based in a location to be determined.
The role requires the candidate to be available mainly during local working hours.
A significant part of this role also includes reestablishing and recruiting members to both squads and defining and implementing relevant tools and processes.
This is a tremendous opportunity for the right candidate to build these squads from the ground up, shaping the future direction of our SRE and NOC operations.
Responsibilities
As an SRE Team Leader, Your impact will be:
Site Reliability Engineering (SRE)
- Production Gatekeeper: Design and enforce the rollout strategy for new technologies and oversee their execution to ensure minimal disruption to existing systems.
- Production On-Call: Act as the first line of response for critical incidents, assessing issues, triaging, and coordinating with the team to prevent further issues and swiftly restore services.
- Monitor Production Performance and Degradation: Keep a close eye on system performance metrics and detect any degradation early to prevent outages and disruptions.
- Production Maintenance: Conduct regular infrastructure upgrades to accommodate changes, developments, and advancements in the technological landscape.
- Manage Release Flow: Oversee the release of updates and new functionalities, ensuring a seamless transition while handling any potential negative impacts on production.
- Staging Management: Oversee the management of the staging environment, ensuring that it accurately represents the production environment for effective testing and simulation.
Network Operations Center (NOC)
- Build Playbooks: Develop and maintain comprehensive playbooks for managing system issues and incidents, setting guidelines for troubleshooting, escalation, and resolution processes.
- Build Monitoring Dashboards: Design, set up, and maintain monitoring dashboards to visualize and track system performance and incidents in real-time.
- Alerts and Incident Management: Establish protocols for issuing alerts in the event of system issues or anomalies and lead the team in incident resolution.
Requirements
What do you need to succeed in this role?
- Proven experience in SRE/DevOps roles (NOC role - advantage) and team management experience
- Strong leadership qualities and team management skills.
- Tech stack - Jenkins, TF, Ansible, Bash, Python, AWS, Argo
- Expertise in system monitoring and incident management tools
- Exceptional problem-solving and analytical skills
- Excellent written and verbal communication abilities.
- A Bachelor's degree in Computer Science, Information Technology, or a related field - Advantage
- Familiarity with Agile methodologies
Why Claroty? Our Culture and Benefits:
- Claroty is a people first company. With strong bonds amongst the team, we believe in prioritizing personal care and support over work, confident that results follow from a harmonious environment. We celebrate professional and personal successes, committed to fostering a diverse and inclusive space.
- Stability, we demonstrate continued growth over the past few years, raised over 700M$ from top tier investors, we have top tier board members and our products are sold worldwide, over 1000 customers.
- We understand the importance of maintaining a healthy work-life balance, and encourage people to take the time they need to rest and prioritize their mental and physical health. We also provide a biannual “ClaroBreak”, a company-wide long weekend shutdown so we can all rest, recharge and spend time with our loved ones.
- We care about your development. At Claroty, we prioritize excellence and uphold high professional and ethical standards. We encourage career growth and exploration within the company, facilitated by biannual performance reviews, feedback sessions, and individual development planning, complemented by professional courses.
- We believe in transparency and openness. That’s why we regularly hold company all-hands, town hall meetings, and “Coffee with the CEO” sessions. We also conduct round table sessions and employee satisfaction surveys, to keep a pulse on what matters most to our team members and make our culture the best it can be.
- While we have physical offices in New York, Tel Aviv, London and Singapore, we also embrace a hybrid working culture. This flexibility allows us to tap into a diverse talent pool and enables our team members to work in a way that suits their individual preferences and circumstances.
Claroty is an equal-opportunity employer committed to fostering a diverse and inclusive work environment for all. We encourage applications from candidates of ALL diverse backgrounds, and special accommodations are available upon request in all selection phases.
* Salary range is an estimate based on our InfoSec / Cybersecurity Salary Index 💰
Tags: Agile Ansible Automation AWS Bash Cloud Computer Science DevOps Industrial Internet of Things IoT Jenkins Kafka Kubernetes Monitoring NetOps Python R&D Strategy Threat detection
Perks/benefits: Career development Health care Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.