Principal Site Reliability Engineer

Remote

UltraViolet Cyber

Evolve your security operations into your proactive risk reducing superpower through the combination of MDR with Red Teams that validate every alert.

View all jobs at UltraViolet Cyber

Apply now Apply later

Make a difference here.
UltraViolet Cyber is a leading platform-enabled unified security operations company providing a comprehensive suite of security operations solutions. Founded and operated by security practitioners with decades of experience, the UltraViolet Cyber security-as-code platform combines technology innovation and human expertise to make advanced real-time cybersecurity accessible for all organizations by eliminating risks of separate red and blue teams.
By creating continuously optimized identification, detection, and resilience from today’s dynamic threat landscape, UltraViolet Cyber provides both managed and custom-tailored unified security operations solutions to the Fortune 500, Federal Government, and Commercial clients. UltraViolet Cyber is headquartered in McLean, Virginia, with global offices across the U.S. and in India. 
UltraViolet is seeking a highly skilled Principal Site Reliability Engineer (SRE) with expert-level experience in Amazon Elastic Kubernetes Service (EKS), DevOps, and AWS to enhance the scalability, reliability, and security of our cloud infrastructure. As a key member of our engineering team, you will work across multiple disciplines to ensure the resilience and efficiency of our systems, employing automation and modern DevOps practices to drive operational excellence. This is a highly dynamic role that requires a combination of hands-on expertise, leadership skills, and continuous learning to help mature our infrastructure and reliability processes. 

Work You'll Do:

  • System Reliability & Performance: Ensure the availability, performance, scalability, and security of our cloud-based services using best practices in SRE and DevOps. 
  • Kubernetes & EKS Management: Architect, deploy, and maintain Kubernetes clusters, primarily using Amazon Elastic Kubernetes Service (EKS) 
  • Infrastructure as Code (IaC): Automate infrastructure provisioning, configuration, and management using Terraform, Pulumi, or similar tools. 
  • CI/CD Pipelines: Build, maintain, and enhance continuous integration and continuous deployment (CI/CD) pipelines, optimizing deployment workflows for speed and reliability. 
  • Monitoring & Incident Response: Design and implement comprehensive monitoring, alerting, and logging solutions using tools such as Prometheus, Grafana, and CloudWatchto proactively identify and address system issues. 
  • Security & Compliance: Enforce security best practices, implement access controls, and ensure compliance with industry standards 
  • Capacity Planning & Scaling: Analyze system performance and scalability, implementing proactive strategies to accommodate growth and prevent downtime. 
  • Collaboration & Cross-Functional Leadership: Work closely with Engineering and Product teams to integrate reliability principles into the software development lifecycle. 
  • Incident Management & Root Cause Analysis: Lead post-mortem investigations for critical incidents, identifying actionable improvements to enhance system resilience. 
  • Cost Optimization: Assess and optimize cloud costs while maintaining performance and reliability, leveraging AWS savings plans, right-sizing resources, and improving infrastructure efficiency.  

What You Have:

  • Extensive experience in AWS, with deep expertise in managing EKS clusters, networking, IAM, security groups, and other core AWS services. 
  • Strong proficiency in Kubernetes (EKS, Helm, Kubectl, Operators) with a proven track record of deploying, maintaining, and scaling containerized applications. 
  • Hands-on experience in DevOps tools & methodologies, including Terraform, Ansible or SaltStack, Helm, GitOps, ArgoCD, and CI/CD platforms such as GitHub Actions or Jenkins 
  • Proficiency in scripting and automation using Python, Bash, or Golang to enhance system reliability and efficiency. 
  • Experience with observability and monitoring tools, including Prometheus, Grafana, Loki, or AWS CloudWatch. 
  • Deep understanding of networking principles, including DNS, VPC, Load Balancers, VPNs, and Service Mesh architectures 
  • Strong background in security best practices, including IAM policies, encryption, secrets management, and vulnerability scanning (AWS KMS, HashiCorp Vault, etc.). 
  • Experience working with highly available, distributed systems, including microservices architecture and cloud-native applications. 
  • Previous experience in an Agile or DevOps culture, promoting collaboration, automation, and iterative improvements. 
  • Excellent troubleshooting skills, with the ability to analyze complex system failures and drive solutions. 
  • Strong communication and leadership skills, with the ability to mentor junior engineers and collaborate effectively with cross-functional teams. 
  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience. 

What We Offer:

  • 401(k), including an employer match of 100% of the first 3% contributed and 50% of the next 2% contributed  
  • Medical, Dental, and Vision Insurance (available on the 1st day of the month following your first day of employment)  
  • Group Term Life, Short-Term Disability, Long-Term Disability  
  • Voluntary Life, Hospital Indemnity, Accident, and/or Critical Illness  
  • Participation in the Discretionary Time Off (DTO) Program  
  • 11 Paid Holidays Annually 
UltraViolet Cyber maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect our company's differing products, services, industries and lines of business. Candidates are typically placed into the range based on the preceding factors.
We sincerely thank all applicants in advance for submitting their interest in this position. We know your time is valuable.
UltraViolet Cyber welcomes and encourages diversity in the workplace regardless of race, gender, religion, age, sexual orientation, gender identity, disability, or veteran status. 
If you want to make an impact, UltraViolet Cyber is the place for you!
Apply now Apply later

* Salary range is an estimate based on our InfoSec / Cybersecurity Salary Index 💰

Job stats:  2  1  0

Tags: Agile Ansible Automation AWS Bash CI/CD Cloud Compliance Computer Science DevOps DNS Encryption GitHub Golang Grafana Helm IAM Incident response Jenkins Kubernetes Loki Microservices Monitoring Prometheus Python Scripting SDLC Terraform VPN

Perks/benefits: 401(k) matching Career development Health care Insurance

Region: Remote/Anywhere

More jobs like this

Explore more career opportunities

Find even more open roles below ordered by popularity of job title or skills/products/technologies used.