Principal Site Reliability Engineer

Arlington, VA

Clarity Innovations

Clarity Innovations is a trusted national security partner, dedicated to safeguarding our nation’s interests and delivering innovative solutions that empower the Intelligence Community (IC) and Department of Defense (DoD) to transform data into actionable intelligence, ensuring mission success in an evolving world.

Our mission-first software and data engineering platform modernizes data operations, utilizing advanced workflows, CI/CD, and secure DevSecOps practices. We focus on challenges in Information Warfare, Cyber Operations, Operational Security, and Data Structuring, enabling end-to-end solutions that drive operational impact.

We are committed to delivering cutting-edge tools and capabilities that address the most complex national security challenges, empowering our partners to stay ahead of emerging threats and ensuring the success of their critical missions. At Clarity, we are people-focused and set on being a destination employer for top talent, offering an environment where innovation thrives, careers grow, and individuals are valued. Join us as we continue to lead innovation and tackle the most pressing challenges in national security.

Description: 

We are seeking a Principal Site Reliability Engineer (SRE) to support a mission-critical, on-premises “IT department in a box” model. This role ensures the reliability, security, and performance of a highly complex, compliance-driven hybrid infrastructure. You will provide broad support — from architecture and automation to daily system operations and hardening — with a focus on Red Hat Enterprise Linux (RHEL), Windows integration, and VMware-based virtualization. 

As a senior technical leader, you will drive the execution of site reliability engineering across system design, observability, security posture, and automation frameworks. This role requires deep experience across infrastructure layers and a strong commitment to operational excellence, security compliance, and continuous improvement. 

Key Responsibilities 

Mission-Critical Reliability & Recovery 

  • Lead response and remediation efforts for complex, high-impact infrastructure issues such as:
      • Large-scale file permission repairs
      • System and service recovery
  • Act as the escalation point for high-severity incidents, providing root cause analysis and durable fixes.

Linux & Hybrid Infrastructure Expertise 

  • Apply deep knowledge of Red Hat Enterprise Linux (RHEL) at scale, including SELinux configuration and enforcement. 
  • Design and implement secure, high-availability systems that span VMware, Windows, and Linux environments. 
  • Manage distributed file operations and migration using tools such as rsync, find, and advanced shell scripting. 

Automation and Configuration Management 

  • Lead the development of infrastructure-as-code using Ansible and YAML for system provisioning, state enforcement, and compliance. 
  • Leverage Red Hat Satellite or equivalent tooling to manage configuration drift, patching, and lifecycle policies. 

Security Posture & Compliance Alignment 

  • Own the implementation and enforcement of security hardening measures, including file integrity, SELinux policy tuning, access controls, and secure configuration baselines. 
  • Collaborate with security teams to maintain posture in line with compliance frameworks (e.g., CIS Benchmarks, NIST, STIGs). 

Identity Federation & Systems Integration 

  • Architect and support identity federation solutions that enable seamless authentication across Linux and Windows domains. 
  • Guide integration of Active Directory and SSSD to support hybrid user access and policy enforcement. 

Leadership & Mentorship 

  • Serve as a technical thought leader, translating infrastructure and security goals into actionable engineering initiatives. 
  • Mentor SRE team members and collaborate across engineering, security, and operations to uplift team practices and performance. 

 
Key Technologies & Focus Areas: 

  • Identity & Federation: Red Hat IdM, FreeIPA, Keycloak, SAML, OAuth2, LDAP, Active Directory 
  • Systems & Virtualization: Red Hat Enterprise Linux (RHEL), Windows Server, VMware vSphere/ESXi, Red Hat Satellite, Ansible
  • Automation & Configuration Management: Ansible, Git, CI/CD Pipelines 
  • Security & Compliance: CIS Benchmarks, STIGs, vulnerability scanning, system hardening 
  • Documentation & Planning Tools: Visio, Confluence, Jira, or equivalent 

Required Qualifications: 

  • 8+ years of experience in site reliability, infrastructure engineering, or systems administration. 
  • Advanced proficiency with Red Hat Enterprise Linux (RHEL), SELinux, and VMware vSphere.
  • Deep knowledge of file system management, rsync, and automation using Ansible (including role-based YAML automation). 
  • Proven experience in high-security or compliance-driven environments. 
  • Hands-on expertise in system troubleshooting, recovery, and hardening. 
  • Experience with Windows-Linux integration and cross-platform identity systems. 
  • Clearance required: current TS with SCI eligibility 

Preferred Qualifications: 

  • Experience working in compliance-driven environments (e.g., FISMA, FedRAMP, HIPAA, or similar). 
  • Familiarity with security frameworks and hardening guides (e.g., CIS Benchmarks, STIGs). 
  • Red Hat certifications (RHCE, RHCSA) and/or VMware certifications (VCP). 
  • Scripting experience in Bash, Python, or PowerShell. 
  • Familiarity with modern CI/CD concepts and DevSecOps tooling. 

Work Environment: 

  • On-site role; requires full-time presence in Arlington, VA.
  • May require occasional after-hours or weekend work to meet mission needs or perform system updates.
  • Role demands initiative, ownership, and the ability to operate independently while collaborating across functional areas. 

If you are passionate about creating exceptional user outcomes and thrive in a collaborative team setting, we invite you to apply and be a key contributor to our product development efforts. 

We are an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.

Tags: Active Directory Ansible Automation Bash CI/CD Clearance Clearance Required Compliance Confluence DevSecOps DoD FedRAMP FISMA HIPAA Jira LDAP Linux NIST PowerShell Python Red Hat SAML Scripting STIGs VMware Windows

Region: North America
Country: United States
