Incident Response Engineer
Lehi
DigiCert
DigiCert is the leading TLS/SSL Certificate Authority specializing in digital trust for the real world through PKI, IoT, DNS, Document & Software security solutions.
Who we are
We're a leading, global security authority that's disrupting our own category. Our encryption is trusted by the major ecommerce brands, the world's largest companies, the major cloud providers, entire country financial systems, entire internets of things and even down to the little things like surgically embedded pacemakers. We help companies put trust - an abstract idea - to work. That's digital trust for the real world.
Job Summary
The Incident Response Engineer is a key role within the Site Reliability Engineering operations team, responsible for the deployment, configuration, and optimization of the tools used to detect, investigate, respond to, and manage incidents.
What you will do
- Perform proactive daily monitoring of our services, including reviewing system and application logs, and manage the incident life cycle (detection, confirmation, notification, repair/isolation, escalation, resolution, and reporting) to ensure quick service restoration.
- Repair and recover from hardware or software failures. Coordinate and communicate with impacted stakeholders and clients, escalating where appropriate.
- Work closely with development and engineering teams to help build, maintain, and extend support for all production services.
- Review the entire environment and execute initiatives to reduce failures and defects and improve overall performance.
- Monitor and troubleshoot issues across the entire stack - hardware, software, application and network.
- Demonstrate technical leadership with incident handling and troubleshooting.
- Document current and future configuration processes and policies.
- Assist with the implementation and development of SRE tools and applications.
- Manage and support SRE tools and applications.
- Perform periodic on-call duty as part of a global team.
- Install and manage web certificates (SSL, Client Auth).
- Apply prior working knowledge of Salt, Splunk, Jira, Atlassian Wiki, and New Relic.
What you will have
- 5+ years of experience in IT, Service Operations, or Development Operations related roles.
- 3+ years of experience with deployment tools: Salt, Kubernetes, Docker, Jenkins.
- 5+ years of experience with multiple OS flavors: Linux, AWS.
- 5+ years of experience in the high-tech industry.
- 2+ years of experience with database environments: MySQL, Cassandra.
- 2+ years of experience with multiple programming languages.
- This is a US night-shift position (10:00 pm to 8:00 am Pacific Time), 4 days a week.
Benefits
- Generous time off policies
- Top shelf benefits
- Education, wellness and lifestyle support
DigiCert is an Equal Opportunity employer and is committed to diversity in its workforce. In compliance with applicable federal and state laws, DigiCert prohibits discrimination on the basis of race or ethnicity, religion, color, national origin, sex, age, sexual orientation, gender identity/expression, veteran’s status, status as a qualified person with a disability, or genetic information. Individuals from historically underrepresented groups, such as minorities, women, qualified persons with disabilities, and protected veterans, are strongly encouraged to apply.