Director, Systems Reliability Engineering

USA - CA - 820 S Flower St

The Walt Disney Company

The mission of The Walt Disney Company is to be one of the world's leading producers and providers of entertainment and information.

Job Posting Title:

Director, Systems Reliability Engineering

Req ID:

10105307

Job Description:

At Disney, we’re storytellers. We make the impossible, possible. We are a world-class entertainment and technological leader. Walt’s passion was to continuously envision new ways to move audiences around the world—a passion that remains our touchstone in an enterprise that stretches from theme parks, resorts and a cruise line to sports, news, movies and a variety of other businesses. Uniting each endeavor is a dedication to creating and delivering unforgettable experiences — and we’re constantly looking for ways to enhance these exciting experiences.

The Systems Reliability Engineering (SRE) team helps elevate reliability practices at Disney, promoting and onboarding new technologies, solving sophisticated problems, and integrating with next-generation digital platforms. 

As the Director of Systems Reliability Engineering (SRE), you are responsible for building and leading a team of high-performing Managers and SREs, supporting multiple sophisticated systems, underlying infrastructure, and applications for Disney. Your team uses a software engineering approach to architect, design, build, automate, monitor, and operate applications at scale. This includes building and operating processes, hardware, and software solutions that are efficient, effective, and resilient – helping Disney businesses and supporting teams create, deliver, and power our content, experiences, and products – better, faster, safer, and happier.

What You Will Do:

  • Clearly communicate a vision that defines the team’s purpose, ownership, contribution, and commitment to Disney quality, excellence, and a business-enabling agile DevOps culture – promoting and driving technical innovation

  • Oversee financial planning, test and production environments, architectural fit, compliance, resource scheduling, delivery landmarks, collaboration, and service level objectives

  • Oversee, mentor, and develop managers and their teams of engineers who are proficient in software, system reliability, and security. With that, you will also oversee career management and set operational direction for your areas of ownership

  • Act as a highly technical leader (hands-on if the need arises), comfortable providing senior-level technical direction on enterprise-level projects while being a champion of change and proactive business development

  • Hold high standards, practice continual improvement, and have a focus on the details – all while solving multiple issues simultaneously

  • Deliver stellar customer service experiences to our internal business partners, always looking for opportunities to improve that experience

  • Regularly meet with key partners and collaborators to gain feedback, align with need, and grow overall team impact

  • Hold exemplary leadership and communication abilities (both verbal and written), partnering closely with business, creative, and technology leaders to pitch new insights, and deliver public presentations

  • Own outcomes and resolve project or program delays, conflicts, and performance issues

  • Influence others through data-based persuasion and fact-based arguments

  • Lead others through inspiration, so they follow your example, bring a can-do attitude, and deliver magic to your team and business partners

Required Qualifications & Skills:

  • 12+ years of experience leading engineering teams responsible for software and reliability engineering, supporting and deploying products or services 

  • 5+ years of previous hands-on experience as an engineer

  • Detailed knowledge of core internet protocols (TCP/IP, DNS, HTTP, etc.), programming languages (Python, Go, C++, Ruby, Java, etc.), and application development platforms

  • Experience scripting infrastructure with tools like Terraform or CloudFormation

  • Understanding of configuration management frameworks (Chef, Puppet, Ansible, Salt, etc.)

  • Experience managing multiple hosting environments including public and private cloud solutions

  • Experience with data visualization, metrics, and data analysis

  • Familiarity with test case design, test case management, version control, and configuration management

  • Experience with vulnerability management, defect tracking, and bug reporting 

  • Experience with risk management/risk assessment, and continuous integration/continuous delivery

  • Adept at the overall software development process and agile methodologies

  • Compliance and regulatory knowledge (SOX, HIPAA, GDPR, confidential data management, etc.)

  • Excellent written and verbal communication skills, with a validated ability to develop and deliver presentations geared for Senior and Executive Management

  • Ability to manage budgets, contract language, and vendor relationships

  • Bachelor's degree in Computer Science, Information Systems, Software, Electrical or Electronics Engineering, or comparable field of study, and/or equivalent work experience

Preferred Qualifications:

  • Related certifications (CSTE, CTM, CSM, CSPO, AWS-CDE, AZ-400, CKA, CKAD, CISSP, CEH, CDMP, CRISC, Google Cloud Professional Cloud DevOps Engineer, etc.)

  • Master's degree in a related field

#DISNEYTECH

The hiring range for this position in Washington DC, Bristol CT, and Burbank CA is $223,700 to $300,000 per year. The hiring range for this position in Seattle WA & New York City, NY is $234,400 to $314,300 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate’s geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.

Job Posting Segment:

Enterprise Technology

Job Posting Primary Business:

Cloud & Data Transformation Engineering

Primary Job Posting Category:

Site/System Reliability Engineer

Employment Type:

Full time

Primary City, State, Region, Postal Code:

Burbank, CA, USA

Alternate City, State, Region, Postal Code:

USA - CT - ESPN Building 13, USA - DC - Fox - 1145 17th St NW, USA - FL - Kirkman Point 1, USA - NY - Marvel - NY, USA - WA - 925 4th Ave

Date Posted:

2024-11-14

Tags: Agile Ansible AWS C CEH CISSP Cloud Compliance Computer Science CRISC DevOps DNS GCP GDPR HIPAA Java Puppet Python Risk assessment Risk management Ruby Scripting SOX TCP/IP Terraform Vendor management Vulnerability management

Perks/benefits: Career development Equity / stock options Salary bonus

Region: North America
Country: United States
