IT Surveillance, Incident Management & BCP

Sg. Besi

Maxis

Maxis stands as Malaysia's leading telco company, presenting top-tier offerings including postpaid plans, internet plans, phone plans, and more. Enhance your connectivity with our steadfast services today!

View all jobs at Maxis

Apply now Apply later

Are you ready to get ahead in your career?

  • We want to empower you to turn your ambitions into achievements.
  • We thrive in inclusiveness, diversity and embrace close collaborations for you to create impact for yourself and others.
  • Together, we aim to bring the best of technology to help people, businesses and the nation to be ahead in a changing world.
  • To realise our vision to become Malaysia’s leading converged solutions company, we are looking for a new talent to innovate and grow with us in a culture that values commitment, performance and possibilities.

Why does this job exist and why is it critical?​

The IT Surveillance, Incident Management, and Business Continuity Specialist is to is responsible to monitor & lead the incident management events including coordinating with relevant teams, declaring service level/impact, communicate with stakeholders by providing timely updates on incident status, resolution & postmortem review. The role focuses on ensuring the integrity, availability, and confidentiality of critical systems and data. It also includes developing, maintaining, and testing the Business Continuity Plan (BCP) for smooth recovery from disruptions.

What are you accountable for?

IT Surveillance and Monitoring

  • Continuously monitor IT systems, applications, and networks to detect irregularities or threats.

  • Utilize monitoring tools (e.g., BMC, log management, network monitoring tools) to analyze system behavior and identify security or performance issues.

  • Investigate alerts, logs, and system anomalies to determine their impact and take appropriate action.

  • Generate and review reports related to system performance, availability, and security events.

Incident Management

  • Respond to and manage IT incidents, including system outages, security breaches, and application failures.

  • Coordinate cross-functional teams to troubleshoot, resolve, and mitigate incidents in a timely manner.

  • Maintain an incident management process and ensure documentation of incident reports and post-incident reviews (PIRs).

  • Escalate critical incidents to senior management and provide status updates during the lifecycle of the incident.

  • Conduct root cause analysis (RCA) for major incidents and implement corrective actions to prevent recurrence.

  • Ensure that Service Level Agreements (SLAs) are met during incident resolution.

Business Continuity Planning (BCP)

  • Develop and maintain the organization’s Business Continuity Plan (BCP), ensuring it is aligned with business priorities.

  • Conduct regular risk assessments and impact analysis to identify potential threats to IT operations.

  • Design strategies for disaster recovery, data backups, and redundancy to minimize downtime and data loss.

  • Test and update the BCP regularly, ensuring that all stakeholders are familiar with the procedures.

  • Lead BCP simulations and drills to ensure the readiness of the organization in case of emergencies.

  • Collaborate with key business units to ensure that their continuity requirements are met.

Risk Management

  • Identify potential IT risks (security, operational, or environmental) and develop mitigation strategies.

  • Work closely with cybersecurity teams to prevent and mitigate security threats such as malware, phishing attacks, and data breaches.

  • Maintain compliance with relevant IT and business continuity standards (e.g., ISO 27001, ISO 22301).

End User Computing/Service Desk

  • Proactive Issue Resolution - Implement automation for common tasks (e.g., password resets, access requests) and provide self-service options through a portal. This reduces waiting time and empowers users to solve minor issues independently.

  • AI-Driven Insights: Use analytics to predict common issues before they occur, allowing the service desk to proactively reach out to users or prepare knowledge base articles that address these concerns.

  • Comprehensive Documentation - Maintain a well-organized knowledge base that’s regularly updated and easy to navigate, with guides, video tutorials, and FAQs.

  • User Training Sessions: Conduct periodic training sessions for end users to familiarize them with IT tools, cybersecurity best practices, and self-service options

Reporting and Communication

  • Provide regular status reports on IT surveillance, incident management, and BCP activities.
  • Communicate effectively with stakeholders regarding incidents, risk exposure, and mitigation efforts.
  • Maintain accurate and updated documentation of procedures, policies, and incidents.

What do you need to have for the role?

  • Bachelor's degree in Information Technology, Computer Science, or related field.

  • Total 10-15 years of experience in IT Operations, Incident Management, or Business Continuity Planning.

  • Experience in incident response, disaster recovery, or security operations is highly preferred.

Technical Skills

  • Strong understanding of IT infrastructure (servers, networks, databases, cloud services).

  • Familiarity with monitoring tools (e.g., BMC, SolarWinds, Nagios, Splunk).

  • Knowledge of incident management tools (e.g., BMC, Jira, PagerDuty).

  • Understanding of cybersecurity principles and common IT risks.

  • Experience with disaster recovery technologies (e.g., backups, data replication).

Competencies

  • Strong troubleshooting skills with the ability to quickly diagnose issues.

  • Capable of conducting root cause analysis and implementing effective resolutions.

  • Excellent communication skills, both written and verbal.

  • Ability to communicate complex technical issues to non-technical stakeholders.

  • Ability to work collaboratively in a fast-paced, cross-functional environment.
  • Strong leadership skills to manage teams during incident resolution or BCP drills.

  • Meticulous in monitoring, documentation, and implementation of risk management processes.

Certifications (preferred but not mandatory)

  • ITIL Foundation (Incident Management)
  • Certified Information Systems Security Professional (CISSP)
  • Certified Business Continuity Professional (CBCP)
  • CompTIA Security+

What’s next?

  • Once you’ve applied online, our team will carefully review your application. Due to a high volume of applications, we appreciate your patience to allow for a fair and timely review process.
  • Should you be shortlisted for the role, we will send you an invitation via email for a digital interview. You can also check on your application status by logging into your candidate account.

Maxis values diverse voices & people. We hire and reward our employees based on capability & performance — regardless of ethnicity, gender, age, education, religion, nationality or physical ability.

Apply now Apply later

* Salary range is an estimate based on our InfoSec / Cybersecurity Salary Index 💰

Job stats:  0  0  0

Tags: Analytics Automation CISSP Cloud Compliance CompTIA Computer Science Incident response ISO 22301 ISO 27001 ITIL IT infrastructure Jira Malware Monitoring Nagios Risk assessment Risk management SLAs Splunk Surveillance

Perks/benefits: Career development Team events

Region: Asia/Pacific
Country: Malaysia

More jobs like this

Explore more career opportunities

Find even more open roles below ordered by popularity of job title or skills/products/technologies used.