Communication, Collaboration & Orchestration: 10 Vital Steps to IT Alerting Automation

No matter how robust your IT infrastructure and IT systems, change and testing processes, it is almost impossible to prevent an application slowdown, major service outage or security breach. This can be quite costly to businesses with Ponemon finding potential losses of almost $9000 per minute. Often human error is to blame causing four out of five data breaches in the UK, according to the ICO, showing that even the most robust system can fail if mismanaged.

The challenge for companies is to plan for these major IT incidents before they occur and when they happen, to be able to respond with the velocity, control and communications that are required in today’s digital world, thereby avoiding huge financial costs from lost revenues or damage to brand reputation.

A fast response is vital because as bad as an initial failure may be, when the situation continues and an outage makes it impossible to give customers or employees accurate updates, the repercussions can only become more serious.

A better approach is to define procedures for the proper recovery of workflows and communicate them to all stakeholders. Alongside this, companies should automate the alerting and response processes for when something goes wrong. Here are 10 important steps that can help minimize the damage in such a scenario:

Develop standard processes for the shutting down and restoration of servers, network gear and their power supplies, including each step to be performed, its expected result, and how to return to a previous safe condition if a change produces an unexpected result.
Require approval by management for any changes to these processes.
Create advisories within the workflow about what warning signs (such as noises or error messages) might signal various failures.
Develop safeguards (such as required approval by a second person) for any action taken during the restoration process that could disrupt business-critical systems.
Conduct periodic tests of the configuration and status of backup power systems and the switches that move the power load to backup sources.
Conduct periodic audits to assure that any new or upgraded hardware is provided for in the backup and restoration plan, and is properly configured for the failover of power and connectivity.
Require periodic tests of the recovery of servers and databases and the rapid updating of production data.
Share the approved processes with all outside service providers and internal stakeholders to assure they are followed.
Automate response workflows when something does go wrong (e.g. when an outage is detected) to assure timely communication of information to the right resolvers and stakeholders such as customers.
Keep audit trails to verify which resolvers received which information, whether they confirmed the receipt of the information, and if they took the required action.

This sort of approach highlights the importance of communication as part of a response plan to a crisis. The ‘call us and we’ll fix it’ model no longer works. Support really needs to be far more proactive, and that means less work that involves ‘firefighting’ or ‘keeping the lights on’. We are living in a time when ‘instant’ is expected. We would not use Google or Bing if it meant typing in a question and waiting 10 or 15 minutes for an answer.

We are constantly told that the new-age worker is always on the go, and this is true, but it is also important to remember that people also spend a good deal of time in meetings, take holidays, and even sleep! All of these things must be accounted for as part of your IT response, and traditional methods can’t solely be trusted to provide the speed and accuracy that modern IT systems require for maintenance. For example, a 24-hour online clothing store caters to a non-stop influx of customers. Any downtime, even at 3am, needs to be quickly addressed and resolved.

Asynchronous communication isn’t suitable for situations requiring rapid response. Businesses need to know beyond any doubt that the other party has received and is responding to their message. What’s more, these messages need to be sent right away and responses received as soon as possible. As the velocity of business increases, more and more rapid responses are required. It is not simply enough to leave a voicemail in the midst of a critical situation and rely on the fact that it will be heard and responded to rapidly.

Instead, an automated system that can process responses as well as sending alerts and notify you on whether critical stakeholders have responded should be deployed. Given the threats businesses face day-to-day and the speed that is required to address them, relying on manual action is no longer acceptable. By taking into account the 10 steps laid out above, companies can ensure that their policies and systems are robust enough to handle the modern threats of today’s cyber-landscape.

Communication, Collaboration & Orchestration: 10 Vital Steps to IT Alerting Automation

Mike Beckett

You may also like

Four Ways to Leverage DevOps for Compliance

ISACA launches security audit programs for BYOD, data privacy and outsourcing

Cybersecurity in 2021: People, Process and Technology to Integrate More Than Ever Before

The Future of Security: AI and Cognitive

Leaving a trace

What’s Hot on Infosecurity Magazine?

New Hacking Campaign Exploits Microsoft Windows WinRAR Vulnerability

Hundreds of Malicious Crypto Trading Add-Ons Found in Moltbot/OpenClaw

Two Critical Flaws in n8n AI Workflow Automation Platform Allow Complete Takeover

Smartphones Now Involved in Nearly Every Police Investigation

AI Drives Doubling of Phishing Attacks in a Year

SolarWinds Web Help Desk Vulnerability Actively Exploited

NSA Publishes New Zero Trust Implementation Guidelines

Cybersecurity M&A Roundup: CrowdStrike and Palo Alto Networks Lead Investment in AI Security

Data Privacy Day: Why AI’s Rise Makes Protecting Personal Data More Critical Than Ever

Over 80% of Ethical Hackers Now Use AI

New CISA Guidance Targets Insider Threat Risks

Number of Cybersecurity Pros Surges 194% in Four Years

Securing M365 Data and Identity Systems Against Modern Adversaries

Five Non-Negotiable Strategies to Get Identity Security Right in 2026

How to Implement Attack Surface Management in the AI and Cloud Age

Cyber Resilience in the AI Era: New Challenges and Opportunities

Safeguarding Critical Supply Chain Data Through Effective Risk Assessment

Dispelling the Myths of Defense-Grade Cybersecurity

Regulating AI: Where Should the Line Be Drawn?

What Is Vibe Coding? Collins’ Word of the Year Spotlights AI’s Role and Risks in Software

Risk-Based IT Compliance: The Case for Business-Driven Cyber Risk Quantification

Bridging the Divide: Actionable Strategies to Secure Your SaaS Environments

NCSC Set to Retire Web Check and Mail Check Tools

Beyond Bug Bounties: How Private Researchers Are Taking Down Ransomware Operations

Communication, Collaboration & Orchestration: 10 Vital Steps to IT Alerting Automation

Written by

You may also like

What’s Hot on Infosecurity Magazine?