A number of factors can pose a risk to a data center's operational availability, but three are the most pervasive:...
By submitting your personal information, you agree that TechTarget and its partners may contact you regarding relevant content, products and special offers.
power failure, human error and data center-level failure.
Make sure you're aware of these elements that could cause major problems for your organization's data.
The need for a reliable power source is one thing that even the most advanced computers have in common with the much more primitive systems from many decades ago. There are contingencies against power failures, but they may not always be adequate. Backup generators can run out of fuel. Batteries can die. It is even possible for a backup power source to malfunction.
While this operational availability risk is probably the most common, it is also the most difficult to mitigate. Human error can come in many different forms, as the following examples describe:
- An incident in which a secretary entered a data center to give a member of the IT team an important message. While there, the secretary accidentally hit the safety shutoff that cut power to the entire data center.
- An administrator improperly configured failover clusters, causing operational availability risk. The configuration error prevented workloads from failing over in the event of a node failure. Unless failover clusters are periodically tested, a configuration error may not be discovered until a node-level failure actually occurs.
Data center-level failure
This is not the type of event that happens on a daily basis, but, when a data center-level failure does occur, it causes major problems to operational availability. Today, mission-critical workloads are usually run on systems equipped with full hardware redundancy. Since no single point of failure exists within the system itself, the data center can become the single point of failure. An event such as a fire or a hurricane could easily impact a data center and bring down mission-critical workloads.
Eliminate performance issues in storage systems
Data center downtime often caused by human error
Increase system availability with DR plan template
Dig Deeper on Disaster recovery facilities - operations
Related Q&A from Brien Posey
Having a strategy to back up SAP HANA is a must. It's important to decide exactly what you'll be backing up, along with which method best suits your ...continue reading
Picking an NVMe drive is an important decision. Consider thermal control, proprietary software and drive architecture to make the right choice.continue reading
While data compression can effectively reduce space, be careful with how you use it, because the three issues outlined here could cause problems in ...continue reading
Have a question for an expert?
Please add a title for your question
Get answers from a TechTarget expert on whatever's puzzling you.