Q
Manage Learn to apply best practices and optimize your operations.

What are major operational availability challenges?

With system availability so critical to an organization's health, administrators need to constantly monitor their environments to ensure mission-critical data is not disturbed.

Service-level agreements have always been challenging to meet. Often, the SLAs imposed on an IT department are...

unrealistic. Even if an SLA is not overly ambitious, there are three major challenges that can stand in the way of meeting operational availability goals.

1. Unforeseen events. A natural disaster such as a flood or a fire, for example, could potentially bring down mission-critical systems. Most IT shops plan for these types of situations in hopes of keeping critical workloads online throughout the duration of the disaster. Even so, it doesn't always take something as dramatic as a fire or flood to impact operational availability.

Suppose an organization suffers a security breach or a widespread malware infection. While these types of events might not always force mission-critical systems offline directly and interrupt operational availability, such systems may need to be manually taken offline to prevent further damage or to perform a forensic analysis.

One pixel George Crump, president of Storage
Switzerland, discusses his service-level
objective criteria.

2. System maintenance. Windows-based failover clusters support cluster-aware updates that allow one node to be patched at a time, while the other nodes -- and your workloads -- remain online. As such, it may be tempting to establish an SLA that doesn't leave room for maintenance. Keep in mind, however, that maintenance does not always center on servers or cluster nodes. You might, for example, need to momentarily take a system offline to expand its storage or replace an aging switch.

3. Conditions that give the perception a problem has occurred even when operational availability remains. Suppose your organization's Internet service provider loses connectivity. Although the workloads in your data center remain functional, those attempting to access these resources from the Internet will receive an error message and may assume that the servers hosting the resources do not have system availability.

Next Steps

Case study: System availability key at high-traffic times

VM clusters and high availability help protect data

SLA template for disaster recovery

This was last published in March 2016

Dig Deeper on Disaster Recovery Facilities-Operations

PRO+

Content

Find more PRO+ content and other member only offers, here.

Have a question for an expert?

Please add a title for your question

Get answers from a TechTarget expert on whatever's puzzling you.

You will be able to add details on the next page.

Join the conversation

1 comment

Send me notifications when other members comment.

By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy

Please create a username to comment.

What is your biggest challenge to meeting system availability goals?
Cancel

-ADS BY GOOGLE

SearchSolidStateStorage

SearchCloudStorage

SearchDataBackup

SearchStorage

Close