Myth of the nines

Myth of the nines

In information technology, the myth of the nines is the idea that standard measurements of high availability can be misleading. Availability is sometimes described in units of nines, as in "five nines", or 99.999%. Having a computer system's availability of 99.999% means the system is highly available, delivering its service to the user 99.999% of the time it is needed.

The myth explained

The myth of the nines is the implicit assumptionFact|date=July 2008 that if the computer is operating 0.99999 of the time, then the user's business is operating 0.99999 of the time. In fact, this is often far from the truth. After an outage, the humans using the computer have to scramble to catch up, perhaps apologising to customers, calling them back, entering data written on paper during the outage, and other unfamiliar chores. In the case of a drive failure, the server downtime might be small, but the time to restore from backup might be considerably longer. A computer outage of a minute might cause a business outage of hours.

A further assumption in this model is that ten outages of one minute each have the same effect on the user as one outage of ten minutes. Again, this is not usually true. If a system is experiencing repeated outages, the user is justified in believing that the system cannot be trusted. In this case, the user may regard the computer as a liability. The user may measure ten one-minute outages over a period of six months as a downtime of six months, while the computers manufacturer measures it as a downtime of ten minutes. There is no way to calculate the number of outages over a given period from the uptime percentage alone.

Also, failure probabilities are not always independent. For example, a system made up of five-nine components does not have five-nine availability: each component must be up for the system to be up, and so the component uptimes must be multiplied to give the system uptime. A system of ten components (e.g., disks, motherboard, PSU, RAM, mains-power, network...), each with 99.999% availability, only has approximately 99.99% overall availability.

In contrast, a system might also be made of redundant parts, such that failure events are independent. This would have the effect of increasing up time, for the system to fail, two or more components need to fail. The best example of this is the RAID, where normally 2 or more drives need to fail for the RAID to fail. In this case, a system such as a level 5 RAID, 2 drives need to fail, and for 99.999 disks, the RAID uptime would be 99.99999999%. There are more complicating factors in redundant systems where the redundant components need to fail within the time it takes for the 1st component to be replaced.

Lastly, in many cases, "scheduled maintenance" is not included within the reliability calculation. So if the computer must be taken down to replace a failing disk, but the downtime is notified a week in advance, it "doesn't count".

See also

* Reliability engineering

External links

* [http://www.jiploo.com/blog/99999-or-five-nines-uptime/ 99.999 or 5 Nines Uptime Article]
* [http://basicstate.com/htm/9999.htm uptime values chart for values down to 90 percent on a daily basis]


Wikimedia Foundation. 2010.

Игры ⚽ Нужна курсовая?

Look at other dictionaries:

  • Myth of Er — A Renaissance manuscript Latin translation of The Republic The Myth of Er is an eschatological legend that concludes Plato s The Republic (10.614 10.621). The story includes an account of the cosmos and the afterlife that for many centuries… …   Wikipedia

  • Network monitoring — The term network monitoring describes the use of a system that constantly monitors a computer network for slow or failing components and that notifies the network administrator (via email, pager or other alarms) in case of outages. It is a subset …   Wikipedia

  • High availability — is a system design approach and associated service implementation that ensures a prearranged level of operational performance will be met during a contractual measurement period. Users want their systems, for example wrist watches, hospitals,… …   Wikipedia

  • Carrier grade — In telecomunications, a carrier grade or carrier class refers to a system, or a hardware or software component that is extremely reliable, well tested and proven in its capabilities. Carrier grade systems are tested and engineered to meet or… …   Wikipedia

  • List of mathematics articles (M) — NOTOC M M estimator M group M matrix M separation M set M. C. Escher s legacy M. Riesz extension theorem M/M/1 model Maass wave form Mac Lane s planarity criterion Macaulay brackets Macbeath surface MacCormack method Macdonald polynomial Machin… …   Wikipedia

  • Amelia Earhart — Earhart redirects here. For the asteroid, see 3895 Earhart. Amelia Earhart Amelia Earhart …   Wikipedia

  • List of unusual units of measurement — For units of measure primarily used in countries where English is not the main language, see the article specific to that country, a list of which can be found in the systems of measurement article. An unusual unit of measurement is a unit of… …   Wikipedia

  • List of common misconceptions — This incomplete list is not intended to be exhaustive. This is a list of current, widely held, false ideas and beliefs about notable topics which have been reported by reliable sources from around the world. Each has been discussed in published… …   Wikipedia

  • literature — /lit euhr euh cheuhr, choor , li treuh /, n. 1. writings in which expression and form, in connection with ideas of permanent and universal interest, are characteristic or essential features, as poetry, novels, history, biography, and essays. 2.… …   Universalium

  • Triple Goddess (Neopaganism) — This article discusses the Maiden, Mother, Crone goddess triad of certain forms of Neopaganism. See triple goddesses for other uses. The Triple Goddess is the subject of much of the writing of Robert Graves, and has been adopted by some neopagans …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”