or , we take or respectively. Otherwise, we find an integer such that , and take [cite web
url=http://www.mathworks.com/access/helpdesk/help/toolbox/stats/prctile.html
title=Matlab Statistics Toolbox - Percentiles
accessdate=2006-09-15] When , the formula gives the median. When is even and , the formula gives the median of the first values.Linked with the percentile function, there is also a weighted percentile, where the percentage in the total weight is counted instead of the total number. In most spreadsheet applications there is no standard function for a weighted percentile. One method for weighted percentile extends the method described above. Suppose we have positive weights , , ,..., , associated respectively with our sample values. Let be the -th partial sum of these weights. Then the formulae above are generalized by taking and
Alternative methods
Many software packages, such as Microsoft Excel, use the following method to estimate the value, , of the percentile of an ascending ordered dataset containing elements with values ;
is then split into its integer component, and decimal component, , such that .
If , then the value for that percentile, , is the first member of the ordered dataset, .
If , then the value for that percentile, , is the member of the ordered dataset, .
Otherwise, and
An alternative method is as above, with calculated as
Relation between percentile, decile and quartile
* P25 = Q1
* P50 = D5 = Q2 = median value
* P75 = Q3
* P100 = D10 = Q4
* P10 = D1
* P20 = D2
* P30 = D3
* P40 = D4
* P60 = D6
* P70 = D7
* P80 = D8
* P90 = D9
Note: One quartile is equivalent to 25 percentile while 1 decile is equal to 10 percentile.
Examples
When ISPs bill "burstable" internet bandwidth, the 95th or 98th percentile usually cuts off the top 5% or 2% of bandwidth peaks in each month, and then bills at the nearest rate. In this way infrequent peaks are ignored, and the customer is charged in a fairer way. The reason this statistic is so useful in measuring data throughput is that it gives a very accurate picture of the cost of the bandwidth. The 95th percentile says that 95% of the time, your usage is below this amount. Just the same, the remaining 5% of the time, your usage is above that amount.
Physicians will often use infant and children's weight and height percentile as a gauge of relative health.
The normal curve and percentiles
Percentiles are often represented graphically, using a "normal curve". A normal curve is always divided in the same respective manner. At the peak, in the center, stands the point of the mean of the distribution being graphed. On both the right and left sides each, the graph is divided into 3 equal parts, 1, 2, and 3 to the right and -1, -2, -3 to the left respectively. The important thing to remember is that at each of these standard deviation represents a fixed percentile. In other words, every standard deviation unit on the axis, including standard deviation units -3 to +3 have specific percentiles that are always paired with them, regardless the data or values in the distribution. So, what are the pairs of percentiles/standard deviation units? -2 = 2.5th percentile; -1 = 16th percentile; 0 = 50th percentile (also the mean of the distribution as previously stated); +1 = 84th percentile; +2 = 97.5th percentile; +3 = 99.8th percentile.
Percentage also becomes a factor in measuring a distribution graphically. On any normal curve, 99.7% of data lies between the -3 and +3 values, 95% between -2 and +2, 68% between -1 and +1, 34% between 0 and -1 or 0 and +1, 16% between -1 and -2 or +1 and +2 and 2.5% between -2 and -3 or +2 and +3. The remaining 0.3% of the data is between -3 and negative infinity or +3 and positive infinity.
ee also
*Quantile
*Quartile
*Decile
*Summary statistics
*Percentile rank
References
http://www.itl.nist.gov/div898/handbook/prc/section2/prc252.htm
External links
* [http://www.wessa.net/perc.wasp Free Online Software (Calculator)] computes Percentiles for any dataset according to 8 different percentile definitions.