Homogeneity (statistics)

Homogeneity (statistics)

:For "homogeneity of variance" see homoscedasticity.

In statistics, homogeneity arises in describing the properties of a dataset, or several datasets, and relates to the validity of the often convenient assumption that the statistical properties of any one part of an overall dataset are the same as any other part. In meta-analysis, which combines the data from several studies, homogeneity measures the differences or similarities between the several studies.

Homogeneity can be studied to several degrees of complexity. For example, considerations of homoscedasticity examine how much the variability of data-values changes throughout a dataset. However, questions of homogeneity apply to all aspects of the statistical distributions, including the location parameter. Thus, a more detailed study would examine changes to the whole of the marginal distribution. An intermediate-level study might move from looking at the variability to studying changes in the skewness. In addition to these, quesions of homogeneity apply also to the joint distributions.

The concept of homogeneity can be applied in many different ways and, for certain types of statistical analysis, it is used to look for further properties that might need to be treated as varying within a dataset once some initial types of non-homogeneity have been dealt with.

Examples

Regression

Differences in the typical values across the dataset might initially be dealt with by constructing a regression model using certain explanatory variables to relate variations in the typical value to known quantities. There should then be a later stage of analysis to examine whether the errors in the predictions from the regression behave in the same way across the dataset.

Time series

The initial stages in the analysis of a time series may involve plotting values against time to examime homogeneity of the series in various ways: stability across time as opposed to a trend; stability of local fluctuations over time.

Combining information across sites

In hydrology, data-series across a number of sites composed of annual values of the within-year annual maximum river-flow are analysed. A common model is that the distributions of these values are the same for all sites apart from a simple scaling factor, so that the location and scale are linked in a simple way. There can then be questions of examining the homogeneity across sites of the distribution of the scaled values.

Combining information sources

In meteorology, weather datasets are acquired over many years of record and, as part of this, measurements at certain stations may cease occasionally while, at around the same time, measurements may start at nearby locations. There are then questions as to whether, if the records are combined to form a single longer set of records, those records can be considered homgeneous over time.

Homogeneity within populations

Simple populations surveys may start from the idea that responses will be homogeneous across the whole of a population. Assessing the homogeneity of the population would involve looking to see whether the responses of certain identifiable sub-populations differ from those of others. For example car-owners may differ from non-car-owners, or there may be differences between different age-groups.

ee also

*Heterogeneity

References

* Hall, M.J. (2003) The interpretation of non-homogeneous hydrometeorological time series a case study. "Meteorological Applications", 10, 61–67. ( doi:10.1017/S1350482703005061 )
* Krus, D.J., & Blackman, H.S. (1988).Test reliability and homogeneity from perspective of the ordinal test theory. "Applied Measurement in Education," 1, 79–88 [http://www.visualstatistics.net/Scaling/Homogeneity/Homogeneity.htm (Request reprint).]
* Loevinger, J. (1948). The technic of homogeneous tests compared with some aspects of scale analysis and factor analysis. "Psychological Bulletin," 45, 507–529.

External links

* [http://visualstatistics.net/Scaling/Homogeneity%20and%20Reliability/Homogeneity%20and%20Reliability.htm Reliability and homogeneity.]


Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • Homogeneity — means being similar throughout . Homogeneity may also refer to:* Homogeneous (mathematics), a variety of meanings * In statistics homogeneity can refer to ** Homogeneity of variance: Homoscedasticity ** Logically consistent data matrices:… …   Wikipedia

  • List of statistics topics — Please add any Wikipedia articles related to statistics that are not already on this list.The Related changes link in the margin of this page (below search) leads to a list of the most recent changes to the articles listed below. To see the most… …   Wikipedia

  • Reliability (statistics) — In statistics, reliability is the consistency of a set of measurements or measuring instrument, often used to describe a test. This can either be whether the measurements of the same instrument give or are likely to give the same measurement… …   Wikipedia

  • Homoscedasticity — Plot with random data showing homoscedasticity. In statistics, a sequence or a vector of random variables is homoscedastic (   …   Wikipedia

  • List of mathematics articles (H) — NOTOC H H cobordism H derivative H index H infinity methods in control theory H relation H space H theorem H tree Haag s theorem Haagerup property Haaland equation Haar measure Haar wavelet Haboush s theorem Hackenbush Hadamard code Hadamard… …   Wikipedia

  • Homogeneous (mathematics) — In mathematics, homogeneous may refer to:*Homogeneous polynomial, in algebra *Homogeneous function *Homogeneous equation, in particular: Homogeneous differential equation *Homogeneous system of linear equations, in linear algebra *Homogeneous… …   Wikipedia

  • cosmos — /koz meuhs, mohs/, n., pl. cosmos, cosmoses for 2, 4. 1. the world or universe regarded as an orderly, harmonious system. 2. a complete, orderly, harmonious system. 3. order; harmony. 4. any composite plant of the genus Cosmos, of tropical… …   Universalium

  • McNemar's test — In statistics, McNemar s test is a non parametric method used on nominal data. It is applied to 2 × 2 contingency tables with a dichotomous trait, with matched pairs of subjects, to determine whether the row and column marginal… …   Wikipedia

  • Australia — /aw strayl yeuh/, n. 1. a continent SE of Asia, between the Indian and the Pacific oceans. 18,438,824; 2,948,366 sq. mi. (7,636,270 sq. km). 2. Commonwealth of, a member of the Commonwealth of Nations, consisting of the federated states and… …   Universalium

  • Race and crime in the United States — Race Classification Race (classification of humans) Genetics …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”