Data cluster

Data cluster
Disk structure:
(A) track
(B) geometrical sector
(C) track sector
(D) cluster

In computer file systems, a cluster or allocation unit is the unit of disk space allocation for files and directories. To reduce the overhead of managing on-disk data structures, the filesystem does not allocate individual disk sectors, but contiguous groups of sectors, called clusters.

On a disk that uses 512-byte sectors, a 512-byte cluster contains one sector, whereas a 4-kibibyte (KiB) cluster contains eight sectors.

A cluster is the smallest logical amount of disk space that can be allocated to hold a file. Storing small files on a filesystem with large clusters will therefore waste disk space; such wasted disk space is called slack space. For cluster sizes which are small versus the average file size, the wasted space per file will be statistically about half of the cluster size; for large cluster sizes, the wasted space will become greater. However, a larger cluster size reduces bookkeeping overhead and fragmentation, which may improve reading and writing speed overall. Typical cluster sizes range from 1 sector (512 B) to 128 sectors (64 KiB).

A cluster need not be physically contiguous on the disk; it may span more than one track or, if sector interleaving is used, may even be discontiguous within a track. This should not be confused with fragmentation, as the sectors are still logically contiguous.

The term cluster was changed to allocation unit in DOS 4.0. However the term cluster is still widely used.[1]

  1. ^ Mueller, Scott (2002). Upgrading and repairing PCs, p. 1354. ISBN 0789727455.

See also

External links


Wikimedia Foundation. 2010.

Игры ⚽ Нужна курсовая?

Look at other dictionaries:

  • Cluster — A cluster is a small group or bunch of something. Contents 1 In science 2 In astrophysics 3 In biology and health sciences 4 In computing …   Wikipedia

  • Cluster (spacecraft) — Cluster Operator European Space Agency in international collaboration with NASA Major contractors Dornier GmbH (now part of EADS) Mission type Orbiter Satellite of Earth …   Wikipedia

  • Data Intensive Computing — is a class of parallel computing applications which use a data parallel approach to processing large volumes of data typically terabytes or petabytes in size and typically referred to as Big Data. Computing applications which devote most of their …   Wikipedia

  • Cluster analysis (in marketing) — Cluster analysis is a class of statistical techniques that can be applied to data that exhibit “natural” groupings. Cluster analysis sorts through the raw data and groups them into clusters. A cluster is a group of relatively homogeneous cases or …   Wikipedia

  • Cluster genealogy — is a research technique employed by genealogists to learn more about an ancestor by examining records left by the ancestor s cluster. A person s cluster consists of the extended family, friends, neighbors, and other associates such as business… …   Wikipedia

  • Cluster (Wirtschaft) — Cluster können aus ökonomischer Sicht als Netzwerke von Produzenten, Zulieferern, Forschungseinrichtungen (z. B. Hochschulen), Dienstleistern (z. B. Design und Ingenieurbüros), Handwerkern und verbundenen Institutionen (z. B.… …   Deutsch Wikipedia

  • Cluster labeling — is closely related to the concept of text clustering. This process tries to select descriptive labels for the clusters obtained through a clustering algorithm such as Flat Clustering and Hierarchical Clustering. For example, a cluster of… …   Wikipedia

  • Data-centric programming language — defines a category of programming languages where the primary function is the management and manipulation of data. A data centric programming language includes built in processing primitives for accessing data stored in sets, tables, lists, and… …   Wikipedia

  • Cluster (Satellit) — Cluster ist ein Satellitenprojekt der ESA und NASA zur Erforschung der irdischen Magnetosphäre. Es erlitt 1996 einen Rückschlag beim Fehlstart der ersten Ariane 5 Rakete, ist aber seit Sommer 2000 mit Reservesatelliten in Betrieb. Es besteht aus… …   Deutsch Wikipedia

  • Cluster Exploratory — (CluE) is a National Science Foundation funded program that will use Google IBM cluster technology to analyze massive amounts of data to search for patterns. The cluster will consist of 1,600 processors, several terabytes of memory, and hundreds… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”