FLAME clustering

FLAME clustering

Fuzzy clustering by Local Approximation of MEmberships (FLAME) is a novel data clustering algorithm that defines clusters in the dense parts of a dataset and performs cluster assignment solely based on the neighborhood relationships among objects. The key feature of this algorithm is that, the neighborhood relationships among neighboring objects in the feature space are used to constrain the memberships of neighboring objects in the fuzzy membership space.

Description of the FLAME algorithm

The FLAME algorithm is mainly divided into three steps:

# Extraction of the structure information from the dataset:
## Construct a neighborhood graph to connect each object to its K-Nearest Neighbors (KNN);
## Estimate a density for each object based on its proximities to its KNN;
## Objects are classified into 3 types:
### Cluster Supporting Object (CSO): object with density higher than all its neighbors;
### Cluster Outliers: object with density lower than all its neighbors, and lower than a predefined threshold;
### the rest.
# Local/Neighborhood approximation of fuzzy memberships:
## Initialization of fuzzy membership:
### Each CSO is assigned with fixed and full membership to itself to represent one cluster;
### All outliers are assigned with fixed and full membership to the outlier group;
### The rest are assigned with equal memberships to all clusters and the outlier group;
## Then the fuzzy memberships of all type 3 objects are updated by a converging iterative procedure called Local/Neighborhood Approximation of Fuzzy Memberships, in which the fuzzy membership of each object is updated by a linear combination of the fuzzy memberships of its nearest neighbors.
# Cluster construction from fuzzy memberships in two possible ways:
## One-to-one object-cluster assignment, to assign each object to the cluster in which it has the highest membership;
## One-to-multiple object-clusters assignment, to assign each object to the cluster in which it has a membership higher than a threshold.

The optimization problem in FLAME

The Local/Neighborhood Approximation of Fuzzy Memberships is a procedure to minimize the Local/Neighborhood Approximation Error (LAE/NAE) defined as the following:

::E({oldsymbol{p}})=sum_{oldsymbol{x}inoldsymbol{X igg| oldsymbol{p(x)}-sum_{ oldsymbol{y in mathcal{N}(x)} } w_{oldsymbol{xy oldsymbol{p(y)} igg|^2

where oldsymbol{X} is the set of all type 3 objects, oldsymbol{p(x)} is the fuzzy membership vector of object oldsymbol{x}, mathcal{N}(x) is the set of nearest neighbors of oldsymbol{x}, and w_{oldsymbol{xy with sum_{oldsymbol{yin mathcal{N}(x)w_{oldsymbol{xy=1 are the coefficients reflecting the relative proximities of the nearest neighbors.

The NAE can be minimized by solving the following linear equations with unique solution which is the unique global minimum of NAE with value zero:

::p_k(oldsymbol{x})-sum_{oldsymbol{yin mathcal{N}(x) w_{ oldsymbol{xy} } p_k(oldsymbol{y}) = 0, quadforall{oldsymbol{x}in oldsymbol{X} },quad k=1,...,M

where M is the number of CSOs plus one (for the outlier group). The following iterative procedure can be used to solve these linear equations:

::{oldsymbol{p}^{t+1}(oldsymbol{x})} = sum_{ oldsymbol{yin mathcal{N}(x)} }w_{oldsymbol{xy {oldsymbol{p}^t (oldsymbol{y})}

A simple illustration on a 2D testing dataset

External links

* [http://www.biomedcentral.com/1471-2105/8/3 BMC Bioinformatics (2007): FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data]
* [http://flame-clustering.googlecode.com/svn/trunk/ FLAME source codes in C released under FreeBSD-like license on GoogleCode]

ee also

* Data clustering
* Fuzzy clustering


Wikimedia Foundation. 2010.

Игры ⚽ Нужно решить контрольную?

Look at other dictionaries:

  • Fuzzy clustering — is a class of algorithm in computer science. Explanation of clustering Data clustering is the process of dividing data elements into classes or clusters so that items in the same class are as similar as possible, and items in different classes… …   Wikipedia

  • List of statistics topics — Please add any Wikipedia articles related to statistics that are not already on this list.The Related changes link in the margin of this page (below search) leads to a list of the most recent changes to the articles listed below. To see the most… …   Wikipedia

  • List of mathematics articles (F) — NOTOC F F₄ F algebra F coalgebra F distribution F divergence Fσ set F space F test F theory F. and M. Riesz theorem F1 Score Faà di Bruno s formula Face (geometry) Face configuration Face diagonal Facet (mathematics) Facetting… …   Wikipedia

  • cosmos — /koz meuhs, mohs/, n., pl. cosmos, cosmoses for 2, 4. 1. the world or universe regarded as an orderly, harmonious system. 2. a complete, orderly, harmonious system. 3. order; harmony. 4. any composite plant of the genus Cosmos, of tropical… …   Universalium

  • HSL and HSV — Fig. 1. HSL (a–d) and HSV (e–h). Above (a, e): cut away 3D models of each. Below: two dimensional plots showing two of a model’s three parameters at once, holding the other constant: cylindrical shells (b, f) of constant saturation, in this case… …   Wikipedia

  • Chemometrics — is the science of extracting information from chemical systems by data driven means. It is a highly interfacial discipline, using methods frequently employed in core data analytic disciplines such as multivariate statistics, applied mathematics,… …   Wikipedia

  • Flow cytometry — Analysis of a marine sample of photosynthetic picoplankton by flow cytometry showing three different populations (Prochlorococcus, Synechococcus, and picoeukaryotes) Flow cytometry (abbreviated: FCM) is a technique for counting and examining… …   Wikipedia

  • United States — a republic in the N Western Hemisphere comprising 48 conterminous states, the District of Columbia, and Alaska in North America, and Hawaii in the N Pacific. 267,954,767; conterminous United States, 3,022,387 sq. mi. (7,827,982 sq. km); with… …   Universalium

  • I Ching — For other uses, see I Ching (disambiguation). I Ching Classic of Changes   The …   Wikipedia

  • List of fictional books — A fictional book is a non existent book created specifically for (i.e. within) a work of fiction. This is not a list of works of fiction (i.e., actual novels, mysteries, etc), but rather imaginary books that do not actually exist.UsesSuch a book… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”