Dirty data

Dirty data

Dirty data is a term used by Information technology (IT) professionals when referring to inaccurate information (data) collected from data capture forms. It is also used to refer to data which has not yet been committed to the database, and is currently held in memory.

Dirty data can be misleading, incorrect, without generalized formatting, incorrectly spelled or punctuated, entered into the wrong field or duplicated. Dirty data can be prevented using input masks or validation rules, but completely removing such data from a source can be impossible or impractical

There are several causes of dirty data. In some cases, the information is deliberately distorted. A person may insert misleading or fictional personal information which appears real. Such dirty data may not be picked up by an administrator or a validation routine because it appears legitimate. Duplicate data can be caused by repeat submissions, user error or incorrect data joining. There can also be formatting issues or typographical errors. A common formatting issue is caused by variations in a user's preference for entering phone numbers.

See also

References



Wikimedia Foundation. 2010.

Игры ⚽ Нужно решить контрольную?

Look at other dictionaries:

  • Data quality — Data are of high quality if they are fit for their intended uses in operations, decision making and planning (J. M. Juran). Alternatively, the data are deemed of high quality if they correctly represent the real world construct to which they… …   Wikipedia

  • Data mining — Not to be confused with analytics, information extraction, or data analysis. Data mining (the analysis step of the knowledge discovery in databases process,[1] or KDD), a relatively young and interdisciplinary field of computer science[2][3] is… …   Wikipedia

  • Data cleansing — Not to be confused with Sanitization (classified information). Data cleansing, data cleaning, or data scrubbing is the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database. Used… …   Wikipedia

  • Dirty Work (1998 film) — Dirty Work Directed by Bob Saget Produced by Robert Simonds …   Wikipedia

  • Dirty Harry: The War Against Drugs — Cover art of Dirty Harry: The War Against Drugs Developer(s) Gray Matter …   Wikipedia

  • Dirty (disambiguation) — Dirty may refer to: Dirty (album), a 1992 alternative rock album by Sonic Youth Dirty (group), a rap duo from Alabama Dirty (computer science), containing data which need to be written back to a larger memory. Dirrty , a song by Christina… …   Wikipedia

  • Data archaeology — refers to the art and science of recovering computer data encrypted in now obsolete media or formats. Data archaeology can also refer to recovering information from damaged electronic formats after natural or man made disasters. The term… …   Wikipedia

  • Data Panik — were a Scottish rock band Steven Clark (Sci fi Steven), John Clark (John Disco), Amanda MacKinnon (Manda Rin) (all formerly of bis), Stuart Memo (of Multiplies) and Graham Christie (ex Kenickie tour drummer). Their debut single, Cubis (I Love… …   Wikipedia

  • Dirty paper coding — In telecommunications, dirty paper coding (DPC) is a technique for efficient transmission of digital data through a channel subjected to some interference known to the transmitter. The technique consists of precoding the data in order to cancel… …   Wikipedia

  • Dirty Work (film) — Infobox Film name = Dirty Work image size = 200 writer = Frank Sebastiano Norm Macdonald Fred Wolf starring = Norm Macdonald Jack Warden Artie Lange Traylor Howard Don Rickles with Christopher McDonald and Chevy Chase Uncredited: Chris Farley… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”