Data Mining Extensions

Data Mining Extensions

Data Mining Extensions (DMX) is a query language for Data Mining Models supported by Microsoft's SQL Server Analysis Services product.

Like SQL, it supports a data definition language, data manipulation language and a data query language, all three with SQL-like syntax. Whereas SQL statements operate on relational tables, DMX statements operate on data mining models. Similarly, SQL Server supports the MDX language for OLAP databases. DMX is used to create and train data mining models, and to browse, manage, and predict against them. DMX is composed of data definition language (DDL) statements, data manipulation language (DML) statements, and functions and operators.

Contents

DMX Queries

DMX Queries are formulated using the SELECT statement. They can extract information from existing data mining models in various ways.

Data Definition Language

The Data Definition Language (DDL) part of DMX can be used to

  • Create new data mining models and mining structures - CREATE MINING STRUCTURE, CREATE MINING MODEL
  • Delete existing data mining models and mining structures - DROP MINING STRUCTURE, DROP MINING MODEL
  • Export and import mining structures - EXPORT, IMPORT
  • Copy data from one mining model to another - SELECT INTO

Data Manipulation Language

The Data Manipulation Language (DML) part of DMX can be used to

  • Train mining models - INSERT INTO
  • Browse data in mining models - SELECT FROM
  • Make predictions using mining model - SELECT ... FROM PREDICTION JOIN

Example: a prediction query

This example is a singleton prediction query, which predicts for the given customer whether she will be interested in home loan products.

SELECT
  [Loan Seeker],
  PredictProbability([Loan Seeker])
FROM
  [Decision Tree]
NATURAL PREDICTION JOIN
(SELECT 
   35 AS [Age],
   'Y' AS [House Owner],
   'M' AS [Marital Status],
   'F' AS [Gender],
   2 AS [Number Cars Owned],
   2 AS [Total Children],
   18 AS [Total Years of Education]
)

See also

External links


Wikimedia Foundation. 2010.

Игры ⚽ Поможем решить контрольную работу

Look at other dictionaries:

  • Data Mining Extensions — Dieser Artikel wurde aufgrund von inhaltlichen Mängeln auf der Qualitätssicherungsseite der Redaktion Informatik eingetragen. Dies geschieht, um die Qualität der Artikel aus dem Themengebiet Informatik auf ein akzeptables Niveau zu bringen. Hilf… …   Deutsch Wikipedia

  • Data mining — Not to be confused with analytics, information extraction, or data analysis. Data mining (the analysis step of the knowledge discovery in databases process,[1] or KDD), a relatively young and interdisciplinary field of computer science[2][3] is… …   Wikipedia

  • Data warehouse — Overview In computing, a data warehouse (DW) is a database used for reporting and analysis. The data stored in the warehouse is uploaded from the operational systems. The data may pass through an operational data store for additional operations… …   Wikipedia

  • Data Vault Modeling — is a database modeling method that is designed to provide historical storage of data coming in from multiple operational systems. It is also a method of looking at historical data that, apart from the modeling aspect, deals with issues such as… …   Wikipedia

  • Data transformation — Data transformation/Source transformation Concepts metadata · data mapping data transformation · model transf …   Wikipedia

  • Data extraction — is the act or process of retrieving data out of (usually unstructured or poorly structured) data sources for further data processing or data storage (data migration). The import into the intermediate extracting system is thus usually followed by… …   Wikipedia

  • Data dictionary — Not to be confused with Dictionary (data structure). A data dictionary, or metadata repository, as defined in the IBM Dictionary of Computing, is a centralized repository of information about data such as meaning, relationships to other data,… …   Wikipedia

  • Data warehouse appliance — In computing, a data warehouse appliance consists of an integrated set of servers, storage, operating system(s), DBMS and software specifically pre installed and pre optimized for data warehousing (DW). Alternatively, the term can also apply to… …   Wikipedia

  • Data mart — A data mart (DM) is the access layer of the data warehouse (DW) environment that is used to get data out to the users. The DM is a subset of the DW, usually oriented to a specific business line or team. Contents 1 Terminology 2 Design schemas 3… …   Wikipedia

  • Data store — A data store is a data repository of a set of integrated objects. These objects are modeled using classes defined in database schemas.[1] Data store includes not only data repositories like databases, it is a more general concept that includes… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”