Daitch–Mokotoff Soundex

Daitch–Mokotoff Soundex

Daitch–Mokotoff Soundex (D–M Soundex) is a phonetic algorithm invented in 1985 by Jewish genealogists Gary Mokotoff and Randy Daitch. It is a refinement of the Russell and American Soundex algorithms designed to allow greater accuracy in matching of Slavic and Yiddish surnames with similar pronunciation but differences in spelling.

Daitch–Mokotoff Soundex is sometimes referred to as "Jewish Soundex" and "Eastern European Soundex", although the authors discourage use of these nicknames for the algorithm because the algorithm itself is independent of the fact the motivation for creating the new system was the poor results of predecessor systems when dealing with Slavic and Yiddish surnames.

Contents

Improvements

Improvements over the older Soundex algorithms include:

  • Coded names are six digits long, resulting in greater search precision (traditional Soundex uses four characters)
  • The initial character of the name is coded.
  • Several rules in the algorithm encode multiple character n-grams as single digits (American and Russell Soundex do not handle multi-character n-grams)
  • Multiple possible encodings can be returned for a single name (traditional Soundex returns only one encoding, even if the spelling of a name could potentially have multiple pronunciations)

Examples

Some examples:

Surname American Soundex D–M Soundex
Peters P362 739400, 734000
Peterson P362 739460, 734600
Moskowitz M232 645740
Moskovitz M213 645740
Auerbach A612 097500, 097400
Uhrbach U612 097500, 097400
Jackson J250 154600, 454600, 145460, 445460
Jackson-Jackson J252 154664, 454664, 145466, 445466, 154646, 454646, 145464, 445464

Beider–Morse Phonetic Name Matching Algorithm

To address the large number of false positive results generated by the D–M Soundex, Stephen P. Morse and Alexander Beider created the Beider–Morse Phonetic Name Matching algorithm[1]. This new algorithm cuts down on false positives at the expense of some false negatives. A number of sites are offering the B–M soundex in addition to the D-M soundex[2].

See also

Notes

  1. ^ Beider–Morse Phonetic Matching: An Alternative to Soundex with Fewer False Hits - copy of Avotaynu: the International Review of Jewish Genealogy (Summer 2008)
  2. ^ Nu? What's New? Volume 9, Number 22 Gary Mokotoff, Editor - The E-zine of Jewish Genealogy From Avotaynu

External links


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать реферат

Look at other dictionaries:

  • Daitch-Mokotoff Soundex — (D M Soundex) is a phonetic algorithm invented in 1985 by genealogist Gary Mokotoff, and later improved by Randy Daitch, both of the Jewish Genealogical Society. It is a refinement of the Russell and American Soundex algorithms designed to allow… …   Wikipedia

  • Soundex — is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for names with the same pronunciation to be encoded to the same representation so that they can be matched despite minor differences in spelling. Soundex… …   Wikipedia

  • Soundex — est un algorithme phonétique d indexation de noms par leur prononciation en anglais britannique. L objectif basique est que les noms ayant la même prononciation soient codés avec la même chaîne de manière à pouvoir trouver une correspondance… …   Wikipédia en Français

  • Soundex — es un algoritmo fonético, un algoritmo para indexar nombre por su sonido, al ser pronunciados en Inglés. El objetivo básico de este algoritmo es codificar de la misma forma los nombres con la misma pronunciación. Soundex es el algoritmo fonético… …   Wikipedia Español

  • Soundex — Algorithmus Soundex ist ein phonetischer Algorithmus zur Indizierung von Wörtern und Phrasen nach ihrem Klang in der englischen Sprache. Gleichklingende Wörter sollen dabei zu einer identischen Zeichenfolge kodiert werden. Der Soundex Algorithmus …   Deutsch Wikipedia

  • Gary Mokotoff — ( born 1937 ) is an American genealogist who focuses primarily on Jewish genealogy. He is the first person to receive the Lifetime Achievement Award of the International Association of Jewish Genealogical Societies for which he was president… …   Wikipedia

  • Phonetic algorithm — A phonetic algorithm is an algorithm for indexing of words by their pronunciation. Most phonetic algorithms were developed for use with the English language; consequently, applying the rules to words in other languages might not give a meaningful …   Wikipedia

  • Algorithme Phonétique — Un algorithme phonétique est un algorithme conçu pour indexer les mots selon leur prononciation. La plupart des algorithmes phonétiques sont développés pour être utilisé avec la langue anglaise ; par conséquent appliquer les règles de ces… …   Wikipédia en Français

  • Algorithme phonetique — Algorithme phonétique Un algorithme phonétique est un algorithme conçu pour indexer les mots selon leur prononciation. La plupart des algorithmes phonétiques sont développés pour être utilisé avec la langue anglaise ; par conséquent… …   Wikipédia en Français

  • Algorithme phonétique — Un algorithme phonétique est un algorithme conçu pour indexer les mots selon leur prononciation. La plupart des algorithmes phonétiques sont développés pour être utilisé avec la langue anglaise ; par conséquent appliquer les règles de ces… …   Wikipédia en Français

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”