- SimMetrics
SimMetrics is an open source, extensible library of similarity or distance metrics (also known as string metrics). The SimMetrics library includes the following metrics:
* Levenshtein distance,
* Block distance or city block distance or L1 distance,
* Cosine similarity,
* Jaccard index,
* Needleman-Wunsch algorithm or Sellers algorithm,
* Smith-Waterman algorithm,
* Gotoh distance or Smith-Waterman-Gotoh distance,
* Monge Elkan distance,
* Jaro distance,
* Jaro-Winkler,
* SoundEx distance,
* Matching coefficient,
* Dice’s coefficient,
* Jaccard similarity or Jaccard coefficient or Tanimoto coefficient,
* Overlap coefficient,
* Euclidean distance,
* Q-gram distance,
* and more.
SimMetrics provides a library of floating-point similarity measures, normalised to the range 0.0 to 1.0, between pairs of strings, as well as the unnormalised metric output.
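To illustrate the difference between an unnormalised metric output and a normalised 0.0 to 1.0 similarity, the following is a minimal Python sketch using Levenshtein distance. It is not the SimMetrics API: the function names `levenshtein_distance` and `levenshtein_similarity` are hypothetical, and dividing by the length of the longer string is one common normalisation, assumed here for illustration; SimMetrics may normalise differently.

```python
def levenshtein_distance(a: str, b: str) -> int:
    """Unnormalised output: the minimum number of single-character
    insertions, deletions, and substitutions turning a into b."""
    # Keep the shorter string in the inner loop to reduce memory use.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def levenshtein_similarity(a: str, b: str) -> float:
    """Normalised output in [0.0, 1.0]; 1.0 means identical strings.
    Assumed normalisation: 1 - distance / length of the longer string."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical
    return 1.0 - levenshtein_distance(a, b) / longest


if __name__ == "__main__":
    print(levenshtein_distance("kitten", "sitting"))
    print(round(levenshtein_similarity("kitten", "sitting"), 3))
```

The raw distance grows with string length, so it is hard to compare across pairs of different sizes; the normalised score makes such comparisons and fixed thresholds easier to apply.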
External links
* [http://sourceforge.net/projects/simmetrics/ SimMetrics project page on SourceForge]
* [http://www.dcs.shef.ac.uk/~sam/simmetrics.html SimMetrics page at the University of Sheffield]
* [http://secondstring.sourceforge.net/ Second String], a similar library
Wikimedia Foundation. 2010.