The Levenshtein edit distance algorithm calculates the minimum number of single-character operations (insertions, deletions, and substitutions) needed to transform one string into another, recording the cost of every partial transformation in a matrix. The two strings do not need to contain the same number of letters. The matrix is set up from two strings: a source string (s) and a target string (t). The source string indexes the columns of the two-dimensional array and the target string indexes the rows. The uppermost, left-hand corner cell holds the value zero. The algorithm fills in the matrix cell by cell, and the bottom right-hand corner cell holds the minimum edit distance between the source string and the target string. Each cell value is determined by the values of three neighboring cells: the one directly above, the one directly to the left, and the one diagonally above and to the left.
The top row always holds an increasing count of insertions: the cell below the first letter of the source string has a value of 1, the cell below the second letter has a value of 2, and so on up to n, the total number of characters in the source string. These values are also the cumulative cost of the insertions. The first column follows the same pattern for the target string, which runs vertically. To fill in any other cell, take the minimum of three values: the value of the cell directly above plus the cost of a deletion, the value of the diagonal cell (above and to the left) plus the cost of a substitution (zero if the two letters already match), or the value of the cell to the left plus the cost of an insertion.
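The fill procedure described above can be sketched in Python. This is a minimal illustration, not a reference implementation: it assumes every insertion, deletion, and substitution costs 1, and the function name levenshtein and the test words are chosen here purely for illustration.

```python
def levenshtein(source: str, target: str) -> int:
    """Return the minimum number of single-character edits
    (assumed unit cost each) that turn source into target."""
    n, m = len(source), len(target)

    # One column per prefix of the source string (laid out across the top),
    # one row per prefix of the target string (laid out down the side).
    d = [[0] * (n + 1) for _ in range(m + 1)]

    # Top row counts 0, 1, 2, ..., n; first column counts 0, 1, 2, ..., m.
    for j in range(n + 1):
        d[0][j] = j
    for i in range(m + 1):
        d[i][0] = i

    for i in range(1, m + 1):
        for j in range(1, n + 1):
            # Substitution costs nothing when the two letters already match.
            substitution = 0 if source[j - 1] == target[i - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,                 # cell directly above, plus one edit
                d[i][j - 1] + 1,                 # cell to the left, plus one edit
                d[i - 1][j - 1] + substitution,  # diagonal cell, plus substitution cost
            )

    # The bottom right-hand cell holds the edit distance.
    return d[m][n]


print(levenshtein("kitten", "sitting"))  # prints 3
```

Because all three operations are assumed to cost the same, swapping the source and target strings gives the same distance. The words being compared may differ in length, as in the example, where a six-letter word is compared with a seven-letter one.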
A good demo of the Levenshtein Algorithm was developed by Peter Kleiweg. You can find it at http://odur.let.rug.nl/%7Ekleiweg/lev/
In linguistics, languages are categorized into language families. This project specifically compares words from Indo-European languages. The Indo-European family includes the Italic, Celtic, Germanic, Slavic, Baltic, Hellenic, Albanian, Armenian, and Indo-Iranian languages. The ancient Italic languages are no longer spoken, but the Romance languages that descend from them have a strong presence in the world today. The major Romance languages include Italian, French, Spanish, Portuguese, Catalan, and Romanian.