khorsi¶
-
corpustools.symbolsim.khorsi.
khorsi
(word1, word2, freq_base, sequence_type, max_distance=None)[source]¶ Calculate the string similarity of two words given a set of characters and their frequencies in a corpus based on Khorsi (2012)
Parameters: word1: Word
First Word object to compare
word2: Word
Second Word object to compare
freq_base: dictionary
a dictionary where each segment is mapped to its frequency of occurrence in a corpus
sequence_type: string
The type of segments to be used (‘spelling’ = Roman letters, ‘transcription’ = IPA symbols)
Returns: float
A number representing the relatedness of two words based on Khorsi (2012)