
BulgarianStemmer - bulgarianStemmer.txt

Showing 28 Randomly Sampled Lines...
# Light stemmer for Bulgarian language (to be viewed by selecting UTF-8 encoding)
if ($word =~ m/е..и$/) { # rewriting rule
}
if (($i > 10) && (substr($line,$i-4,2) eq "ъ")) {
}
if ($word =~ m/ища$/) { # final -HWa
substr($word,$i-2,2) = "";
return($word);
$word =~ s/зи$/г/;
$word =~ s/овци$/о/;
if ($line =~ m/[аое]$/) { # final -[aoe]
}
$i = length($line);
if ($word =~ m/си$/) { # final -cH --> x
substr($line,$i-4,4) = substr($line,$i-2,2);
$line = substr($line,0,$i-2);
# done by J. Savoy University of Neuchatel (www.unine.ch/info/clef/)
if ($i > 8) {
return($line);
my($word, $i); # use local var $word and $i
if ($i > 12) { # for words having more than 6 characters
};
$line =~ s/^(\s)+//;
chomp $line;
return(substr($word,0,$i-4));
# We assume that each character (in Cyrillic) needs two bytes

