Back | WordLists

BulgarianStemmer - bulgarianStemmer.txt

Download Complete Wordlist (4.07 K)
Download Complete Wordlist bzip2 Compressed (1.25 K)
Showing 28 Randomly Sampled Lines...
chomp $line;
return($word);
}
$word = $_[0];
return(substr($word,0,$i-6));
# done by J. Savoy University of Neuchate
}
if ($i > 12) { # for words having more than 6 characters
return(substr($word,0,$i-4));
# definite article (the) for nouns
$line =~ s/^(\s)+//;
if ($word =~ m/ци$/) { # final -UH --> k
$line = substr($line,0,$i-6);
$word =~ s/овци$/о/;
return(substr($word,0,$i-4));
if ($i > 6) {
return(substr($word,0,$i-4));
if ($line =~ m/я$/) { # final -(R) (masc)
print "$stem\n";
# Light stemmer for Bulgarian language (to be viewed by selecting UTF-8 encoding)
return(substr($word,0,$i-4));
$stem = BulgarianLightStemmer($line); # assuming one word per line
substr($line,$i-4,4) = substr($line,$i-2,2);
$word =~ s/зи$/г/;
if ($i > 10) {
return($line);
$i = length($word);
if ($word =~ m/ища$/) { # final -HWa


Back | WordLists