You are here

README.txt in Porter-Stemmer 7

Same filename and directory in other branches
  1. 8 README.txt
  2. 5 README.txt
  3. 6.2 README.txt
  4. 6 README.txt
GENERAL INFORMATION
-------------------

This module implements the Porter-Stemmer algorithm, version 2, to improve
English-language searching with the Drupal built-in Search module. Information
about the algorithm can be found at
http://snowball.tartarus.org/algorithms/english/stemmer.html

Stemming reduces a word to its basic root or stem (e.g. 'blogging' to 'blog') so
that variations on a word ('blogs', 'blogger', 'blogging', 'blog') are
considered equivalent when searching. This generally results in more relevant
results.

Note that a few parts of the Porter Stemmer algorithm work better for American
English than British English, so some British spellings will not be stemmed
correctly.

This module will use the PECL "stem" library's implementation of the Porter
Stemmer algorithm, if it is installed on your server. If the PECL "stem" library
is not available, the module uses its own PHP implementation of the
algorithm. The output is the same in either case. More information about the
PECL "stem" library: http://pecl.php.net/package/stem


INSTALLATION
------------

See the INSTALL.txt file for installation instructions.


TESTING
-------

The Porter Stemmer module includes tests for the stemming algorithm and
functionality.  If you would like to run the tests, enable the core Testing
module, and then navigate to Administer > Configuration / Development / Testing.

Each "Stemming output" test for the Porter Stemmer module includes approximately
2000 individual word stemming tests (which test the module against a standard
word list downloaded from the site above).  Due to the way output is displayed
in SimpleTest, you may run into browser timeout or memory issues if you try to
run all 16 of the "Stemming output" tests during the same test run.

Tests are provided both for the internal algorithm and the PECL library.

There are also functional tests and tests for some of the internal steps of the
stemming algorithm.

File

README.txt
View source
  1. GENERAL INFORMATION
  2. -------------------
  3. This module implements the Porter-Stemmer algorithm, version 2, to improve
  4. English-language searching with the Drupal built-in Search module. Information
  5. about the algorithm can be found at
  6. http://snowball.tartarus.org/algorithms/english/stemmer.html
  7. Stemming reduces a word to its basic root or stem (e.g. 'blogging' to 'blog') so
  8. that variations on a word ('blogs', 'blogger', 'blogging', 'blog') are
  9. considered equivalent when searching. This generally results in more relevant
  10. results.
  11. Note that a few parts of the Porter Stemmer algorithm work better for American
  12. English than British English, so some British spellings will not be stemmed
  13. correctly.
  14. This module will use the PECL "stem" library's implementation of the Porter
  15. Stemmer algorithm, if it is installed on your server. If the PECL "stem" library
  16. is not available, the module uses its own PHP implementation of the
  17. algorithm. The output is the same in either case. More information about the
  18. PECL "stem" library: http://pecl.php.net/package/stem
  19. INSTALLATION
  20. ------------
  21. See the INSTALL.txt file for installation instructions.
  22. TESTING
  23. -------
  24. The Porter Stemmer module includes tests for the stemming algorithm and
  25. functionality. If you would like to run the tests, enable the core Testing
  26. module, and then navigate to Administer > Configuration / Development / Testing.
  27. Each "Stemming output" test for the Porter Stemmer module includes approximately
  28. 2000 individual word stemming tests (which test the module against a standard
  29. word list downloaded from the site above). Due to the way output is displayed
  30. in SimpleTest, you may run into browser timeout or memory issues if you try to
  31. run all 16 of the "Stemming output" tests during the same test run.
  32. Tests are provided both for the internal algorithm and the PECL library.
  33. There are also functional tests and tests for some of the internal steps of the
  34. stemming algorithm.