You are here

INSTALL.txt in Spam 5

Same filename and directory in other branches
  1. 6 INSTALL.txt
The spam module is a powerful collection of tools designed to help website
administrators to automatically deal with spam.  Spam is any content that is
posted to a website that is unrelated to the subject at hand, usually in the
form of advertising and links back to the spammer's own website.  This module
can automatically detect spam, instantly unpublish it, and send notification
to the site administrator.  For more information, refer to README.txt.

------------
Requirements:
------------

- Drupal 5.x


------------
Installation:
------------

1) Create a 'spam/' subdirectory within your 'modules/' directory and add the
   'spam.module', 'spam.install' and 'spam.info' files to this directory.  Be 
   sure your web server has read permissions to this directory and the files 
   you have placed into it.

2) Now you need to log in to your site and enable the spam.module.  This will
   automatically create all database tables required by the module.
    (Goto :: administer -> modules :: then check 'spam')


-------------
Configuration:
-------------

3) Optionally reconfigure the spam module.  The default configuration will
   protect you from many common spammer techniques.
    (Goto :: administer -> settings -> spam)
  - In the "Filter" section, check the boxes next to the types of content that
    you want the spam filter to examine.  If only trusted users can post some
    type of content, there is no reason to filter that content.  If anonymous
    users are allowed to post certain types of content (such as comments), it
    would be wise to filter that content.
  - In the "Limits" section, select appropriate values in the drop down boxes.
    The first to boxes will catch a large percentage of spam, as spammers tend
    to include an abnormally high number of URLs in their content.  It can
    also be effective to catch duplicate content when it is posted.  Finally,
    blacklisting is helpful in that it will minimize the affect of repeat
    offenders.
  - In the "Actions" section, check the boxes next to the desired actions.
    Automatically unpublishing spam is wise, as it will prevent probable
    spam from showing up on your web pages.  You will be able to manually
    review content determined to be spam, giving you a chance to publish
    anything that was wrongly filtered.  The notify user action is a useful
    way to prevent user confusion, as should content be filtered the user will
    see a message explaining what has happened.  Email notification is a
    good way to let the site administrator know that there is spam in the
    queue to be reviewed.  The final option is a drop down menu, allowing you
    to automatically delete spam content after a specified amount of time.  
    This is a useful way to prevent your databases from getting clogged with
    spam content.
  - In the "Advanced" section you should not need to make any changes, unless
    you are trying to better understand how the spam filter works.
  - Now click "Save configuration".

4) Setup spam module permissions.
    (Goto :: administer -> access control -> permissions)
  - Assign the "report spam" permission to users that should be able to report
    content that they think is spam.  It is recommended that you do not give
    this ability to anonymous guests otherwise spambots could easily flood you
    with false reports.
  - Assign the "access spam" permission to users that should be able to see
    whether content is considered to be spam or not.
  - Assign the "administer spam" permission to users that should be able to
    manually mark content as spam or not spam.
  - Assign the "bypass filter" permission to trusted users who's content does
    not need to be run through the filter because you are certain it will
    never be spam.

5) Defining custom filters.
    (Goto :: administer -> spam -> custom filters)
   - In the 'custom filter' text area, enter a string.  You can enter a
     word, a phrase, a regular expression.  For example, if a large number
     of spam contents on your site contain the word 'Viagra', you can use
     it as your custom filter.
   - If the string you entered was formatted as a regular expression, you
     need to check the "Regular expression" box to let the spam module know
     it should treat your filter as a regular expression.  (If your regular
     expression is formatted incorrectly, you will get an error message when
     you try to save it)
   - You need to tell the filter what it should do if the filter
     matches.  Choices are 'always spam', 'usually spam', 'usually not spam',
     and 'never spam'.  The first option, 'always spam', allows you to
     blacklist matching words, phrases and regular expressions.  The last
     option, 'never spam', allows you to whitelist matching words, phrases
     and regular expressions.  The middle two options allow you to greylist
     matching words, phrases or regular expressions.  When only one greylist
     filter matches, the final choice of making new content spam or not spam
     is left up to the Bayesian filter.
   - If you normally receive email notification when content is marked as
     spam, but you tend to be flooded with spam matching this filter, it can
     be helpful to check "Disable email notification" to prevent the spam
     module from sending you a notification email.  It is better to use this
     option and then to "auto expire" spam than it is to check the next
     option.
   - If you are 100% confident that your custom filter only matches spam,
     and never accidently matches non-spam, you can check the "automatically
     delete spam" box.  If checked, any content that matches this filter and
     is ultimately determined to be spam will be automatically, irrevocably and
     silently deleted.  Be warned that it consumes a lot of resources to
     delete spam content, and that if you are under a flood of spam postings
     this can lead to a DoS.  Instead, it is suggested that you instead check
     the earlier "Disable email notification" box and configure your spam
     module to automatically delete spam after a certain amount of time.
   - After defining custom filters, it is a good idea to regularly visit this
     page and review how effective your custom filters are.  Simple statistics
     are provided to show you how often your filters matched new content, and
     when was the last time each of them matched.
   - Whitelisting words, phrases and regular expressions can be a bad idea.
     If a spammer discovers items from your whitelist, they will be able to
     consistently get spam through your filter.
   - Spam is constantly evolving, thus you will probably find that you are
     constantly having to update your spam filter.

6) Defining URL filters.
    (Goto :: administer -> spam -> URL filters)
   - If enabled, URL filters are automatically learned by the Bayesian
     filter.  Domains listed here are considered "spammer domains", and
     any new comment or other content will containing references (ie links)
     to these domains will be marked as spam.
   - Domain names that were erroneously learned by the Bayesian filter
     as spammer domain can be manually deleted here.
   - Known spammer domains can also be manually entered.

File

INSTALL.txt
View source
  1. The spam module is a powerful collection of tools designed to help website
  2. administrators to automatically deal with spam. Spam is any content that is
  3. posted to a website that is unrelated to the subject at hand, usually in the
  4. form of advertising and links back to the spammer's own website. This module
  5. can automatically detect spam, instantly unpublish it, and send notification
  6. to the site administrator. For more information, refer to README.txt.
  7. ------------
  8. Requirements:
  9. ------------
  10. - Drupal 5.x
  11. ------------
  12. Installation:
  13. ------------
  14. 1) Create a 'spam/' subdirectory within your 'modules/' directory and add the
  15. 'spam.module', 'spam.install' and 'spam.info' files to this directory. Be
  16. sure your web server has read permissions to this directory and the files
  17. you have placed into it.
  18. 2) Now you need to log in to your site and enable the spam.module. This will
  19. automatically create all database tables required by the module.
  20. (Goto :: administer -> modules :: then check 'spam')
  21. -------------
  22. Configuration:
  23. -------------
  24. 3) Optionally reconfigure the spam module. The default configuration will
  25. protect you from many common spammer techniques.
  26. (Goto :: administer -> settings -> spam)
  27. - In the "Filter" section, check the boxes next to the types of content that
  28. you want the spam filter to examine. If only trusted users can post some
  29. type of content, there is no reason to filter that content. If anonymous
  30. users are allowed to post certain types of content (such as comments), it
  31. would be wise to filter that content.
  32. - In the "Limits" section, select appropriate values in the drop down boxes.
  33. The first to boxes will catch a large percentage of spam, as spammers tend
  34. to include an abnormally high number of URLs in their content. It can
  35. also be effective to catch duplicate content when it is posted. Finally,
  36. blacklisting is helpful in that it will minimize the affect of repeat
  37. offenders.
  38. - In the "Actions" section, check the boxes next to the desired actions.
  39. Automatically unpublishing spam is wise, as it will prevent probable
  40. spam from showing up on your web pages. You will be able to manually
  41. review content determined to be spam, giving you a chance to publish
  42. anything that was wrongly filtered. The notify user action is a useful
  43. way to prevent user confusion, as should content be filtered the user will
  44. see a message explaining what has happened. Email notification is a
  45. good way to let the site administrator know that there is spam in the
  46. queue to be reviewed. The final option is a drop down menu, allowing you
  47. to automatically delete spam content after a specified amount of time.
  48. This is a useful way to prevent your databases from getting clogged with
  49. spam content.
  50. - In the "Advanced" section you should not need to make any changes, unless
  51. you are trying to better understand how the spam filter works.
  52. - Now click "Save configuration".
  53. 4) Setup spam module permissions.
  54. (Goto :: administer -> access control -> permissions)
  55. - Assign the "report spam" permission to users that should be able to report
  56. content that they think is spam. It is recommended that you do not give
  57. this ability to anonymous guests otherwise spambots could easily flood you
  58. with false reports.
  59. - Assign the "access spam" permission to users that should be able to see
  60. whether content is considered to be spam or not.
  61. - Assign the "administer spam" permission to users that should be able to
  62. manually mark content as spam or not spam.
  63. - Assign the "bypass filter" permission to trusted users who's content does
  64. not need to be run through the filter because you are certain it will
  65. never be spam.
  66. 5) Defining custom filters.
  67. (Goto :: administer -> spam -> custom filters)
  68. - In the 'custom filter' text area, enter a string. You can enter a
  69. word, a phrase, a regular expression. For example, if a large number
  70. of spam contents on your site contain the word 'Viagra', you can use
  71. it as your custom filter.
  72. - If the string you entered was formatted as a regular expression, you
  73. need to check the "Regular expression" box to let the spam module know
  74. it should treat your filter as a regular expression. (If your regular
  75. expression is formatted incorrectly, you will get an error message when
  76. you try to save it)
  77. - You need to tell the filter what it should do if the filter
  78. matches. Choices are 'always spam', 'usually spam', 'usually not spam',
  79. and 'never spam'. The first option, 'always spam', allows you to
  80. blacklist matching words, phrases and regular expressions. The last
  81. option, 'never spam', allows you to whitelist matching words, phrases
  82. and regular expressions. The middle two options allow you to greylist
  83. matching words, phrases or regular expressions. When only one greylist
  84. filter matches, the final choice of making new content spam or not spam
  85. is left up to the Bayesian filter.
  86. - If you normally receive email notification when content is marked as
  87. spam, but you tend to be flooded with spam matching this filter, it can
  88. be helpful to check "Disable email notification" to prevent the spam
  89. module from sending you a notification email. It is better to use this
  90. option and then to "auto expire" spam than it is to check the next
  91. option.
  92. - If you are 100% confident that your custom filter only matches spam,
  93. and never accidently matches non-spam, you can check the "automatically
  94. delete spam" box. If checked, any content that matches this filter and
  95. is ultimately determined to be spam will be automatically, irrevocably and
  96. silently deleted. Be warned that it consumes a lot of resources to
  97. delete spam content, and that if you are under a flood of spam postings
  98. this can lead to a DoS. Instead, it is suggested that you instead check
  99. the earlier "Disable email notification" box and configure your spam
  100. module to automatically delete spam after a certain amount of time.
  101. - After defining custom filters, it is a good idea to regularly visit this
  102. page and review how effective your custom filters are. Simple statistics
  103. are provided to show you how often your filters matched new content, and
  104. when was the last time each of them matched.
  105. - Whitelisting words, phrases and regular expressions can be a bad idea.
  106. If a spammer discovers items from your whitelist, they will be able to
  107. consistently get spam through your filter.
  108. - Spam is constantly evolving, thus you will probably find that you are
  109. constantly having to update your spam filter.
  110. 6) Defining URL filters.
  111. (Goto :: administer -> spam -> URL filters)
  112. - If enabled, URL filters are automatically learned by the Bayesian
  113. filter. Domains listed here are considered "spammer domains", and
  114. any new comment or other content will containing references (ie links)
  115. to these domains will be marked as spam.
  116. - Domain names that were erroneously learned by the Bayesian filter
  117. as spammer domain can be manually deleted here.
  118. - Known spammer domains can also be manually entered.