Web Content Extractor by Nikola Ljubešić – a tool for extracting content from web pages
PotaModule – a Perl module for “boilerplate” stripping and other forms of HTML document filtering and extraction, available in the BootCaT toolkit (see link above)