User Tools

Site Tools


tools

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
tools [2013/03/20 09:40]
eros [Tools]
tools [2016/02/25 15:20] (current)
eros [Boilerplate removal]
Line 15: Line 15:
  
   * [[http://​code.google.com/​p/​justext/​|jusText]] -- a tool for removing boilerplate content   * [[http://​code.google.com/​p/​justext/​|jusText]] -- a tool for removing boilerplate content
-  * [[http://www.nljubesic.net/resources/tools/webcontentextractor/|WebContentExtractor]] -- a tool for extracting content from web pages+  * [[http://metashare.elda.org/repository/browse/web-content-extractor/​9e14ee4a663d11e28a985ef2e4e6c59e51a55e76bd4b47f39338db609624ff54/|Web Content Extractor]] by Nikola Ljubešić ​-- a tool for extracting content from web pages
   * the **PotaModule** (a Perl module that is intended to perform "​boilerplate"​ stripping and other forms of HTML document filtering and extraction) is available in the BootCaT toolkit (see link above).   * the **PotaModule** (a Perl module that is intended to perform "​boilerplate"​ stripping and other forms of HTML document filtering and extraction) is available in the BootCaT toolkit (see link above).
- 
tools.txt · Last modified: 2016/02/25 15:20 by eros