Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revisionBoth sides next revision | ||
tools [2013/03/14 10:32] – [Tools] eros | tools [2013/03/14 10:34] – [Complete pipelines] eros | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ===== Tools ===== | + | ====== Tools ====== |
This is an incomplete list of tools you can use to build corpora from the web. | This is an incomplete list of tools you can use to build corpora from the web. | ||
- | ==== Complete pipelines ==== | + | ===== Complete pipelines |
- | * [[http:// | + | * [[http:// |
- | ==== De-duplication ==== | + | ===== De-duplication |
* {{: | * {{: | ||
* [[http:// | * [[http:// | ||
- | ==== Boilerplate removal ==== | + | ===== Boilerplate removal |
* [[http:// | * [[http:// |