Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision Next revisionBoth sides next revision | ||
tools [2013/03/14 10:32] – created eros | tools [2013/03/14 10:34] – [Complete pipelines] eros | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ===== Tools ===== | + | ====== Tools ====== |
- | This is an incomplete list of tools used to build corpora from the web | + | This is an incomplete list of tools you can use to build corpora from the web. |
- | ==== Complete pipelines ==== | + | ===== Complete pipelines |
- | * [[http:// | + | * [[http:// |
- | ==== De-duplication ==== | + | ===== De-duplication |
* {{: | * {{: | ||
* [[http:// | * [[http:// | ||
- | ==== Boilerplate removal ==== | + | ===== Boilerplate removal |
* [[http:// | * [[http:// |