Differences
This shows you the differences between two versions of the page.
| start [2013/04/12 14:20] – eros | start [2022/12/05 11:57] (current) – eros | ||
|---|---|---|---|
| Line 7: | Line 7: | ||
| We try to keep everything very laid-back and flexible (minimal constraint on data representation, | We try to keep everything very laid-back and flexible (minimal constraint on data representation, | ||
| - | We built a few [[corpora]] you can [[download|download or use directly]], we | + | We built a few [[corpora]] you can [[download|download or use directly]], we described in great detail the procedure we followed to create our first corpora (DeWaC, UkWaC and ItWaC) in the paper: |
| - | described in great detail the procedure we followed to create our first corpora (DeWaC, UkWaC and ItWaC) in the paper: | + | |
| M. Baroni, S. Bernardini, A. Ferraresi and E. Zanchetta. 2009. The WaCky Wide Web: A Collection of Very Large Linguistically Processed Web-Crawled Corpora. //Language Resources and Evaluation// | M. Baroni, S. Bernardini, A. Ferraresi and E. Zanchetta. 2009. The WaCky Wide Web: A Collection of Very Large Linguistically Processed Web-Crawled Corpora. //Language Resources and Evaluation// | ||