Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revisionLast revisionBoth sides next revision | ||
start [2008/02/01 16:16] – eros | start [2013/04/12 14:20] – eros | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== WaCky ====== | + | ====== WaCky - The Web-As-Corpus Kool Yinitiative |
- | Welcome to WaCky! | + | Welcome to WaCky! |
We are a community of linguists and information technology specialists who got together to develop a set of tools (and interfaces to existing tools) that will allow linguists to crawl a section of the web, process the data, index and search them. | We are a community of linguists and information technology specialists who got together to develop a set of tools (and interfaces to existing tools) that will allow linguists to crawl a section of the web, process the data, index and search them. | ||
Line 7: | Line 7: | ||
We try to keep everything very laid-back and flexible (minimal constraint on data representation, | We try to keep everything very laid-back and flexible (minimal constraint on data representation, | ||
- | We also built a few [[corpora]] you can [[download]] | + | We built a few [[corpora]] you can [[download|download or use directly]], we |
+ | described in great detail the procedure we followed | ||
+ | |||
+ | M. Baroni, S. Bernardini, A. Ferraresi and E. Zanchetta. 2009. The WaCky Wide Web: A Collection of Very Large Linguistically Processed Web-Crawled Corpora. //Language Resources and Evaluation// | ||
+ | |||
+ | There, | ||
The project (including this website) is currently being sponsored by the [[LiMiNe]] project. | The project (including this website) is currently being sponsored by the [[LiMiNe]] project. | ||
- | The old version of this website is still available | + | [[staff_only:|Private section]] |