---++ Corpus Usage Guidelines ---+++ Access Policies In order to ensure compliance with the licenses for the various corpora we have installed, we have instituted the following policies. 1 Compling laboratory members are granted access to corpora solely for coursework and research projects in the context of their affiliation with the UW. 1 Corpora may not be copied from the servers, nor used in commercial applications, unless permitted by the corpus license agreement. 1 Many of the corpora have additional licensing conditions (see the [[https://vervet.ling.washington.edu/db/][CompLing Database]].) Before you access any particular corpus, you are responsible for reading and understanding the license. For LDC corpora, you should also read the [[https://catalog.ldc.upenn.edu/license/ldc-not-for-profit-membership.pdf][general membership agreement]]. 1 For some of the corpora, we must maintain a list of individuals granted access and/or have each user sign an individual license agreement. This is indicated in the "Restriction" column in the database. To access these corpora, you'll need to click the "Request Access" button and agree to the license agreement. 1 Whenever you use a corpus for course work or for a paper, you should cite the corpus among your references. The proper citation information should be found in the license or README file of the corpus. 1 Failure to follow these policies could result in loss of access to the corpora, or to the lab/servers in general. ---+++ Available corpora For a list of currently available corpora, along with their licensing and access information, see the [[https://vervet.ling.washington.edu/db/index.php][CompLing Database]]. _(If your browser prompts you with a certificate warning, you need to [[https://www.washington.edu/computing/ca/][install the UW root certificate]].)_ Terminology: * _Installed_ means the corpus is currently installed on the server and ready to use. * _Available_ means the corpus is immediately available, but not currently installed on the server. * _Requested_ means that a request has been put in to LDC for the corpus, but it's not immediately available. We can obtain any LDC corpus, but there may be a lead time of several weeks for corpora that are not listed in the database. ---+++ Requesting additional corpora Lab members who would like access to a corpus listed as "Available" in the database should send an email to linghelp@u with a request for it to be installed. Lab members who would like access to a corpus not listed in the database should send an email to Emily (ebender at u) with the request.
ore topic actions
Topic revision: r16 - 2015-03-25 - 21:57:36 - olzama
Create New Topic
Copyright &© by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Terms & Conditions