Ubuntu LKB (née Knoppix LKB) News 10 APR 2013: Version 16 released. This includes a new LKB version, a default .lkbrc file with (lui initialize), and python3. 28 ...
Spring Quarter 2013 Wednesdays 10:30-11:20, DEN 205 (and online) Date Topic Presenter 4/10 Towards Detecting Linguistic Phenomena in Precision Grammars ...
Using Hadoop on the Patas cluster General Info Hadoop is a processing framework that allows for scalable, distributed processing. It includes a distributed filesystem (HDFS ...
Troubleshooting Condor Job Problems General suggestions Make sure you're giving the full path to the executable in your submit file. (Unless the executable ...
Languages Analyzed in Ling 567 Alphabetical by language. Language families are per Ethnologue. Language Language Code Family Grammar Engineer Year Ainu ...
On a Windows PC, using the command line Make sure your 'path' environment variable points to your Python executable (perhaps C:\Python25). See page for instructions ...
Installing Knoppix LKB (or UbuntuLKB) as a VirtualBox appliance VirtualBox is a way to run Linux (or other operating systems) inside a window under OS X or Windows ...
CLMS Survival Guide Tips and tricks from past CLMA/CLMS students to help future generations. We learned the hard way, and you will too, but maybe not so much if you ...
Creating a Git Repo Git repositories allow for many types of workflows, centralized or decentralized. Before creating your repo, decide which steps to follow: Create ...
How To Access the CLMA Cluster Command Line Access MacOS Run Terminal (under Applications/Utilities). At the command prompt, type: ssh user@patas.ling.washington ...
List of known OpenSolaris/Linux NFSv4 bugs Because this needs to be documented somewhere... Some of these also affect Nexenta and other Solaris-derived operating ...
Welcome to the CLMS Internship Page Internships There are many internship opportunities available to CLMS students. This page presents a list of such opportunities ...
2010-10-04 We are revamping this page to include photos and short bios. Please add yourself (or update your entry) in the appropriate category below (alphabetical ...
vervet.ling.washington.edu Vervet is a web server run by the Department of Linguistics of the University of Washington, to provide services to students and faculty in the Computational ...
Grammar Engineering Frequently Asked Questions I want to install the LKB on my local machine, what should I do? The LKB can be downloaded from the website, for Linux ...
CLMS Parallel Computing Cluster What is it? The Computational Linguistics program maintains a cluster of several servers (referred to as "compute nodes") that can ...
PatasMahout General Info Mahout is a big data machine learning toolkit that can run on top of Hadoop. At the time of writing it is rather unstable, so ...
Building and running code: Paths and Environment Variables Some classes (notably the Ling570 series) require you to write code that will be run by a TA on Patas (and ...
MapReduce defined MapReduce is a framework for processing highly distributable problems across huge datasets using a large number of computers (nodes). "Map" step ...
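A minimal single-machine sketch of the "Map" and "Reduce" steps described above, using word counting as the example problem. The function names (`map_step`, `shuffle`, `reduce_step`, `word_count`) are hypothetical and chosen for illustration only; this is not Hadoop's API, and a real framework would distribute the map and reduce calls across nodes.

```python
from collections import defaultdict


def map_step(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group emitted values by key, as a framework does between the two steps."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_step(key, values):
    """Reduce: combine all values collected for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    # On a cluster, each document (or input split) would be mapped on a
    # different node; here everything runs in one process for clarity.
    pairs = [pair for doc in documents for pair in map_step(doc)]
    return dict(reduce_step(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["the cat sat", "the dog sat down"]))
```

The map and reduce functions never share state, which is the property that lets a framework run them on many nodes at once.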
Welcome to the UW Linguistics Treehouse Wiki On this site, we will create a repository of information about topics relating to computational linguistics, computational ...
Here is an article giving industry opinions on the relative assets and liabilities of some commonly used programming languages. JonathanPool 31 Mar 2005 And a ...
Here is a useful wiki page on Haskell.org covering a lot of what is going on with Haskell and linguistics. There is also this recent book on computational semantics ...
Hadoop Example using WordCount In this example, we'll run the example that comes with Hadoop on our local copy of the Brown Corpus. 1. Make a directory in the Hadoop ...
Grammar Engineering Frequently Asked Questions When I try to generate, the LKB says "Probable circular lexical rule" . How do I debug this? This error arises when ...
Academic Software License Deals This page lists free or reduced-cost commercial software licenses that students in the UW Linguistics department may be eligible for ...
MacOS SSH shortcuts using Terminal This page illustrates how to create a Dock shortcut that will automatically launch an SSH session in a terminal window. 1. Open ...
How do I paste into the LKB parse dialogue? Contributed by a 567 student: On Linux, the LKB dialogs that take textual input (e.g., the one reached from View/Type ...
Grammar Engineering with LKB and the LinGO Grammar Matrix: Frequently Asked Questions To the reader These pages have been prepared in the context of Linguistics 567 ...
Lemur Shared Projects Use this page to list projects you've created under svn://lemur.ling.washington.edu/shared . Include the directory name, the project's purpose ...
Grammar Engineering Glossary To the reader These pages have been prepared in the context of Linguistics 567, "Knowledge Engineering for Deep NLP", at the University ...
Wiki Standards and Practices Encodings This Wiki may or may not respect characters you enter into its editor if they are outside the Latin-1 (ISO 8859-1) range. As ...
Links to job postings of interest This page is obsolete. Please see the posting database instead. (If you get a certificate warning, you need to the UW root certificate ...
Running Condor jobs with large memory requirements By default, Condor assigns each process you launch 1 GB of RAM. If your job grows too large, one of two things will ...
Compling resources from UW libraries Start here: http://www.lib.washington.edu/ To get off-campus access: click the "off campus access" button and sign in ...
Parallel Processing One of the challenges of working in natural language processing is the large amounts of data that must be processed to get meaningful results. ...
How do I use it? A quick Condor tutorial How do I set up a job? Creating a submit description file You need to create a submit description file (sometimes referred ...
Data Protection What we do and don't do to protect data stored on our systems. Disk storage All of our critical servers use RAID (Redundant Array of Inexpensive ...
Corpus Usage Guidelines Access Policies In order to ensure compliance with the licenses for the various corpora we have installed, we have instituted the following ...
Email Lists and Other Local Sources of Information Email lists All of these lists carry same-day information fairly frequently, so we do NOT recommend the "digest ...
Who do I contact for help with patas or the Treehouse workstations? linghelp at u dot washington dot edu Who do I contact if I have problems with the wireless ...
"Shebang" lines for scripts on Patas Here are the proper script headers for various interpreted languages on patas. Bash #!/bin/bash Perl v5.10.0 #!/opt/perl ...
Guidelines for good passwords The security of our cluster relies in large part on the security of users' passwords. Passwords are checked regularly for strength and ...
Grammar Engineering Glossary C-CONT C-CONT stands for "Constructional Content" and represents the semantic contribution of a rule (phrase structure rule or lexical ...
Grammar Engineering Frequently Asked Questions What is the feature geometry assigned in the Matrix? (Or: How do I figure out what paths I need to use?) It's not possible ...
Main Web Preferences The following settings are web preferences of the Main web. These preferences overwrite the site-level preferences in . and ...
TWiki Administrator Group Member list (comma separated list): Set GROUP brodbd,mithreal Persons/group who can change the list: Set ALLOWTOPICCHANGE ...
Should I Use Git or Subversion (SVN)? In many ways this is a matter of taste; however, there are some main ways in which the two differ: Git SVN Online ...
Obtaining and Installing Git Precompiled binaries for Mac OS X and Windows, and source code for self-compiling on Linux, can be found at the Git project homepage: http ...
To help newly arrived CLMA students find their way around, we thought we'd start a site where people can post links to their favorite restaurants, shops, and sites ...
Running Knoppix LKB Under VirtualBox Manual installation for VirtualBox 4.x.x These instructions describe how to set up KnoppixLKB manually using an ISO image. The ...
Running Knoppix LKB Under VirtualBox Manual installation for VirtualBox 3.x.x These instructions describe how to set up KnoppixLKB manually using an ISO image. The ...
Treehouse Lecture: agree grammar engineering environment March 4, 2011 In this talk I present the status of my work on a new processing environment for typed feature ...
Lab member thesis topics Please add a short description of your thesis (MA or PhD) here, with a date indicating when the description was last added/modified. Master ...
The Treehouse presents: Antske Fokkens, U Saarlandes November 12, 2010 12:30-1:20 pm Using Grammar Customization for Analysis Comparison For most syntactic phenomena ...
The Treehouse presents: Luke Zettlemoyer, University of Washington CSE November 5, 2010 12:30-1:20 pm Recent research has focused on the problem of learning to map ...
The Treehouse presents: Meladel Mistica, U Melbourne October 14, 2010, 3:30-4:20 pm CSE 203 `Voice marking' is a phenomenon seen in some western Austronesian languages ...
The Treehouse presents: Rebecca Dridan, U. Melbourne October 15, 2010 12:30-1:20 pm In many cases, NLP is a consumer of the results of grammar engineering, but NLP ...
Using F# on Patas You can experiment with F#, Microsoft's functional programming language, on patas. The interactive interpreter can be started with: $ mono /opt/fsharp ...
Poor performance due to excessively large directories Understanding the problem Directories with a large number of files become extremely inefficient to work with ...
Performance Tips Tips on writing fast, efficient jobs. Feel free to add to this page if you have more good ideas! General Advice These are not Condor-specific, but ...
patas disk quota information Quota enforcement All quotas on the patas cluster are now "hard" quotas; that is, there is no grace period. When you reach your quota ...
Download Managers If your connection is slow or unreliable, you may want to use a download manager to download KnoppixLKB. This will let you stop and resume the download ...
Poor performance due to unnecessary I/O It's very common for parallel jobs on our system to bottleneck on disk I/O. This is because a single network fileserver is ...
Installing KnoppixLKB to a USB Drive KnoppixLKB can be run from a USB drive. This may be a more convenient option than running from a CD if your computer can boot ...
Locale Settings In Linux Dealing with multiple character encodings When we start dealing with multiple character sets and languages, we get into a confusing area ...
The keyboard doesn't work in the incr tsdb() window or The keyboard doesn't work in Emacs after exiting incr tsdb() This is a known problem with incr tsdb ...
Grammar Engineering Glossary # (hash) The hash mark is used in tdl syntax to indicate an identity constraint between the values of two features in a type definition ...
Grammar Engineering Glossary MC feature: Main Clause The feature MC stands for 'Main Clause', and is used to distinguish phenomena which can only occur in main ...
Grammar Engineering Glossary LIGHT feature matrix.tdl states: ; The feature LIGHT is used to model phenomena which distinguish ; shorter words from longer ones ...
Grammar Engineering Glossary VC feature: VerbCluster The feature VC stands for 'Verb Cluster', and keeps track of whether the main verb is present in the cluster ( ...
Technical Terms from Ling 566 10/2/07 Ontology A model of the entities in a certain domain which makes explicit the types of entities and the relationships ...
Knoppix LKB Input Method Notes IPA IPA input support is provided by the IPA X-SAMPA input method. Activation: Activate SCIM the normal way (Ctrl Space Shift ...
Subversion Advanced usage This page contains links to examples of doing more complicated tasks with Subversion. Feel free to add your own examples. Copying changes ...
Subversion Keyword Substitution Those of you who have previously used CVS may be accustomed to inserting revision information into CVS checkins using keywords such ...
Fetch the Debian stable ISO from an ISO mirror near you. The fastest one I've found from Seattle is the OSU OSL mirror: 64-bit CPU: http://ftp.osuosl.org/pub/debian ...
Treehouse Talk Deep Lexical Acquisition Tim Baldwin University of Melbourne Abstract: Deep lexical acquisition is the task of classifying lexemes for predefined ...
Grammar Engineering Frequently Asked Questions How do I interact with the LKB through the Lisp prompt? The LKB prompt in the common lisp buffer is actually a Lisp ...
Introduction Microsoft Word is convenient for word processing, but its file format (.DOC) is not as portable as we might like. This is particularly true for linguists ...
LaTeX is a high-quality typesetting system, with features designed for the production of technical and scientific documentation. It is the de facto standard for ...
Using C# on Patas October 2009: Mono upgraded to 2.4.2.3 on Patas. Thanks David! C# is a powerful general-purpose programming language originally developed by ...
My Links ATasteOfTWiki view a short introductory presentation on TWiki for beginners WelcomeGuest starting points on TWiki TWikiUsersGuide complete ...
List of TWiki users Please take the time to add yourself to the list. To do that, fill out the form in TWikiRegistration. This will create an account for you which ...
Software Requests Software requests should be emailed to linghelp@u. Please allow two weeks of lead time if you have a firm deadline. Your request should include the ...