1. NodeBox 1
    1. Homepage
    2. NodeBox 3Node-based app for generative design and data visualization
    3. NodeBox OpenGLHardware-accelerated cross-platform graphics library
    4. NodeBox 1Generate 2D visuals using Python code (Mac OS X only)
  2. Gallery
  3. Documentation
  4. Forum
  5. Blog

Nodebox Linguistics Missing Corpus

Posted by Chris on Apr 09, 2008

The corpora meant to be accessed via Nodebox Linguistic's en.parser.nltk_lite.corpora appear to be missing. For example, en.parser.nltk_lite.corpora.cmudict.demo() throws the error "No such file or directory /usr/lib/nltk_lite/cmudict/cmudict". I tried copying some of the files from ftp://ftp.cs.cmu.edu/afs/cs.cmu.edu/data/anonftp/project/fgdata/dict/ but none seem to work, causing nltk to raise parsing errors.

Where can we get the appropriate corpora files? The corpora code using in Nodebox's nltk seems notably different from the current stand-alone nltk.


 
Posted by Tom De Smedt on Apr 11, 2008

Hi Chris,

The corpora are not included because of file size. I have never tested this so I'll have to look around a bit. I'll let you know when I come up with a solution.