don't include the subversion metadata
.svn
don't include local mpi data
src/data/mpi
don't include James' thesaurus tools or tokeniser
thesaurus
tokeniser
don't include Python api until it works everywhere
src/api/python
src/api/nlp
don't include bootstrapping experimental tools
bootstrap
src/data/boot
don't include Depbank 560 results sets
src/tests/depbank560
don't include the old CCG regression tests because of the big data files
src/tests/ccg
don't include the VPE annotation (not stable yet) -- JB
src/data/vpe
don't include the Italian CCGbank (not stable yet) -- JB
src/data/italian