[CWB] Input for collostructional analysis
Stephanie Evert
stefanML at collocations.de
Fri Mar 18 14:12:29 CET 2022
> On 18 Mar 2022, at 12:05, Kara, Elif <elif.kara at fu-berlin.de> wrote:
>
> I have tried:
> cwb-lexdecode -r <REGISTRY> -F list.txt -f0 -P word <MYCORPUS>
>
> but this computes no matches which can't be right (my list contains one word per line).I’m using my university's v3.0.0 CWB installation from the command line.
As far as I can see, this should work. One possibility is that "list.txt" has CRLF (Windows) line endings, which would cause cwb-lexdecode to read the CR character as part of the word form (e.g. "dog<CR>" instead of "dog"), which doesn't match word forms in the corpus, of course.
CWB v3.0.0 is really, really outdated. Recent versions of CWB should clean up CRLF line endings in most cases, hopefully also with cwb-lexdecode.
Best,
Stephanie
More information about the CWB
mailing list