[CWB] How to install new corpus through CQPweb
Jörg Knappen
j.knappen at mx.uni-saarland.de
Thu Nov 5 10:50:34 CET 2020
Two things are involved: format conversion and the addition of annotations
(in the example, part of speech and lemma). Tools like TreeTagger
do both tasks in one run.
Adding the <s> structure is a separate, independent step; we use a simple
script that we wrote ourselves to add those tags.
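
As a rough illustration of the format conversion and the <s> step (not
the script mentioned above, which is not shown here), a minimal Python
sketch might look like the following. The names tokenize and to_vertical,
the regex tokenizer, and the rule "a sentence ends at . ! ?" are all
assumptions made for this example; it adds no part-of-speech or lemma
columns, which a tagger such as TreeTagger would supply. The <text id="...">
wrapper is included because CQPweb expects each text to carry an ID.

```python
import re

# Naive tokenizer (an assumption for illustration): words, optionally with
# internal hyphens or apostrophes, or single punctuation characters.
TOKEN_RE = re.compile(r"\w+(?:[-']\w+)*|[^\w\s]")

# Tokens treated as sentence-final. Real sentence splitting is harder
# (abbreviations, quotes, etc.); this is only a sketch.
SENT_END = {".", "!", "?"}


def tokenize(text):
    """Split plain text into a flat list of tokens."""
    return TOKEN_RE.findall(text)


def to_vertical(text, text_id="text1"):
    """Return one-word-per-line text with <s> tags inside a <text> element."""
    lines = [f'<text id="{text_id}">']
    sentence = []
    for tok in tokenize(text):
        sentence.append(tok)
        if tok in SENT_END:
            # Close the current sentence: tags go on lines of their own.
            lines.append("<s>")
            lines.extend(sentence)
            lines.append("</s>")
            sentence = []
    if sentence:
        # Trailing tokens without final punctuation still form a sentence.
        lines.append("<s>")
        lines.extend(sentence)
        lines.append("</s>")
    lines.append("</text>")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(to_vertical("Hello world. It's fine"), end="")
```

Each token ends up on its own line, and each structural tag on a line of
its own; annotation columns, if any, would follow the token separated by
tabs.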
--Jörg Knappen
On 2020-11-05 10:43, YANG CHRICS wrote:
> Hi, I am here to ask for help again. I couldn't figure out how to install a plain-text corpus through CQPweb. Today I read the encoding manual again, and it seems that I have to convert the text to the CWB input format (one-word-per-line text, as in the picture below). I would be grateful if you could tell me how to convert plain text into that format. Thank you.
>
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb
-------------- next part --------------
A non-text attachment was scrubbed...
Name: img3.png
Type: image/png
Size: 2887 bytes
Desc: not available
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20201105/2ac94944/attachment.png>