[CWB] CWB and CoNLL format

Christian Chiarcos christian.chiarcos at web.de
Sun Mar 7 09:48:41 CET 2021


Dear Stefan,

On Fri, 5 Mar 2021 at 12:38, Stefan Evert
<stefanML at collocations.de> wrote:

>
> > On 3 Mar 2021, at 22:14, Stefan Evert <stefanML at collocations.de> wrote:
> >
> > But thanks for the recommendations, I'll try to remember until I have
> > time to work on the manual again.  I think I explained my understanding of
> > "CoNLL-style format" in the manpages, but I completely agree that "full
> > CoNLL support" in the encoding tutorial is misleading.
>
> Support for CoNLL-style input and its limitations are now explained in
> much more detail in the encoding tutorial and the manpages.  This should no
> longer be misleading and also provides thorough documentation.
>

Much better, thanks a lot!

> I've never been able to find formal documentation for a general CoNLL
> format ...
> And CWB 4 will require much better (i.e. more explicit) input formats
> than CoNLL.

We have machine-readable definitions of most CoNLL formats at
https://github.com/acoli-repo/conll-rdf/blob/master/owl/conll.ttl. These are
used for lossless round-tripping among different CoNLL dialects and
between these and graph representations. A publication is in preparation.

Best,
Christian



> I've also implemented a few small improvements (CoNLL-style feature sets
> can be converted to CWB format) and added tests in CWB/Perl.
>
> Best,
> Stefan
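
For readers following the thread, the feature-set conversion mentioned in
the quoted message can be pictured roughly as follows. This is only a
minimal sketch and not the CWB/Perl implementation: the function name
conll_feats_to_cwb is hypothetical, and it assumes that a CoNLL-style
feature column looks like "Case=Nom|Number=Sing" (with "_" for no features)
and that the CWB feature-set notation is a sorted, duplicate-free list
delimited by "|" on both ends, with a bare "|" for the empty set.

```python
def conll_feats_to_cwb(feats: str) -> str:
    """Convert a CoNLL-style feature string to CWB feature-set notation.

    Hypothetical helper for illustration; not part of CWB or CWB/Perl.
    """
    feats = feats.strip()
    # "_" (or an empty field) is assumed to mean "no features"
    if feats in ("", "_"):
        return "|"
    # split on "|", drop empty items and duplicates, then sort
    values = sorted({v for v in feats.split("|") if v})
    if not values:
        return "|"
    # wrap the sorted values in leading and trailing "|"
    return "|" + "|".join(values) + "|"


if __name__ == "__main__":
    print(conll_feats_to_cwb("Number=Sing|Case=Nom"))
    print(conll_feats_to_cwb("_"))
```

The values are sorted here with Python's default string ordering, which
should match byte order for ASCII feature names; whether that agrees with
the ordering CWB uses for other character sets is not checked in this sketch.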