[CWB] CWB and CoNLL format
Christian Chiarcos
christian.chiarcos at web.de
Sun Mar 7 09:48:41 CET 2021
Dear Stefan,
Am Fr., 5. März 2021 um 12:38 Uhr schrieb Stefan Evert <
stefanML at collocations.de>:
>
> > On 3 Mar 2021, at 22:14, Stefan Evert <stefanML at collocations.de> wrote:
> >
> > But thanks for the recommendations, I'll try to remember until I have
> time to work on the manual again. I think I explained my understanding of
> "CoNLL-style format" in the manpages, but I completely agree that "full
> CoNLL support" in the encoding tutorial is misleading.
>
> Support for CoNLL-style input and its limitations are now explained in
> much more detail in the encoding tutorial and the manpages. This should no
> longer be misleading and also provides thorough documentation.
>
Much better, thanks a lot!
> I've never been able to find formal documentation for a general CoNLL
format ...
> And CWB 4 will require much better (i.e. more explicit) input formats
than CoNLL.
We have machine-readable definitions of most CoNLL formats under
https://github.com/acoli-repo/conll-rdf/blob/master/owl/conll.ttl. This is
being used for lossless round-tripping among different CoNLL dialects and
between these and graph representations. Publication in preparation.
Best,
Christian
> I've also implemented a few small improvements (CoNLL-style feature sets
> can be converted to CWB format) and added tests in CWB/Perl.
>
> Best,
> Stefan
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20210307/aaa85602/attachment.html>
More information about the CWB
mailing list