[CWB] Abortion on corpus creation

Graham Ranger -- UAPV graham.ranger at univ-avignon.fr
Wed Apr 26 16:28:11 CEST 2023


Hello,
Many thanks for your help. Unfortunately, that didn't work... I've just 
checked: my XML tags are on different lines (though I would hope that 
would not make a difference) and the only spaces in the file are in the 
XML tag between "text" and "id".
Best,
Graham.

Le 26/04/2023 à 16:00, Andressa Gomide a écrit :
> Hello,
>
> I just had the same problem.
> I had a line with two XML tags (</text> <text id="A234">) and lemmas 
> with spaces (e.g. /de a/; /em o/).
> I addressed these issues (</text>\n<text id="A234">, de_a, em_o) and 
> managed to install the corpus.
> Hope it helps.
>
> Best,
>
> Andressa
>
> On Wed, 26 Apr 2023 at 15:30, Graham Ranger -- UAPV 
> <graham.ranger at univ-avignon.fr> wrote:
>
>     Hello everybody,
>     I'm running into problems on corpus creation... specifically the
>     error message: "Unexpected line outside text_id tags while
>     creating corpus GARDNER_PEN__FREQ! -- creation aborted"
>     Now, I'd expect that to mean that there is some junk outside the
>     areas tagged with <text>...</text>, but after inspecting the file
>     closely I cannot for the life of me see where the junk might be.
>     Any help would as always be very gratefully received.
>     Best,
>     Graham.
>     _______________________________________________
>     CWB mailing list
>     CWB at sslmit.unibo.it
>     http://liste.sslmit.unibo.it/mailman/listinfo/cwb
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20230426/94814ba3/attachment.html>


More information about the CWB mailing list