[CWB] Abortion on corpus creation
Graham Ranger -- UAPV
graham.ranger at univ-avignon.fr
Wed Apr 26 16:28:11 CEST 2023
Hello,
Many thanks for your help. Unfortunately, that didn't work... I've just
checked: my XML tags are each on their own line (though I would hope that
would not make a difference) and the only spaces in the file are inside
the XML tags, between "text" and "id".
Best,
Graham.
On 26/04/2023 at 16:00, Andressa Gomide wrote:
> Hello,
>
> I just had the same problem.
> I had a line with two XML tags (</text> <text id="A234">) and lemmas
> with spaces (e.g. /de a/; /em o/).
> I addressed these issues (</text>\n<text id="A234">, de_a, em_o) and
> managed to install the corpus.
> Hope it helps.
>
> Best,
>
> Andressa
>
> On Wed, 26 Apr 2023 at 15:30, Graham Ranger -- UAPV
> <graham.ranger at univ-avignon.fr> wrote:
>
> Hello everybody,
> I'm running into problems on corpus creation... specifically the
> error message: "Unexpected line outside text_id tags while
> creating corpus GARDNER_PEN__FREQ! -- creation aborted"
> Now, I'd expect that to mean that there is some junk outside the
> areas tagged with <text>...</text>, but after inspecting the file
> closely I cannot for the life of me see where the junk might be.
> Any help would as always be very gratefully received.
> Best,
> Graham.
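
For anyone hitting the same message, the checks discussed in this thread (two XML tags on one line, spaces inside token or lemma fields, token lines sitting outside <text>...</text>) can be sketched as a short Python script. This is only a rough sketch under assumptions: it assumes a one-token-per-line vertical file with tab-separated columns and <text id="..."> as the only structural tag that matters, and the function name find_vrt_problems is made up for illustration. It is not part of CWB or CQPweb and does not reproduce their actual validation.

```python
import re

# Any XML-like tag, e.g. <text id="A234"> or </text>.
TAG = re.compile(r"<[^<>]+>")
# An opening <text> tag carrying an id attribute (assumed layout).
TEXT_OPEN = re.compile(r'<text\s+id="[^"]*"\s*>')


def find_vrt_problems(lines):
    """Return (line_number, message) pairs for suspicious lines.

    Hypothetical helper: flags the issues mentioned in the thread,
    not everything the real indexer might reject.
    """
    problems = []
    inside_text = False
    for number, raw in enumerate(lines, start=1):
        line = raw.rstrip("\r\n")
        stripped = line.strip()
        if not stripped:
            continue  # blank lines are ignored in this sketch
        tags = TAG.findall(stripped)
        if stripped.startswith("<") and tags:
            if len(tags) > 1:
                problems.append((number, "more than one XML tag on the line"))
            # Track whether we are currently inside a <text> element.
            for tag in tags:
                if TEXT_OPEN.fullmatch(tag):
                    inside_text = True
                elif tag == "</text>":
                    inside_text = False
            continue
        # Anything else is treated as a token line.
        if not inside_text:
            problems.append((number, "token line outside <text>...</text>"))
        if " " in line:
            problems.append((number, "space inside a token line"))
    return problems


sample = [
    '<text id="A1">\n',
    "de a\tde a\n",             # space in word form and lemma
    '</text> <text id="A2">\n',  # two tags on one line
    "casa\tcasa\n",
    "</text>\n",
    "junk\tjunk\n",             # token after the closing tag
]
for number, message in find_vrt_problems(sample):
    print(f"line {number}: {message}")
```

To run it on a real file, pass an open file object instead of the sample list, e.g. find_vrt_problems(open("corpus.vrt", encoding="utf-8")), where corpus.vrt stands in for your own file name.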