[CWB] Abortion on corpus creation
Andressa Gomide
gomide.andressa at gmail.com
Wed Apr 26 16:00:47 CEST 2023
Hello,
I just had the same problem.
I had a line with two XML tags (</text> <text id="A234">) and lemmas with
spaces (e.g. *de a*; *em o*).
I addressed these issues (</text>\n<text id="A234">, de_a, em_o) and
managed to install the corpus.
Hope it helps.
Best,
Andressa
On Wed, 26 Apr 2023 at 15:30, Graham Ranger -- UAPV <
graham.ranger at univ-avignon.fr> wrote:
> Hello everybody,
> I'm running into problems on corpus creation... specifically the error
> message: "Unexpected line outside text_id tags while creating corpus
> GARDNER_PEN__FREQ! -- creation aborted"
> Now, I'd expect that to mean that there is some junk outside the areas
> tagged with <text>...</text>, but after inspecting the file closely I
> cannot for the life of me see where the junk might be.
> Any help would as always be very gratefully received.
> Best,
> Graham.
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20230426/bd0d0a75/attachment.html>
More information about the CWB
mailing list