[CWB] Re: assertion failed / multilingual corpus

Ruprecht von Waldenfels waldenfels at issl.unibe.ch
Tue Jun 28 12:02:30 CEST 2011


Hi,
I'm resending this in plain format (sorry!),
Ruprecht

Am 27.06.2011 12:16, schrieb Ruprecht von Waldenfels:
> Dear everyone,
>
> I use CWB with a multilingual corpus (ParaSol, parasol.unibe.ch). I am 
> using an Ubuntu Server, CWB 3.2.7, downloaded and compiled Mon Jun  6 
> 16:43:04 CEST 2011, files encoded as UTF-8
>
> Sometimes, CWB breaks for an unknown reason; however, it does so only 
> in PrintMode sgml and only if two layers of annotation are included. 
> Here is the experiment:
>
> Setup: two corpus files, ECOROSA_IT, ECOROSA_RU; both with tags and 
> lemmata, aligned
>
> ECOROSA_IT; show +ecorosa_ru; [word=".*[smtv]i"];   cat Last to 
> "file.txt"; (over 9000 hits)
>
> adding tags OR  lemmas is not problem:
> ECOROSA_IT; show +ecorosa_ru; show +tag; [word=".*[smtv]i"];   cat 
> Last to "file.txt"; (over 9000 hits)
> ECOROSA_IT; show +ecorosa_ru; show +lemma; [word=".*[smtv]i"];   cat 
> Last to "file.txt"; (over 9000 hits)
>
> but adding BOTH leads to an error:
> ECOROSA_IT; show +ecorosa_ru; show +tag; show +lemma; 
> [word=".*[smtv]i"];   cat Last to "file.txt";
>
> cqp: concordance.c:425: remember_this_position: Assertion 
> `position_list' failed.
> Aborted
>
> It seems to me that this type of error has been happening with other 
> versions of CWB before, too, so this is not necessarily linked to the 
> current version. However, I cannot be sure because I do not normally 
> see the error messages when something does not work.
>
> (A minimal version of the corpus with only these two corpus files is 
> visible here. )
>
> All the best,
> Ruprecht
>
>
>
> -- 
> ------------------------------------------------
>
>
> Ruprecht von Waldenfels
> Universitaet Bern
> Institut fuer slavische Sprachen und Literaturen
> Laenggassstrasse 49 - CH 3005 Bern 9
> ------------------------------------------------
> Tel: +41  31 631 35 83 /  Fax: +41 31  631 39 90
> Tel: +49 761 214 66 72 / Mob.: +49 163 230 34 23
> -----------------------------------------------



More information about the CWB mailing list