Hi, <br><br>I have an error, I am not able to solve. I'm trying to build a Latin corpora but I get this error:<br><br>Error: Huffman codes too long (32 bits, current maximum is 31 bits).<br> Please contact the CWB development team for assistance.<br>
<br>I got this error when trying to build a 40 words corpora (I cut it to see if I could detect the error; with 39 words I do not get the error)<br><br>-----------<br><doc type="CHRISTIAN_LATIN" title="Abelard"><br>
<s><br>PETRUS Petrus N:nom<br>ABAELARDUS UNKNOUN ADJ<br>( ( PUN<br>1079-1142 card ADJ:NUM<br>) ) PUN<br>ABAELARDI UNKNOUN N:voc<br>AD UNKNOUN N:abl<br>AMICUM amicus ADJ<br>
SUUM sus N:gen<br>CONSOLATORIA consolatorius ADJ<br>Sepe sepes N:dat<br>humanos humanus ADJ<br>affectus affectus N:nom<br>aut aut CC<br>provocant provoco V:IND<br>aut aut CC<br>
mittigant mi V:IND<br>amplius ample ADV<br>exempla exemplum N:nom<br>quam qui REL<br>verba verbum N:nom<br>. . SENT<br></s><br><s><br>Unde unde ADV<br>post post PREP<br>
nonnullam nonnullus ADJ<br>sermonis sermo N:gen<br>ad ad PREP<br>habiti habeo V:PTC<br>consolationem consolatio N:acc<br>, , PUN<br>de de PREP<br>ipsis ipse DET<br>calamitatum calamitas N:gen<br>
mearum meus POSS<br>experimentis experimentum N:abl<br></s><br></doc><br><br>-----------------<br>This are the attributes I use to describe the corpus:<br><br>cat $SOURCEFILE | /usr/local/cwb-3.4.1/bin/cwb-encode -c utf8 -d $DATADIR -R $REGDIR/$CORPUSNAME -xsB -P lema -P pos -V s -S doc:0+type+title -S not:0+text<br>
<br>Thanks<br><br>Eva Bofias<br>