[CWB] Installation of a corpus and subsequent problems...

Graham Ranger graham.ranger at univ-avignon.fr
Thu May 15 13:33:16 CEST 2025


Hello again,
I think (shamefacedly) that the errors were simply due to feeding cqpweb 
a non-verticalised file. I had the fuzzy feeling that I had done this in 
the past, but I must have been wrong, as all the files in the upload 
area are in fact verticalised 😳. I'll test with a properly verticalised 
file and, if all goes well, not report back.
Best,
Graham.

Le 14/05/2025 à 21:09, Graham Ranger -- UAPV a écrit :
> Hello to everyone,
> The title says it all, or almost.
>
> 1) I attempted to install a corpus with a set of xml tags, etc. but 
> ran into an error;
> 2) I then attempted to install a mini-corpus, in an effort at 
> debugging, and ran into the same error (something about extra material 
> after xml tags, repeated for every line of the corpus) -- I can't be 
> more precise for reasons which will soon become clear;
> 3) I then attempted to delete the corpus which, although not created, 
> was occupying a registry entry, and now have another error message: 
> "**** CQP ERROR **** cl_new_corpus: <1984_1> is not a valid corpus 
> name REGISTRY ERROR (/var/cqpweb/registry/1984_1): syntax error 
> REGISTRY ERROR (/var/cqpweb/registry/1984_1): Error parsing the main 
> Registry structure. CQPweb encountered an error and could not continue."
> 4) I am now unable to execute any queries or do anything much with 
> cqpweb... On executing a query, for example, I get this error message:
>
> CQP reports an error! The CQP program sent back these error messages:
>
> **** CQP ERROR ****
>
> CQP Error:
>
> No corpus activated
>
> CQP Error:
>
> CQP Syntax Error: syntax error
>
> [r] Registry <--
>
> Ignoring subsequent input until next ';'...
>
> I'm going to ask for the server to be restored to a previous state, 
> which should provide a fix, but won't get me any further with 
> installing the corpus I wished to set up. If there's a simpler way to 
> repair the registry entries, I'd be interested.
>
> The "toy corpus" which managed to break the server was as follows:
>
> <text id="1984_1">
> <title>Nineteen eighty-four</title>
> <div1 type="part" n="1">
> <head>PART 1</head>
> <div2 type="chapter" n="1">
> <head>1</head>
> <p>It was a bright cold day in April, and the clocks were striking 
> thirteen. Winston Smith, his chin nuzzled into his breast in an effort 
> to escape the vile wind, slipped quickly through the glass doors of 
> Victory Mansions, though not quickly enough to prevent a swirl of 
> gritty dust from entering along with him.</p>
> </div2>
> </div1>
> </text>
>
> Many thanks in advance for any help with this!
> Best regards,
> Graham.
>
>
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20250515/52ba065b/attachment.html>


More information about the CWB mailing list