[CWB] Limit for text elements in CQPweb?
Hardie, Andrew
a.hardie at lancaster.ac.uk
Fri Jun 26 01:49:02 CEST 2020
Apologies, I saw Stefan's response on the N of texts issue, and didn't spot that Daniel's error indicates that something else is the problem.
To wit:
"Corpus `tweets_covid' already exists on the system. Please specify a different name for your new corpus."
*In my experience*, this error normally arises on a second attempt to install a corpus after the first attempt has failed. the remnants of the first attempt in terms of files and database entries can then block the second attempt. The cause of the first failure is irrelevant - subsequent failures will be due to this blocking until all remnants are purged
In this case, AT LEAST the entry in the database table "corpus_info" has persisted from a previous attempt.
NORMALLY this can be solved simply by going through the "delete corpus" procedure -- from the main list of corpora in the admin area -- after an installation has crashed but before attempting to run it again.
I know this isn't super user friendly in terms of workflow and if I ever get a spare minute I intend to do something about it.
best
Andrew.
-----Original Message-----
From: cwb-bounces at sslmit.unibo.it <cwb-bounces at sslmit.unibo.it> On Behalf Of Daniel Wachter
Sent: 19 June 2020 12:38
To: cwb at sslmit.unibo.it
Subject: [CWB] Limit for text elements in CQPweb?
Dear all,
I have trouble installing a corpus in CQPweb and my best guess so far is that the number of text elements causes the problem.
The corpus is a collection of almost 5 million Tweets/texts with around
140 million tokens. I could index the whole corpus without any problems in CWB (version 3.4.18), but when I try to install it in CQPweb as pre-indexed it fails with the error message "Corpus `tweets_covid'
already exists on the system. Please specify a different name for your new corpus." - which of course I tried several times with several different names, but that doesn't seem to be the real problem.
I tested the whole procedure with a small subset of only 10,000 texts and had no problems at all. We're running CQPweb version 3.2.31 which is quite old, but we had some trouble upgrading to a more recent version.
So my questions are: Is there a hard limit how many texts CQPweb can handle? Is this a known bug of older versions and recent versions can handle that? Or does another issue causes my problem and it has nothing to do with the number of texts?
Here's the complete php backtrace:
array(5) {
[1]=>
array(4) {
["file"]=>
string(41) "/var/www/cqpweb/lib/admin-install.inc.php"
["line"]=>
int(584)
["function"]=>
string(9) "exiterror"
["args"]=>
array(1) {
[0]=>
string(104) "Corpus `tweets_covid' already exists on the system.
Please specify a different name for your new corpus."
}
}
[2]=>
array(4) {
["file"]=>
string(35) "/var/www/cqpweb/lib/execute.inc.php"
["line"]=>
int(179)
["function"]=>
string(18) "install_new_corpus"
["args"]=>
array(0) {
}
}
[3]=>
array(4) {
["file"]=>
string(41) "/var/www/cqpweb/lib/admin-execute.inc.php"
["line"]=>
int(122)
["args"]=>
array(1) {
[0]=>
string(35) "/var/www/cqpweb/lib/execute.inc.php"
}
["function"]=>
string(7) "require"
}
[4]=>
array(4) {
["file"]=>
string(37) "/var/www/cqpweb/lib/adminhome.inc.php"
["line"]=>
int(34)
["args"]=>
array(1) {
[0]=>
string(41) "/var/www/cqpweb/lib/admin-execute.inc.php"
}
["function"]=>
string(7) "require"
}
[5]=>
array(4) {
["file"]=>
string(29) "/var/www/cqpweb/adm/index.php"
["line"]=>
int(3)
["args"]=>
array(1) {
[0]=>
string(37) "/var/www/cqpweb/lib/adminhome.inc.php"
}
["function"]=>
string(7) "require"
}
}
Thanks a lot for your help!
Best,
Daniel
_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it
More information about the CWB
mailing list