[CWB] Limit for text elements in CQPweb?

Daniel Wachter daniel-wachter at gmx.net
Fri Jun 19 13:37:34 CEST 2020


Dear all,

I have trouble installing a corpus in CQPweb and my best guess so far is
that the number of text elements causes the problem.

The corpus is a collection of almost 5 million Tweets/texts with around
140 million tokens. I could index the whole corpus without any problems
in CWB (version 3.4.18), but when I try to install it in CQPweb as
pre-indexed it fails with the error message "Corpus `tweets_covid'
already exists on the system. Please specify a different name for your
new corpus." - which of course I tried several times with several
different names, but that doesn't seem to be the real problem.
I tested the whole procedure with a small subset of only 10,000 texts
and had no problems at all. We're running CQPweb version 3.2.31 which is
quite old, but we had some trouble upgrading to a more recent version.

So my questions are: Is there a hard limit how many texts CQPweb can
handle? Is this a known bug of older versions and recent versions can
handle that? Or does another issue causes my problem and it has nothing
to do with the number of texts?

Here's the complete php backtrace:

array(5) {
   [1]=>
   array(4) {
     ["file"]=>
     string(41) "/var/www/cqpweb/lib/admin-install.inc.php"
     ["line"]=>
     int(584)
     ["function"]=>
     string(9) "exiterror"
     ["args"]=>
     array(1) {
       [0]=>
       string(104) "Corpus `tweets_covid' already exists on the system.
Please specify a different name for your new corpus."
     }
   }
   [2]=>
   array(4) {
     ["file"]=>
     string(35) "/var/www/cqpweb/lib/execute.inc.php"
     ["line"]=>
     int(179)
     ["function"]=>
     string(18) "install_new_corpus"
     ["args"]=>
     array(0) {
     }
   }
   [3]=>
   array(4) {
     ["file"]=>
     string(41) "/var/www/cqpweb/lib/admin-execute.inc.php"
     ["line"]=>
     int(122)
     ["args"]=>
     array(1) {
       [0]=>
       string(35) "/var/www/cqpweb/lib/execute.inc.php"
     }
     ["function"]=>
     string(7) "require"
   }
   [4]=>
   array(4) {
     ["file"]=>
     string(37) "/var/www/cqpweb/lib/adminhome.inc.php"
     ["line"]=>
     int(34)
     ["args"]=>
     array(1) {
       [0]=>
       string(41) "/var/www/cqpweb/lib/admin-execute.inc.php"
     }
     ["function"]=>
     string(7) "require"
   }
   [5]=>
   array(4) {
     ["file"]=>
     string(29) "/var/www/cqpweb/adm/index.php"
     ["line"]=>
     int(3)
     ["args"]=>
     array(1) {
       [0]=>
       string(37) "/var/www/cqpweb/lib/adminhome.inc.php"
     }
     ["function"]=>
     string(7) "require"
   }
}

Thanks a lot for your help!

Best,
Daniel


More information about the CWB mailing list