[CWB] offline-freqlists.php: Invalid utf8 character string: ''

Stefan Fischer stefan.fischer at uni-saarland.de
Fri Nov 29 20:06:44 CET 2019


Dear all,

We are trying to import a CWB-encoded corpus into CQPweb. The source 
texts are in UTF-8 and queries for non-ASCII words ([word="dieſ"]) work 
both in the CWB and the CQPweb version. Unfortunately, we cannot 
complete the corpus setup as offline-freqlists.php crashes with the PHP 
backtrace below.

I would be grateful for any advice.

Thanks in advance,
Stefan

----

PHP debugging backtrace
=======================
array(4) {
   [1]=>
   array(4) {
     ["file"]=>
     string(40) "/var/www/html/cqpweb/lib/library.inc.php"
     ["line"]=>
     int(299)
     ["function"]=>
     string(20) "exiterror_mysqlquery"
     ["args"]=>
     array(3) {
       [0]=>
       int(1300)
       [1]=>
       string(33) "Invalid utf8 character string: ''"
       [2]=>
       string(227) "LOAD DATA LOCAL INFILE 
'/data2/cqpweb/cache/______tempfreq_dta_17_09_web.tbl' INTO TABLE 
`__tempfreq_dta_17_09_web` FIELDS ESCAPED BY ''
	/* from User: cqpwebAdmin | Function: corpus_make_freqtables() | 
2019-Nov-26 04:34:59 */"
     }
   }
   [2]=>
   array(4) {
     ["file"]=>
     string(40) "/var/www/html/cqpweb/lib/library.inc.php"
     ["line"]=>
     int(423)
     ["function"]=>
     string(14) "do_mysql_query"
     ["args"]=>
     array(1) {
       [0]=>
       &string(227) "LOAD DATA LOCAL INFILE 
'/data2/cqpweb/cache/______tempfreq_dta_17_09_web.tbl' INTO TABLE 
`__tempfreq_dta_17_09_web` FIELDS ESCAPED BY ''
	/* from User: cqpwebAdmin | Function: corpus_make_freqtables() | 
2019-Nov-26 04:34:59 */"
     }
   }
   [3]=>
   array(4) {
     ["file"]=>
     string(42) "/var/www/html/cqpweb/lib/freqtable.inc.php"
     ["line"]=>
     int(124)
     ["function"]=>
     string(21) "do_mysql_infile_query"
     ["args"]=>
     array(3) {
       [0]=>
       string(24) "__tempfreq_dta_17_09_web"
       [1]=>
       string(52) "/data2/cqpweb/cache/______tempfreq_dta_17_09_web.tbl"
       [2]=>
       bool(true)
     }
   }
   [4]=>
   array(4) {
     ["file"]=>
     string(46) "/var/www/html/cqpweb/bin/offline-freqlists.php"
     ["line"]=>
     int(133)
     ["function"]=>
     string(22) "corpus_make_freqtables"
     ["args"]=>
     array(1) {
       [0]=>
       string(13) "dta_17_09_web"
     }
   }
}


More information about the CWB mailing list