[CWB] offline-freqlists.php: Invalid utf8 character string: ''

Hardie, Andrew a.hardie at lancaster.ac.uk
Fri Nov 29 20:18:25 CET 2019


Hi Stefan,

Looks like you have a pretty old version. Can you upgrade, retry in 3.2.40 and check whether the issue persists?

best

Andrew.


-----Original Message-----
From: cwb-bounces at sslmit.unibo.it <cwb-bounces at sslmit.unibo.it> On Behalf Of Stefan Fischer
Sent: 29 November 2019 19:07
To: cwb at sslmit.unibo.it
Subject: [External Sender] [CWB] offline-freqlists.php: Invalid utf8 character string: ''

This email originated from outside of the University. Do not click links or open attachments unless you recognise the sender and know the content is safe.

Dear all,

We are trying to import a CWB-encoded corpus into CQPweb. The source texts are in UTF-8 and queries for non-ASCII words ([word="dieſ"]) work both in the CWB and the CQPweb version. Unfortunately, we cannot complete the corpus setup as offline-freqlists.php crashes with the PHP backtrace below.

I would be grateful for any advice.

Thanks in advance,
Stefan

----

PHP debugging backtrace
=======================
array(4) {
   [1]=>
   array(4) {
     ["file"]=>
     string(40) "/var/www/html/cqpweb/lib/library.inc.php"
     ["line"]=>
     int(299)
     ["function"]=>
     string(20) "exiterror_mysqlquery"
     ["args"]=>
     array(3) {
       [0]=>
       int(1300)
       [1]=>
       string(33) "Invalid utf8 character string: ''"
       [2]=>
       string(227) "LOAD DATA LOCAL INFILE '/data2/cqpweb/cache/______tempfreq_dta_17_09_web.tbl' INTO TABLE `__tempfreq_dta_17_09_web` FIELDS ESCAPED BY ''
        /* from User: cqpwebAdmin | Function: corpus_make_freqtables() |
2019-Nov-26 04:34:59 */"
     }
   }
   [2]=>
   array(4) {
     ["file"]=>
     string(40) "/var/www/html/cqpweb/lib/library.inc.php"
     ["line"]=>
     int(423)
     ["function"]=>
     string(14) "do_mysql_query"
     ["args"]=>
     array(1) {
       [0]=>
       &string(227) "LOAD DATA LOCAL INFILE '/data2/cqpweb/cache/______tempfreq_dta_17_09_web.tbl' INTO TABLE `__tempfreq_dta_17_09_web` FIELDS ESCAPED BY ''
        /* from User: cqpwebAdmin | Function: corpus_make_freqtables() |
2019-Nov-26 04:34:59 */"
     }
   }
   [3]=>
   array(4) {
     ["file"]=>
     string(42) "/var/www/html/cqpweb/lib/freqtable.inc.php"
     ["line"]=>
     int(124)
     ["function"]=>
     string(21) "do_mysql_infile_query"
     ["args"]=>
     array(3) {
       [0]=>
       string(24) "__tempfreq_dta_17_09_web"
       [1]=>
       string(52) "/data2/cqpweb/cache/______tempfreq_dta_17_09_web.tbl"
       [2]=>
       bool(true)
     }
   }
   [4]=>
   array(4) {
     ["file"]=>
     string(46) "/var/www/html/cqpweb/bin/offline-freqlists.php"
     ["line"]=>
     int(133)
     ["function"]=>
     string(22) "corpus_make_freqtables"
     ["args"]=>
     array(1) {
       [0]=>
       string(13) "dta_17_09_web"
     }
   }
}
_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it
https://eur02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fliste.sslmit.unibo.it%2Fmailman%2Flistinfo%2Fcwb&amp;data=02%7C01%7Ca.hardie%40lancaster.ac.uk%7Cd986489b45dd455594e608d7750048c0%7C9c9bcd11977a4e9ca9a0bc734090164a%7C1%7C1%7C637106516411126050&amp;sdata=k0nZjSQQH5h7EaKXi%2BGcGxF06QIXrgbnrBjkZ4oLnlo%3D&amp;reserved=0


More information about the CWB mailing list