[CWB] UTF-8 encoding problem

Serge Heiden slh at ens-lyon.fr
Tue Apr 23 14:21:10 CEST 2013


Hi Jie,

An alternative would be to install the TXM software
which includes Unicode compatible CWB binaries:
https://sourceforge.net/projects/txm
It always bundles current CWB binaries for Linux32
and Linux64 (currently version 3.4.1), found in
the "/usr/lib/TXM/cwb/bin" directory.
[binaries also available for Win32, Win64 and
Mac OS X]

Best,
Serge

Le 23/04/2013 11:36, Hardie, Andrew a écrit :
> Hi Jie Jiang,
>
> If full UTF8 support is important to you then v3.0 is not going to be
> good enough for the job. You need the most recent version (as of now,
> that’s 3.4.6).
>
> If you’re a Linux user, the only way to get v 3.4.* is to build it from
> source following the instructions here:
>
> http://cwb.sourceforge.net/developers.php
>
> basically,
>
> *svn co http://svn.code.sf.net/p/cwb/code/cwb/trunk cwb*
>
> And then the file “INSTALL” has instructions on making
> build-plus-install easy.
>
> best
>
> Andrew.
>
> *From:*cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it]
> *On Behalf Of *Jie Jiang
> *Sent:* 23 April 2013 10:30
> *To:* cwb at sslmit.unibo.it
> *Subject:* [CWB] UTF-8 encoding problem
>
> Hi all:
>
> I'm a newbie in CWB. It is a great tool, and I do wish to use it for
> corpus management.
>
> However, I'm quite concerned that UTF-8 is not well supported as
> reported in the documentation for version 3.0, so I'm wondering how to
> word around this issue since UTF-8 is very important for me.
>
> By the way, I noticed version 3.2 and 3.4 are only available as windows
> installers on SF, but are they available for Linux users as well please?
>
> Thank you in advance!
>
>
> Best regards!
>
> Jie Jiang
>
>
>
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb
>

-- 
Dr. Serge Heiden, slh at ens-lyon.fr, http://textometrie.ens-lyon.fr
ENS de Lyon/CNRS - ICAR UMR5191, Institut de Linguistique Française
15, parvis René Descartes 69342 Lyon BP7000 Cedex, tél. +33(0)622003883


More information about the CWB mailing list