[CWB] Generating list of POS tags used in a corpus

Serge Heiden slh at ens-lyon.fr
Sun Jul 22 12:07:08 CEST 2012


Hi Josep,

TXM which is a wrapper around CQP is designed to allow you to do exactly 
that: http://sf.net/projects/txm

Best,
Serge


Selon Josep M. Fontana le 22/07/2012 11:40:
> Hi,
>
> I was wondering whether there is a CQP command (or some other way in 
> CWB) to generate a list of the pos tags that have been used in a 
> corpus. I am exploting some corpora for which I don't have any 
> documentation and I would like to have a clear idea about what tagset 
> has been used and what has been encoded in the different tags.
>
> I've searched through the last versions of the CQP language tutorial 
> and the Corpus encoding tutorial but I haven't been able to find 
> anything relevant. Any help will be greatly appreciated.
>
> Josep M.
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb

-- 
Dr. Serge Heiden, slh at ens-lyon.fr, http://textometrie.ens-lyon.fr
ENS de Lyon/CNRS - ICAR UMR5191, Institut de Linguistique Française
15, parvis René Descartes 69342 Lyon BP7000 Cedex, tél. +33(0)622003883



More information about the CWB mailing list