[CWB] xml regions in cwb-lexdecode

Stefan Evert stefanML at collocations.de
Sat May 23 12:15:10 CEST 2020


Hi Simon,

not with cwb-lexdecode, because that accesses the built-in frequency list for the entire corpus.

What you want is

	cwb-scan-corpus -o freqlist.txt CORPUS lemma+0 '?lemma+0=/SPD/'

Best,
Stefan

> On 23 May 2020, at 10:11, Simon Meier-Vieracker <simon.meier-vieracker at tu-dresden.de> wrote:
> 
> am I right that it is NOT possible to restrict cwb-lexdecode to certain xml regions as defined by xml attributes?
> 
> My corpus contains xml tags like
> 
>> <speech speaker="Timon Gremmels" fraction="SPD">
> 
> And I would like to generate a frequency list of all parts of the corpus tagged with speech_fraction="SPD"
> 
> It is possible to do this in CWB with this query:
> 
>> [] :: match.speech_fraction="SPD";
>> count by lemma;
> 



More information about the CWB mailing list