[CWB] xml regions in cwb-lexdecode

Stefan Evert stefanML at collocations.de
Sat May 23 13:15:59 CEST 2020



> On 23 May 2020, at 13:09, Stefan Evert <stefanML at collocations.de> wrote:
> 
> 	cwb-scan-corpus -o freqlist.txt CORPUS lemma+0 '?speech_fraction+0=/SPD/'

And by way of explanation: the "+0" is redundant in both cases, but clarifies the way cwb-scan-corpus works.  The leading "?" turns the scan key into a constraint, which is used to filter tokens but will not be included in the frequency counts.

Best,
STefan


More information about the CWB mailing list