[CWB] cwb-scan-corpus

Simon Meier-Vieracker simon.meier-vieracker at tu-dresden.de
Fri Nov 1 08:57:38 CET 2019


Hi,

I am trying to access frequency informations (trigrams) with cwb-scancorpus.

It works fine with this command:

cwb-scan-corpus CORPUS lemma+0 lemma+1 lemma+2 > trigrams.txt

However, I would like to filter out sentence-ending punctuations as tagged with '$.'
I tried something like

cwb-scan-corpus CORPUS lemma+0 lemma+1 lemma+2 ?pos+0=/\$\./ ?pos+1=/\$\./ ?pos+2=/\$\./ > trigrams.txt

but then I get no results. I do have to escape special characters like '$', I guess? What am I doing wrong?

Thanks in advance!
Simon


-------------- n?chster Teil --------------
Ein Dateianhang mit HTML-Daten wurde abgetrennt...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20191101/41908d69/attachment.html>


More information about the CWB mailing list