[CWB] N-grams
Hardie, Andrew
a.hardie at lancaster.ac.uk
Mon May 6 08:41:38 CEST 2024
Not at present: the CQPweb interface does not offer this.
Best,
Andrew.
From: cwb-bounces at sslmit.unibo.it <cwb-bounces at sslmit.unibo.it> On Behalf Of Graham Ranger
Sent: Tuesday, April 30, 2024 1:24 PM
To: cwb at sslmit.unibo.it
Subject: Re: [CWB] N-grams
Hi Simon,
Many thanks for this! I wonder whether there is also a way to do this via the CQPweb interface. I ask because I'm using this in a classroom context as well as for research.
All best wishes,
Graham.
On 30/04/2024 at 14:07, Simon Meier-Vieracker wrote:
Hi Graham,
You can use cwb-scan-corpus; please see the Encoding Tutorial (https://cwb.sourceforge.io/files/CWB_Encoding_Tutorial.pdf), p. 15.
E.g.
$ cwb-scan-corpus VSS pos+0 pos+1 pos+2 | sort -nr -k 1 | head -20
for part-of-speech trigrams.
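
The same pattern should carry over to other attributes and n-gram sizes. As a minimal sketch, the ranking step can be wrapped in a small helper; top_ngrams is a hypothetical name, and the sketch assumes that cwb-scan-corpus prints one line per n-gram with the frequency in the first column, and that the corpus (VSS here, as above) has a word attribute:

```shell
# Hypothetical helper: rank n-gram counts read from stdin.
# Assumes each input line starts with a frequency, followed by the n-gram items.
# Optional first argument: number of lines to keep (default 20).
top_ngrams() {
    sort -k1,1nr | head -n "${1:-20}"
}

# Intended usage (assumes a corpus VSS with a 'word' attribute):
#   cwb-scan-corpus VSS word+0 word+1 | top_ngrams 20    # word bigrams
#   cwb-scan-corpus VSS pos+0 pos+1 pos+2 | top_ngrams   # POS trigrams
```

Keeping the ranking in a separate function means only the attribute list (word+0 word+1, pos+0 pos+1 pos+2, and so on) has to change from one n-gram type to the next.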
Best, Simon
On 30.04.2024 at 13:38, graham.ranger <graham.ranger at univ-avignon.fr> wrote:
This must have been asked already, and I see that it was once a feature request, but does anybody know whether there is now a way to generate n-grams for a given corpus via a CQL query?
Many thanks as always,
Graham.
_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it
http://liste.sslmit.unibo.it/mailman/listinfo/cwb