[CWB] Restrictions on lemma annotation

Graham Ranger -- UAPV graham.ranger at univ-avignon.fr
Sat May 31 11:43:10 CEST 2025


Hello,
In a corpus I'm setting up, using treetagger with a parameter file for 
classical French, there are a number of alternative lemmata, i.e. things 
like:
eau    Nc    eau|eaux [Nc: common noun]
I'm not entirely sure why, since there is no ambiguity here, but as a 
result it is impossible to search for the lemma "eau".
Are there any solutions to other than simply opting to remove the pipe 
and what comes after it from column three of the vrt file to allow 
querying only for the first choice of lemma?
Many thanks in advance.
Graham.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20250531/e3ad3184/attachment.html>


More information about the CWB mailing list