[CWB] A few miscellaneous questions

Scott Sadowsky ssadowsky at gmail.com
Sat Nov 30 09:32:13 CET 2019


I've got a few miscellaneous questions about searching in CQPweb using CQP
syntax.

*1.* I've tagged each utterance with a unique serial number that's stored
in the s_utterance s-attribute. It's encoded as free text. I'd like to be
able to query specific utterances by number, e.g. s_utterance="1287117",
and get a single result just once -- the entirety of the utterance.

The best attempts I've been able to come up with are kludgy and return one
hit per token in the utterance:

    [word=".+"] :: match.s_utterance="12887"
    [] :: match.s_utterance="12887"

Is there a better way to do this?


*2.* Performing *case-sensitive* queries of words is of limited use to me
(and likely others). However, it's the default with CQP syntax queries.
This is different from both the simple query syntax and search engine
syntax, which makes it very easy to forget to add %c to every single query
element.

Is there any way to set searching to be case-*insensitive* by default?


*3.* In a similar vein, searching across sentence/utterance boundaries is
of limited usefulness, but it is also the default. This can, of course, be
dealt with by adding "within s" to every query, but that's a lot of typing
over time, it's not intuitive to many users, and it's also easy to forget.

Can queries somehow be set to not cross sentence/utterance boundaries by
default?

Thanks in advance!
Scott