<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">Hi Jiajin, <br>
<br>
This solution will definitely be of help for the most careful and
knowledgeable users. Not that I'm either of those but I remember I
went back to the "view corpus metadata" page a couple of times so
I would probably have noticed those links you added now and I
would have found the info about the right tagset. But definitely,
having the link to the CLAWS7 tagset next to "parts-of-speech-tag"
might still confuse some. <br>
<br>
JM<br>
<br>
<br>
</div>
<blockquote
cite="mid:CAA9s2DzLRq48c8G3NSRtcQQXkHxqV2D-EkRXEn5iLj81QJPPvw@mail.gmail.com"
type="cite">Hi JM, Andrew, and Ray,<br>
<br>
The only thing I could do for the Corpus Info section of the
lefthand menu of CQPweb is the Corpus documentation.<br>
<br>
In our Icelandic corpus interface (<a moz-do-not-send="true"
href="http://124.193.83.252/cqp/IcePaHC/">http://124.193.83.252/cqp/IcePaHC/</a>,
ID: test, Pass: test), I added the link of the official site of
the Historical Icelandic corpus to <a moz-do-not-send="true"
href="http://www.linguist.is/icelandic_treebank/Icelandic_Parsed_Historical_Corpus_%28IcePaHC%29">http://www.linguist.is/icelandic_treebank/Icelandic_Parsed_Historical_Corpus_%28IcePaHC%29</a>,
which provides all useful information of the corpus, including the
tagset used (<a moz-do-not-send="true"
href="http://www.linguist.is/icelandic_treebank/Tagset">http://www.linguist.is/icelandic_treebank/Tagset</a>),
and the download links. Andrew's trial use of Q-A tag has to be
the parsed part of the corpus, as the corpus has been both
PoS-tagged and parsed.<br>
<br>
I hope the information above helps.<br>
<br>
Best,<br>
<br>
Jiajin<br>
<br>
Jiajin XU<br>
Ph.D., associate professor<br>
National Research Centre for Foreign Language Education<br>
Beijing Foreign Studies University<br>
Beijing 100089<br>
China<br>
Email: <a moz-do-not-send="true"
href="mailto:xujiajin@bfsu.edu.cn">xujiajin@bfsu.edu.cn</a><br>
<br>
<br>
<br>
<div class="gmail_quote">On Fri, Oct 26, 2012 at 2:10 AM, "Andrés
Chandía" <span dir="ltr"><<a moz-do-not-send="true"
href="mailto:andres@chandia.net" target="_blank">andres@chandia.net</a>></span>
wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex">Hi JM<br>
We've been able of making a query like the one you describe,
our tagset is
personalized so if we want to look for a "Name folloewd by an
Adjective" we do next
at the SQL: "_N* _A*" (wothout the doublequotes<br>
<br>
if we want to look for a
secondary tag like, let say gender, the query is like this:
{M} (for masculine)<br>
<br>
that means:<br>
query for primary anotation tag = _*<br>
query for secondary anotation tag
= {*}<br>
<br>
What we haven't been able to do is find the combination to
query for a
"Noun Masculine" for instance, we have tried many combinations
with no success (
_N{M} - {N/M}, etc.) so if somebody could help us with this we
would appreciate it a lot.<br>
<br>
@ch<br>
<br>
<br>
El Jue, 25 de Octubre de 2012, 13:04, Josep M. Fontana
escribió:<br>
<div>
<div>Hi,<br>
<br>
I am a little (or quite) confused
about the syntax of CQPweb queries (simple query
language). I went to the wonderful
resource Ray Wu has made available so that I could see how
it works since we are
in the process of installing CQPweb as an interface for
our corpora. I wasn't able to
complete any search using the simple query language,
though. I'm sure it is something
very simple that I am missing. From what I understand
reading the document 'simple query language syntax', I
should be able to do the following in the simple query
mode:<br>
<br>
_JJ _NN1 <br>
<br>
which would supposedly look
for sequences of an adjective followed by noun according
to the CLAWS tag set. <br>
<br>
OK, I'm conducting the searches in the Old Icelandic
Corpus which has
been supposedly tagged using the CLAWS7 tagset (according
to the information in
"View corpus metadata". When I do this, however, I get a
message saying
"Your query had no results. There are no matches for your
query." This is very
puzzling because you would imagine that there would be
occurrences of adjectives followed by nouns. Doing it the
opposite order (_NN1 _JJ) gives me the same results.
What is even more puzzling is that I also get nothing
using single POS labels such as
_NN1 by itself or _JJ. <br>
<br>
Am I doing something wrong or is this due to
the fact that this particular corpus uses a completely
different tagset? When you access a CQPWeb corpus, is
there any way to retrieve the tags that have been used in
the
corpus? The only relevant info I find in this corpus is
the link to the CLAWS7 tagset
but, as I said, this doesn't seem to be the right
information. Going into the CQP syntax mode and doing
"show +pos" doesn't work. <br>
<br>
<br>
JM<br>
<br>
</div>
<blockquote type="cite">
<div
style="line-height:1.7;font-size:14px;font-family:arial">Dear
members,<br>
<br>
We are pleased to announce another CWB/CQPweb setup in
China and we dub it BFSU CQPweb. It is closely modelled
after Hardie's own (sorry Andrew, we're badly in need of
imagination) and currently features more than 20
corpora, including two Brown family cousins (CLOB and
Crown) developed at Beijing
Foreign Studies Unversity by Dr. Xu Jiajing and
Professor Liang Maocheng. <br>
<br>
You may access it from <a moz-do-not-send="true"
title="Este enlace externo se abrirá en una nueva
ventana" href="http://124.193.83.252/cqp/"
target="_blank">http://124.193.83.252/cqp/</a> using
test/test as
username/password. <br>
<br>
We'd like to take this opportunity to thank
the CWB team for their wonderful work and generosity. It
is great fun to build our work on their shoulders.<br>
<br>
<div><span style="color:rgb(128,128,128)"><span
style="color:rgb(128,128,128)"><span
style="color:rgb(0,0,0)"><span
style="color:rgb(0,0,0)">Best,</span><br>
<span style="color:rgb(0,0,0)">Ray<br>
</span></span></span></span></div>
</div>
</blockquote>
</div>
</blockquote>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
CWB mailing list
<a class="moz-txt-link-abbreviated" href="mailto:CWB@sslmit.unibo.it">CWB@sslmit.unibo.it</a>
<a class="moz-txt-link-freetext" href="http://devel.sslmit.unibo.it/mailman/listinfo/cwb">http://devel.sslmit.unibo.it/mailman/listinfo/cwb</a>
</pre>
</blockquote>
<br>
</body>
</html>