[CWB] Can't use sentence (s) as position label

Scott Sadowsky ssadowsky at gmail.com
Wed Aug 28 10:47:47 CEST 2019


On Wed, Aug 28, 2019 at 2:46 AM Hardie, Andrew <a.hardie at lancaster.ac.uk>
wrote:

What I mean is that you can’t use XML elements, only XML attributes (And it
> is the *value* of the attribute that appears). These must first exist in
> your corpus of course. So, if you want to use sentence numbers, you need
> to have a sentence number attribute (call it num, n or whatever) on <s> .
> If you only have <s> … </s> with no attribute-values, then there is nothing
> to use as a label. And that requires having <s num=”…”> in your corpus
> input files, and declaring num as an attribute of s  when indexing. That
> internally will define s_num.
>

Thanks for the explanation, Andrew. It looks like the solution would be a
script that reads through the tagged texts and adds a number from a counter
to all the <s> tags, then.

Best wishes,
Scott
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20190828/e1f932ab/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 34027 bytes
Desc: not available
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20190828/e1f932ab/attachment-0001.png>


More information about the CWB mailing list