[CWB] [cwb:feature-requests] #57 CQPweb: Apply annotation and XML templates to pre-indexed CWB corpus
Andrew Hardie
andrewhardie at users.sourceforge.net
Thu Aug 1 11:41:22 CEST 2019
- **labels**: --> CQPweb
- **assigned_to**: Andrew Hardie
---
** [feature-requests:#57] CQPweb: Apply annotation and XML templates to pre-indexed CWB corpus**
**Status:** open
**Group:** TODO-3.5
**Labels:** CQPweb
**Created:** Tue Jul 30, 2019 10:45 AM UTC by Stefan Evert
**Last Updated:** Tue Jul 30, 2019 10:45 AM UTC
**Owner:** Andrew Hardie
When Installating a corpus that's already indexed in CWB, corpus annotation (p-attributes) and XML structure (s-attributes) have to be set up manually. This process would be a lot less cumbersome if annotation and XML templates could be used to assign names and data types.
The implementation should be relatively easy: it would look up relevant information in the template by attribute name and either ignore everything else or complain in the case of a mismatch.
Motivation: Sometimes indexing a corpus via the Web admin interface is undesirable, in particular (i) for very large corpora (>> 100 M words), where uploading the .vrt file may not be possible and indexing will take a very long time; and (ii) if the corpus does not exist as a .vrt file in the first place, i.e. if annotation (both p-attributes and s-attributes) has been added to the CWB-indexed version (and CWB cannot export the .vrt format expected by CQPweb).
---
Sent from sourceforge.net because cwb at sslmit.unibo.it is subscribed to https://sourceforge.net/p/cwb/feature-requests/
To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/cwb/admin/feature-requests/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20190801/b819ade1/attachment.html>
More information about the CWB
mailing list