[CWB] [cwb:feature-requests] #57 CQPweb: Apply annotation and XML templates to pre-indexed CWB corpus
Andrew Hardie
andrewhardie at users.sourceforge.net
Fri Feb 7 19:51:11 CET 2020
- **status**: open --> closed
- **Comment**:
Done in upcoming 3.3 but not tested yet.
---
** [feature-requests:#57] CQPweb: Apply annotation and XML templates to pre-indexed CWB corpus**
**Status:** closed
**Group:** TODO-3.5
**Labels:** CQPweb
**Created:** Tue Jul 30, 2019 10:45 AM UTC by Stefan Evert
**Last Updated:** Thu Aug 01, 2019 09:41 AM UTC
**Owner:** Andrew Hardie
When Installating a corpus that's already indexed in CWB, corpus annotation (p-attributes) and XML structure (s-attributes) have to be set up manually. This process would be a lot less cumbersome if annotation and XML templates could be used to assign names and data types.
The implementation should be relatively easy: it would look up relevant information in the template by attribute name and either ignore everything else or complain in the case of a mismatch.
Motivation: Sometimes indexing a corpus via the Web admin interface is undesirable, in particular (i) for very large corpora (>> 100 M words), where uploading the .vrt file may not be possible and indexing will take a very long time; and (ii) if the corpus does not exist as a .vrt file in the first place, i.e. if annotation (both p-attributes and s-attributes) has been added to the CWB-indexed version (and CWB cannot export the .vrt format expected by CQPweb).
---
Sent from sourceforge.net because cwb at sslmit.unibo.it is subscribed to https://sourceforge.net/p/cwb/feature-requests/
To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/cwb/admin/feature-requests/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20200207/3bdbb8b7/attachment.html>
More information about the CWB
mailing list