[CWB] [cwb:feature-requests] #57 CQPweb: Apply annotation and XML templates to pre-indexed CWB corpus

Andrew Hardie andrewhardie at users.sourceforge.net
Fri Feb 7 19:51:11 CET 2020


- **status**: open --> closed
- **Comment**:

Done in upcoming 3.3 but not tested yet.



---

** [feature-requests:#57] CQPweb: Apply annotation and XML templates to pre-indexed CWB corpus**

**Status:** closed
**Group:** TODO-3.5
**Labels:** CQPweb 
**Created:** Tue Jul 30, 2019 10:45 AM UTC by Stefan Evert
**Last Updated:** Thu Aug 01, 2019 09:41 AM UTC
**Owner:** Andrew Hardie


When Installating a corpus that's already indexed in CWB, corpus annotation (p-attributes) and XML structure (s-attributes) have to be set up manually.  This process would be a lot less cumbersome if annotation and XML templates could be used to assign names and data types.

The implementation should be relatively easy: it would look up relevant information in the template by attribute name and either ignore everything else or complain in the case of a mismatch.

Motivation: Sometimes indexing a corpus via the Web admin interface is undesirable, in particular (i) for very large corpora (>> 100 M words), where uploading the .vrt file may not be possible and indexing will take a very long time; and (ii) if the corpus does not exist as a .vrt file  in the first place, i.e. if annotation (both p-attributes and s-attributes) has been added to the CWB-indexed version (and CWB cannot export the .vrt format expected by CQPweb).


---

Sent from sourceforge.net because cwb at sslmit.unibo.it is subscribed to https://sourceforge.net/p/cwb/feature-requests/

To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/cwb/admin/feature-requests/options.  Or, if this is a mailing list, you can unsubscribe from the mailing list.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20200207/3bdbb8b7/attachment.html>


More information about the CWB mailing list