I was afraid it would be something like that. Modifying the interface to query multiple corpora at once is worth a shot; I'll give that a try. Thanks for the help, guys.<div class="gmail_extra"><br><br><div class="gmail_quote">
On Thu, Nov 8, 2012 at 5:56 PM, Hardie, Andrew <span dir="ltr">&lt;<a href="mailto:a.hardie@lancaster.ac.uk" target="_blank">a.hardie@lancaster.ac.uk</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div>
And now I see Stefan had already replied, in greater detail and more helpfully. Oops!
<span class="HOEnZb"><font color="#888888"><div><br>
</div>
<div>Andrew.</div></font></span><div><div class="h5">
<br>
<br>
<br>
Nik &lt;<a href="mailto:cqplist@nikvdp.com" target="_blank">cqplist@nikvdp.com</a>&gt; wrote:<br>
<br>
<br>
<div>Hi all,
<div>I have a pretty simple question: is there any way to append text to an existing corpus?</div>
<div><br>
</div>
<div>We're working on a corpus based on data collected from a webcrawler and would like to periodically update the corpus with new data from the crawler. From the documentation I found info on how to add annotations to existing corpora etc., but I can't find
anything about simply appending new data to an existing corpus. </div>
<div><br>
</div>
<div>Decoding the entire corpus, adding the new data to the generated file and re-encoding the new file is an option, but the server we're running on isn't exactly fast. Any way to save a few CPU cycles and directly insert the new data into the existing corpus?
Perhaps there's some functionality to combine two corpora into one?</div>
<div><br>
</div>
<div>Thanks,</div>
<div>Nik</div>
</div>
</div></div></div>
<br>_______________________________________________<br>
CWB mailing list<br>
<a href="mailto:CWB@sslmit.unibo.it">CWB@sslmit.unibo.it</a><br>
<a href="http://devel.sslmit.unibo.it/mailman/listinfo/cwb" target="_blank">http://devel.sslmit.unibo.it/mailman/listinfo/cwb</a><br>
<br></blockquote></div><br></div>