Hi all,<div>I have a pretty simple question: is there any way to append text to an existing corpus?</div><div><br></div><div>We're working on a corpus based on data collected from a webcrawler and would like to periodically update the corpus with new data from the crawler. From the documentation I found info on how to add annotations to existing corpora etc., but I can't find anything about simply appending new data to an existing corpus. </div>
<div><br></div><div>Decoding the entire corpus, adding the new data to the generated file and re-encoding the new file is an option, but the server we're running on isn't exactly fast. Any way to save a few CPU cycles and directly insert the new data into the existing corpus? Perhaps there's some functionality to combine two corpora into one?</div>
<div><br></div><div>Thanks,</div><div>Nik</div>