<div dir="ltr">Dear all,<br><div class="gmail_quote"><div dir="ltr"><div><div><div><div><br></div>I was wondering if I had missed something when reading CWB documentation or there does not exist any trivial way to generate per text corpus statistics (eg. text_id, text_author, word_count, types_count etc.). I have already tried both external  (cwb-scan-corpus) and internal (query = []; then tabulate) approach, but without major success. I have also started to analyse CQPWeb php scripts in order to see how it populates mysql tables with frequency data, but it is not precisely what I was looking for (I am still digging, though).<br>


</div>I would like only to add that apart from using proper CWB/CQPWeb, I also used to manipulate my corpora from within R (with rcqp) and it would be a great aid if this sort of information could be easily retrieved.<br>


<br></div>Thanks for any hint<br></div>Chris<br></div>

</div></div>