[CWB] File format of encoded cwb corpora
    Hardie, Andrew 
    a.hardie at lancaster.ac.uk
       
    Fri Jul 13 16:08:10 CEST 2012
    
    
  
-----Original Message-----
From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Stefan Evert 
>>>> There's no formal specification of the precise file format
Arguably there should be, however, especially if we need to change it and thus have to deal with format versioning. Moreover, having obtained (and read) a copy of the "Managing Gigabytes" book, I personally don't think the book alone alone adequately documents the technical details of the binary format: for a full understanding of how CWB does it, the book has to be read alongside the indexing code. 
Yet another thing for the TODO list!
Andrew.
    
    
More information about the CWB
mailing list