[CWB] Virtual seminar POSTPONED: Embedding CWB in a CL Workflow | Finite State Queries
Stefan Evert
stefanML at collocations.de
Sun Jan 24 16:21:38 CET 2021
Unfortunately, the CWB virtual seminar announced last week has to be postponed. The new date is
Wednesday, 17 February 2021, 16:15–17:45 CET
You are still welcome to register for the new date in order to obtain the Zoom link (free of charge, of course).
Best,
Stefan
> On 16 Jan 2021, at 19:28, Stefan Evert <stefanML at collocations.de> wrote:
>
> Dear CWB aficionados,
>
> as part of the colloquium of my research group, there will be an online presentation concerned with applications of CWB and a glimpse into the query “engine room” of CQP.
>
> Philipp Heinrich & Stefan Evert (CCL, FAU Erlangen-Nürnberg)
>
> News from the Corpus Workbench (CWB):
> Embedding CWB in a CL Workflow | Finite State Queries
>
> Wednesday, 27 January 2021, 16:15–17:45 CET
>
> https://www.linguistik.phil.fau.de/teaching/oberseminar/#2021_01_27
>
> The presentation will be given in English via a regular Zoom videoconference. Anybody interested is welcome to attend. Please send us an e-mail (to stefan.evert at fau.de) in order to obtain the Zoom link (which we don't want to share on any public Web page).
>
> Feel free to share this e-mail with anyone else who might be interested.
>
> Best & hope to see some of you at the presentation,
> Stefan
>
>
> ABSTRACT
>
> Many powerful corpus query engines – notably the IMS Open Corpus
> Workbench (CWB), the (No)Sketch Engine, and several other tools inspired
> by them – offer a query language based on generalised regular expressions
> (formulated over complex token descriptions rather than individual
> characters). This enables researchers to locate lexico-grammatical
> patterns of interest and collect corpus instances in a concordance. Many
> applications of corpus linguistics – notably corpus-based discourse
> analysis and computational lexicography – are furthermore in need of
> collocations or word sketches, as well as dispersion and keyword analyses
> (based on metadata annotation included in the corpus).
>
> The first part of the talk gives a practical introduction to cwb-ccc, an
> open-source Python package that translates CWB query results into pandas
> dataframes and then performs collocation analyses for different contexts.
> It also offers keyword analysis for subcorpora defined by metadata
> constraints.
>
> The second part of the talk gives the first publicly available
> introduction to the CWB implementation of corpus queries by
> non-deterministic simulation of finite-state automata. It also addresses
> pitfalls and limitations of finite-state queries, in particular certain
> corner cases that may not be evaluated correctly.
>
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb