[CWB] Bilingual corpus alignment
Austin Yang
austin.yang.2014 at gmail.com
Mon Oct 3 03:43:53 CEST 2022
Dear all,
Recently I've encountered a problem using cwb's alignment encoding function.
"Problem" might not be the accurate word but, I used a different alignment
tool and fitted into cwb's standard format, and ran the regedit and encode
procedure. This created an alx file in the source language index file. The
tutorial says "This procedure only creates an a-attribute in HOLMES-EN
(source corpus), linking it to HOLMES-DE (target corpus).", but that's all
I can find. I don't know how to use cqp/cwb to present sentence alignment
(i.e. I imagine querying "Sherlock" in the source corpus, it will present
both the English and Dutch sentence including "Sherlock"). The attachment
shows the command and output. I'm not even sure if the alignment is
successful or not. Any help or information that sheds some light to this
situation will be greatly appreciated!
Best,
Austin Yang (楊承洋)
MS in Cognitive Neuroscience, NCU
BS in Psychology, CYCU
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20221003/d7a889e8/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 2022-10-03 09_40_48-ay2014 at 2024us-trt-1_ _var_CQPweb_registry.png
Type: image/png
Size: 5968 bytes
Desc: not available
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20221003/d7a889e8/attachment-0003.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 2022-10-03 09_40_22-ay2014 at 2024us-trt-1_ _var_CQPweb_registry.png
Type: image/png
Size: 4143 bytes
Desc: not available
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20221003/d7a889e8/attachment-0004.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 2022-10-03 09_43_19-_var_CQPweb_registry_test.algn - ay2014 at 120.127.233.176 - ??? - WinSCP.png
Type: image/png
Size: 21047 bytes
Desc: not available
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20221003/d7a889e8/attachment-0005.png>
More information about the CWB
mailing list