<html><head></head><body><link href='http://mail.yonsei.ac.kr/sens-static/css/mail/mail_view_ko.css' rel='stylesheet' type='text/css'><table width=99% style="text-align:left;"><tr valign=top><td style="font-size:9pt">


<p>Hi,</p><p> </p><p>I am building an English-Korean bilingual corpus using cwb-align-encode.</p><p> </p><p>So, I encoded and aligned.</p><p> </p><p>At firts it seemed that it worked.</p><p> </p><p>However I found a problem, when I checked the search results.</p><p> </p><p>Some first sentences were aligned as right pairs.</p><p>But the others were not.</p><p>It seems to be related with statistical aligning process.</p><p> </p><p>Actually I made two corpora so, that every pair sentence should have the same sentence id like &lt;s id="100"&gt; or &lt;s id="10000"&gt;, in order to avoid the failure of statistical alignment.</p><p>I am working with 60000 sentences. And I manually aligned all sentences and put the information into the xml tag "s_id".</p><p> </p><p>My question is how I can make useful the manually created xml tag "s_id"?</p><p> </p><p>Could anyone help me?</p><p> </p><p>I will appreciate your support.</p><p> </p><p>Thanks.</p><p> </p><p>Munich</p><br><table width=100% align=center><tr valign=top><td><font size=2></font></td></tr></table></td></tr></table></body></html>


<img id='mailexp' width=0 heigh=0 border=0 src='http://mail.yonsei.ac.kr/Mail?act=RECEIPT_CHECK&ukey=52e599173fdc33ae7f093824&userid=leemh&mhost=yonsei.ac.kr&ahost=d0001'></body></html>