<html>
<head>
<meta content="text/html; charset=windows-1252"
http-equiv="Content-Type">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<div class="moz-cite-prefix">Hi, <br>
if this is TEI, I can send you my XSLT script. <br>
Best, <br>
Ruprecht<br>
<br>
Am 17.12.2014 um 11:25 schrieb Ingrid Sör:<br>
</div>
<blockquote
cite="mid:%3CCAE_XA5BonucfW1-bZhpSFgFjw5Gw5Y50Zo8-PnKNJDqLykKq8g@mail.gmail.com%3E"
type="cite">
<meta http-equiv="Content-Type" content="text/html;
charset=windows-1252">
<div dir="ltr">
<div>
<div>
<div>
<div>Hi,<br>
<br>
</div>
<div>I hope this is the right forum for my following
questions..<br>
</div>
I am trying to get frequency data of Swedish nouns from
certain corpora in the Swedish "Språkbanken". They have
their files available for download in xml format, so I am
now trying to make them usable with CWB. I read in the CWB
encoding tutorial that the files need to be in .vrt-format
to encode them and that this can be done easily via XSLT.
<br>
<br>
</div>
Is this the best way to go about things? I am not familiar
with XSLT really and I think it will take some time to learn
how to do it on my own, so if XSLT is the solution I would
be very grateful if anyone might have a "standard" xslt code
for me to adapt. Or if there is any other way? I have been
using <i>sed </i>in my ubuntu terminal to get each tag or
word onto a new line, but this seems a complicated way to
also make the p-attributes tab-separated (as they are now
inside <w> tags).<br>
<br>
</div>
<div>Sorry if I am probably asking about rudimentary things
now - I am very new to CWB and corpus work. Thanks for any
help!<br>
</div>
</div>
Best regards,<br>
Ingrid<br>
</div>
</blockquote>
<br>
<br>
</body>
</html>