<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" 
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
    xmlns:admin="http://webns.net/mvcb/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd">
	<channel>
<title>DCs RSS Feed</title><link>http://cavar.me/damir/index.html</link><description>some less relevant stuff...</description><dc:language>en</dc:language><dc:creator>dcavar@me.com</dc:creator><dc:rights>Copyright 2011-2012 Damir Cavar</dc:rights><dc:date>2013-04-14T23:33:15-04:00</dc:date><admin:generatorAgent rdf:resource="http://www.realmacsoftware.com/" />
<admin:errorReportsTo rdf:resource="mailto:dcavar@me.com" /><sy:updatePeriod>hourly</sy:updatePeriod>
<sy:updateFrequency>1</sy:updateFrequency>
<sy:updateBase>2000-01-01T12:00+00:00</sy:updateBase>
<lastBuildDate>Sun, 14 Apr 2013 23:36:01 -0400</lastBuildDate><item><title>Midwest Speech and Language Days 2013</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2013-04-14T23:33:15-04:00</dc:date><link>http://cavar.me/damir/blog/files/midwest-speech-and-language-days-2013.php#unique-entry-id-61</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/midwest-speech-and-language-days-2013.php#unique-entry-id-61</guid><content:encoded><![CDATA[The <a href="http://ttic.uchicago.edu/~kgimpel/MSLD2013.html" rel="external">Midwest Speech and Language Days 2013</a> at the Toyota Technological Institute at Chicago are happening on the 2nd and 3rd of May 2013.<br /><br />]]></content:encoded></item><item><title>Python 3 for Linguists at the LSA Summer Institute 2013 Course Material</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2013-04-02T00:25:07-04:00</dc:date><link>http://cavar.me/damir/blog/files/python-3-for-linguists-1.php#unique-entry-id-60</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/python-3-for-linguists-1.php#unique-entry-id-60</guid><content:encoded><![CDATA[The course material for the <a href="http://lsa2013.lsa.umich.edu/" rel="external">LSA Summer Institute 2013</a> course <em><a href="http://lsa2013.lsa.umich.edu/2012/05/python-3-for-linguists/" rel="external">Python 3 for Linguists</a></em> will be made available at:<br /><br /><a href="http://ltl.emich.edu/wiki/projects/pythonforlinguists/Python_for_Linguists.html" rel="external">Python for Linguists Wiki</a> (LTL, EMU)<br /><a href="https://dl.dropbox.com/u/11318112/Python34Ling/index.html" rel="external">Python 3 for Linguists</a> (Dropbox)<br /><br />There is a (currently not so full) <a href="https://github.com/dcavar/Py3L" rel="external">Github repository Py3L</a> with the (future) source code.<br /><br />We are using the <a href="http://www.activestate.com/komodo-edit" rel="external">Komodo Edit</a> 8.x (the free editor) and <a href="http://www.python.org/download/" rel="external">Python 3.3</a> in the course. We will be able to help you installing the necessary software components.<br />]]></content:encoded></item><item><title>AARDVARC Workshop May 2013</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2013-04-02T00:05:01-04:00</dc:date><link>http://cavar.me/damir/blog/files/AARDVARC-Workshop-EMU.php#unique-entry-id-59</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/AARDVARC-Workshop-EMU.php#unique-entry-id-59</guid><content:encoded><![CDATA[<a href="http://linguistlist.org/aardvarc/" rel="external">AARDVARC</a> - <a href="http://linguistlist.org/aardvarc/" rel="external">Automatically Annotated Repository of Digital Audio and Video Resources Community</a><br /><br /><a href="http://www.nsf.gov/awardsearch/showAward?AWD_ID=1244713" rel="external">NSF sponsored workshops</a> at ILIT/EMU and CUNY.<br /><br />]]></content:encoded></item><item><title>Moving projects and code to GitHub</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-11-25T12:14:28-05:00</dc:date><link>http://cavar.me/damir/blog/files/migrating-project-code-repos-1.php#unique-entry-id-57</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/migrating-project-code-repos-1.php#unique-entry-id-57</guid><content:encoded><![CDATA[I am moving code and project folders to <a href="http://github.com/" rel="external">GitHub</a>. I don&rsquo;t know, whether this is a good idea, it just turns out to be easier to use&hellip; :-)<br /><br />This port includes the <a href="http://github.com/dcavar/SNLTK" rel="external">SNLTK code</a>, all kinds of Python 3 projects, Java code, some of the C(++) code for FSTs and some NLP tasks, corpus and <a href="http://www.tei-c.org/index.xml" rel="external">TEI XML</a> utils. Some of that I limited to pull-only and push-access exclusively for collaborators. If you were involved in some of that, let me know, send me your GitHub-ID is and I can add you to the collaborators group of the particular repos.<br /><br />In particular, my course material will be migrated to GitHub completely. For example, the course material for the <a href="http://lsa2013.lsa.umich.edu/" rel="external">LSA Summer Institute course</a> in summer 2013 will be placed there:<br /><a href="http://github.com/dcavar/Py3L" rel="external">Python 3 for Linguists</a><br /><br />]]></content:encoded></item><item><title>Some old files about the Linguistics Program at the University of Zadar</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-10-25T15:29:58-04:00</dc:date><link>http://cavar.me/damir/blog/files/linguistics-ma-program-bologna-2008-university-of-zadar.php#unique-entry-id-56</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/linguistics-ma-program-bologna-2008-university-of-zadar.php#unique-entry-id-56</guid><content:encoded><![CDATA[Since I was asked many times about this MA program and the original text that went to the accreditation committee in Croatia (where we got one very nasty and absolutely irrelevant review, if I find it, I&rsquo;ll post it here; but also a very good and constructive review), here are the files, the Croatian and English text about the MA program in Linguistics that we submitted for accreditation within the Bologna system back in 2008 at the University of Zadar. I think, this is the corrected version. It was not the best possible program, developed under time pressure and in a very difficult situation, and its was building on the growing wave of computational linguistics, speech and language technology, as well as theoretical linguistics. We would do a lot of things differently nowadays. If you can use any of this for your inspiration or personal attempts to apply for a program or other support, let us know. I can forward you the editable version for some Office package.<br /><br />English version:<br /><ul class="disc"><li><a href="../resources/Linguistics-MA-EN-Uni-Zadar-2008-A.pdf" rel="external">Part 1</a></li><li><a href="../resources/Linguistics-MA-EN-Uni-Zadar-2008-B.pdf" rel="external">Part 2</a></li><li><a href="../resources/Linguistics-MA-EN-Uni-Zadar-2008-C.pdf" rel="external">Part 3</a></li></ul><br /><ul class="disc"><li>Croatian version:</li></ul><ul class="disc"><li><a href="../resources/Linguistics-MA-EN-Uni-Zadar-HR-2008-A.pdf" rel="external">Part 1</a></li><li><a href="../resources/Linguistics-MA-EN-Uni-Zadar-HR-2008-B.pdf" rel="external">Part 2</a></li><li><a href="../resources/Linguistics-MA-EN-Uni-Zadar-HR-2008-C.pdf" rel="external">Part 3</a></li></ul><br />]]></content:encoded></item><item><title>LibreOffice and TEI Stylesheets for file conversion</title><dc:creator>dcavar@me.com</dc:creator><category>Corpus Linguistics</category><dc:date>2012-10-17T23:47:18-04:00</dc:date><link>http://cavar.me/damir/blog/files/libreoffice-tei-stylesheets-file-format-conversion.php#unique-entry-id-55</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/libreoffice-tei-stylesheets-file-format-conversion.php#unique-entry-id-55</guid><content:encoded><![CDATA[If you want to batch convert a lot of files to some more accessible format (for example <a href="http://en.wikipedia.org/wiki/OpenDocument" rel="external">ODT</a> or <a href="http://en.wikipedia.org/wiki/Docx" rel="external">DOCX</a> to <a href="http://en.wikipedia.org/wiki/HTML" rel="external">HTML</a> or <a href="http://www.tei-c.org/" rel="external">TEI XML</a>), you can use first of all <a href="http://www.libreoffice.org/" rel="external">LibreOffice</a>.<br /><br />Here is a brief introduction how to batch convert files to some <a href="http://www.libreoffice.org/" rel="external">LibreOffice</a> output format or TEI XML.<br />]]></content:encoded></item><item><title>XFST: Python 3 script to convert prolog file to DOT-graph</title><dc:creator>dcavar@me.com</dc:creator><category>Computational Linguistics</category><dc:date>2012-10-16T11:38:46-04:00</dc:date><link>http://cavar.me/damir/blog/files/plg2dot-python-xfst-prolog-to-dot.php#unique-entry-id-54</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/plg2dot-python-xfst-prolog-to-dot.php#unique-entry-id-54</guid><content:encoded><![CDATA[If you write out a stack (or network) in <a href="http://www.fsmbook.com/" rel="external">XFST</a> to a prolog file:<br /><br /><span style="font:12px Courier, mono; ">write prolog > mymorph.plg</span><br /><br />and you want to convert it to <a href="http://www.graphviz.org/content/dot-language" rel="external">DOT</a> and visualize it in <a href="http://www.graphviz.org/" rel="external">Graphviz</a>, here is a <a href="http://www.python.org/" rel="external">Python 3.x</a> script to do so:<br /><br /><a href="http://cavar.me/damir/resources/plg2dot.py.zip" rel="external">Download zipped Python source</a><br /><a href="http://cavar.me/damir/resources/plg2dot.py.html" rel="external">View Python code</a><br /><br />]]></content:encoded></item><item><title>WSU talk: info on corpora and tech that will be discussed</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-10-07T09:26:39-04:00</dc:date><link>http://cavar.me/damir/blog/files/corpus-talk-WSU-2012-10.php#unique-entry-id-53</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/corpus-talk-WSU-2012-10.php#unique-entry-id-53</guid><content:encoded><![CDATA[I&rsquo;ll give a talk on corpora and relevant technologies at Wayne State University in Detroit on the 19<sup>th</sup> of October at 11 AM. Here are some links, papers and slides that might be interesting for colleagues and students to follow and post process:<br /><br />]]></content:encoded></item><item><title>Java programming sessions for the ILIT group</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-10-02T17:14:57-04:00</dc:date><link>http://cavar.me/damir/blog/files/java-meeting-ILIT-EMU.php#unique-entry-id-52</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/java-meeting-ILIT-EMU.php#unique-entry-id-52</guid><content:encoded><![CDATA[<span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; ">We are meeting Fridays at 9 AM in the Cooper building for Java programming.<br /><br />You might want to prepare your machine by installing:<br /><br />1. the Java SE 7u7 JDK:<br /></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://www.oracle.com/technetwork/java/javase/downloads/index.html" rel="external">http://www.oracle.com/technetwork/java/javase/downloads/index.html</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><br /><br />2. the NetBeans 7.2 IDE:<br /></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://netbeans.org/downloads/index.html" rel="external">http://netbeans.org/downloads/index.html</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><br /><br />and maybe reading some of the Java Tutorial:<br /></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://docs.oracle.com/javase/tutorial/index.html" rel="external">http://docs.oracle.com/javase/tutorial/index.html</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><br /><br /></span>]]></content:encoded></item><item><title>Endangered languages is up</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-06-21T12:49:35-04:00</dc:date><link>http://cavar.me/damir/blog/files/endangered-languages-site-up.php#unique-entry-id-51</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/endangered-languages-site-up.php#unique-entry-id-51</guid><content:encoded><![CDATA[<span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; ">The Endangered Languages site has been launched today:<br /><br /></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://www.endangeredlanguages.com/">http://www.endangeredlanguages.com/</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><br /><br /></span>]]></content:encoded></item><item><title>Clozure CL on Mac App Store</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-05-25T02:46:51-04:00</dc:date><link>http://cavar.me/damir/blog/files/Cluzure-CL-Mac-App-Store.php#unique-entry-id-50</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Cluzure-CL-Mac-App-Store.php#unique-entry-id-50</guid><content:encoded><![CDATA[<span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; ">Clozure CL, an open source and free implementation of Common Lisp for Mac is available on the App Store:<br /><br /></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://itunes.apple.com/us/app/clozure-cl/id489900618?mt=12" rel="external">http://itunes.apple.com/us/app/clozure-cl/id489900618?mt=12</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><br /></span>]]></content:encoded></item><item><title>Talk at the IDS 8th of May</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-05-07T05:55:15-04:00</dc:date><link>http://cavar.me/damir/blog/files/talk-at-ids-may-2012.php#unique-entry-id-49</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/talk-at-ids-may-2012.php#unique-entry-id-49</guid><content:encoded><![CDATA[<span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; ">Tomorrow, 8</span><sup>th</sup><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "> of May 2012, I will be </span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://www.ids-mannheim.de/aktuell/vortraege/2012/cavar.html" rel="self">presenting</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "> at the </span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://www.ids-mannheim.de/" rel="external">Institute of German Language in Mannheim</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; ">, and there is the last day of </span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://www.maimarkt.de/" rel="external">Maimarkt</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; ">&hellip; I might meet U there???<br /></span>]]></content:encoded></item><item><title>Course at LSA Institute 2013: Python 3 for Linguists</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-04-19T13:38:34-04:00</dc:date><link>http://cavar.me/damir/blog/files/Python-3-Py3k-for-Linguists-LSA-Institute-2013-University-of-Michigan.php#unique-entry-id-48</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Python-3-Py3k-for-Linguists-LSA-Institute-2013-University-of-Michigan.php#unique-entry-id-48</guid><content:encoded><![CDATA[<span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://cavar.me/malgosia/" rel="external">Malgosia</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "> and I will be teaching a course at the </span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://lsa2013.lsa.umich.edu/" rel="external">LSA Institute 2013</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "> at the </span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://www.umich.edu/" rel="external">University of Michigan</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "> in Ann Arbor: </span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://docs.python.org/py3k/" rel="external">Python 3</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "> for Linguists.<br /><br />Thanks to the </span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://lsa2013.lsa.umich.edu/about.html" rel="external">Institute Steering Committee</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "> for accepting our proposal!<br /></span>]]></content:encoded></item><item><title>Talk: Piotr Banski &#x22;TEI XML for Linguists&#x22;</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-04-18T21:20:49-04:00</dc:date><link>http://cavar.me/damir/blog/files/piotr-basnki-tei-xml-linguistics.php#unique-entry-id-47</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/piotr-basnki-tei-xml-linguistics.php#unique-entry-id-47</guid><content:encoded><![CDATA[<span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; ">Please join us for a talk by:<br />Dr. Piotr Banski (</span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://www.ids-mannheim.de/" rel="external">Institute for German Language</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; ">/</span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://www.ids-mannheim.de/" rel="external">Institut fuer Deutsche Sprache</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; ">, Mannheim, Germany)<br /><br />Title: "</span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="http://www.tei-c.org/index.xml" rel="external">TEI XML</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "> for Linguists"<br /><br />Time: Friday, April 20, 2012 at 2:00 pm<br />Location: Suite 104, Cooper Building, on the Eastern Michigan University campus (see </span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; "><a href="https://maps.google.com/maps?q=2000+Huron+River+Drive,+Ypsilanti+Township,+MI&hl=en&ll=42.25993,-83.645605&spn=0.003395,0.006968&sll=37.0625,-95.677068&sspn=58.598104,114.169922&oq=2000+Huron+River+&hnear=2000+N+Huron+River+Dr,+Ypsilanti+Township,+Michigan+48197&t=m&z=18" rel="external">Google maps</a></span><span style="font:12px &#39;Lucida Grande&#39;, LucidaGrande, Verdana, sans-serif; ">)<br /></span>]]></content:encoded></item><item><title>Talk: M. Cavar &#x22;On the influence of L1 on the L2 perception: The case of tenseness contrast in American vowels&#x22;</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-04-11T11:26:46-04:00</dc:date><link>http://cavar.me/damir/blog/files/talk-malgorzata-cavar-influence-L1-L2-perception-tenseness-contrast-american-vowels.php#unique-entry-id-46</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/talk-malgorzata-cavar-influence-L1-L2-perception-tenseness-contrast-american-vowels.php#unique-entry-id-46</guid><content:encoded><![CDATA[<strong>Date:</strong> April 13th, 2012<br /><strong>Time:</strong> 1:30 PM<br /><strong>Location:</strong> Cooper Building, Suite 104, EMU, 2000 Huron River Drive, Ypsilanti<br /><br /><strong>Directions:</strong>  Take Washtenaw heading east from Ann Arbor toward Ypsilanti.  Go past Hwy 23, turn left on Golfside, then turn right on Huron River Drive.  The Cooper Building will be on the left, across from Rynearson Stadium, and there is free parking right out front.  If you reach Superior St. you have gone too far.<br /><br /><strong>Title: On the influence of L1 on the L2 perception: The case of tenseness contrast in American vowels<br />Author: </strong><strong><a href="http://cavar.me/malgosia/" rel="external">Malgorzata E. Cavar</a></strong><strong><br /></strong><br /><strong>Abstract:</strong><br />One obvious difficulty in foreign language learning is the production of foreign sounds. What is less obvious is the fact that the perception of foreign categories by L2 learners differs from that of the native speakers and in itself might be and often is a hurdle in the acquisition of the phonetic/phonological system of the foreign language. In this talk, I will present the results of a series of experiments pertaining to the perception of the English vocalic contrast in high vowels by learners with different L1 backgrounds. The goal of this and similar studies is to determine how perceptual strategies of L2 learners differ from those of English native speakers and what these differences depend on. In the long run, the aim is to predict &ldquo;customized&rdquo; areas of difficulty for learners with different backgrounds and to help develop curricula and teaching aids that would actually respond to learners&rsquo; needs.<br />]]></content:encoded></item><item><title>Tokenization&#x2c; frequency profiles and N-gram models in Python 3</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-04-03T11:01:21-04:00</dc:date><link>http://cavar.me/damir/blog/files/tokenization-ngram-models-python3.php#unique-entry-id-45</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/tokenization-ngram-models-python3.php#unique-entry-id-45</guid><content:encoded><![CDATA[This is a brief description about how to use the Python 3 scripts to generate N-gram models for word tokens and characters from text. I expect you to have a Python 3 interpreter installed on your system.<br /><br />]]></content:encoded></item><item><title>The LINGUIST List corpus</title><dc:creator>dcavar@me.com</dc:creator><category>Corpus Linguistics</category><dc:date>2012-04-03T06:06:45-04:00</dc:date><link>http://cavar.me/damir/blog/files/linguist-list-corpus.php#unique-entry-id-44</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/linguist-list-corpus.php#unique-entry-id-44</guid><content:encoded><![CDATA[The LINGUIST List corpora can be found here:<br /><br /><a href="http://ltl.emich.edu/llc/" rel="external">http://ltl.emich.edu/llc/</a><br /><br />You can find in there the LINGUIST List mailings converted to TEI P5 XML. The linguistically annotated version will be available in an extended interface.<br /><br />See the previous blog for instructions on how to use <a href="http://cavar.me/damir/blog/files/philologic-corpus-introduction-part-1.php" rel="self" title="Blog:Working with the Philologic interface on the LTL corpora">Philologic</a>&hellip;<br /><br />]]></content:encoded></item><item><title>Working with the Philologic interface on the LTL corpora</title><dc:creator>dcavar@me.com</dc:creator><category>Corpus Linguistics</category><dc:date>2012-03-26T21:21:48-04:00</dc:date><link>http://cavar.me/damir/blog/files/philologic-corpus-introduction-part-1.php#unique-entry-id-41</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/philologic-corpus-introduction-part-1.php#unique-entry-id-41</guid><content:encoded><![CDATA[Here is a brief first introduction to the Philologic interface for the LTL corpora and the LINGUIST List corpus;<br /><br />]]></content:encoded></item><item><title>The LTL corpus</title><dc:creator>dcavar@me.com</dc:creator><category>Corpus Linguistics</category><dc:date>2012-03-08T12:39:55-05:00</dc:date><link>http://cavar.me/damir/blog/files/LTL-corpus-online.php#unique-entry-id-40</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/LTL-corpus-online.php#unique-entry-id-40</guid><content:encoded><![CDATA[The first version of the small <a href="http://ltl.emich.edu/ltlcorpus/" rel="external">LTL corpus</a> with a couple of million tokens is online. It contains <a href="http://www.tei-c.org/Guidelines/P5/index.xml" rel="external">TEI P5 XML encoded</a> books from the public domain. <a href="http://ltl.emich.edu/philologic/" rel="external">See here</a>&hellip;<br />]]></content:encoded></item><item><title>TEI online converter: OxGarage Converter</title><dc:creator>dcavar@me.com</dc:creator><category>Corpus Linguistics</category><dc:date>2012-03-08T12:35:05-05:00</dc:date><link>http://cavar.me/damir/blog/files/TEI-OxGarage-Converter.php#unique-entry-id-39</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/TEI-OxGarage-Converter.php#unique-entry-id-39</guid><content:encoded><![CDATA[The online <a href="http://www.tei-c.org/oxgarage/" rel="external">OxGarage Converter</a> on the TEI pages converts almost anything to something else, in particular to <a href="http://www.tei-c.org/Guidelines/P5/index.xml" rel="external">TEI XML</a>. This is obviously using the OpenOffice filters and converters in the backend as batch processors, as <a href="http://cavar.me/damir/blog/files/Conversion-to-TEI-using-OpenOffice.php" rel="self" title="Blog:TEI XML export in OpenOffice again...">described here for the manual conversion</a>.<br />]]></content:encoded></item><item><title>Lithuanian Morphology and LFG-Grammar...</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-03-05T19:17:30-05:00</dc:date><link>http://cavar.me/damir/blog/files/Lithuanian-Morphology.php#unique-entry-id-38</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Lithuanian-Morphology.php#unique-entry-id-38</guid><content:encoded><![CDATA[The poster for the <a href="http://dgfs.uni-frankfurt.de/dgfs/info_en.html" rel="external">DGfS annual meeting 2012</a> on a Lithuanian Morphology and LFG Grammar is done. This was the result of a grad course at the <a href="http://www.uni-konstanz.de/" rel="external">University of Konstanz</a> on rule-based natural language processing (using <a href="http://www.fsmbook.com/" rel="external">XFST</a> and <a href="http://www2.parc.com/isl/groups/nltt/xle/" rel="external">XLE</a>). I am proud of all the participants!<br /><a href="http://cavar.me/damir/resources/Litauisch_Poster_OhneRand.pdf" rel="external">Here is the poster</a>. You can <a href="http://ltl.emich.edu/ltm/" rel="external">test the morphology online</a>. The coverage will improve, this is based on the morpheme numbers from the poster, without generic morphological rules. The generator will be made available there too.<br />]]></content:encoded></item><item><title>LINGUIST List has a store on amazon.com</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-03-05T19:04:04-05:00</dc:date><link>http://cavar.me/damir/blog/files/LINGUIST-List-store-on-amazon.php#unique-entry-id-37</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/LINGUIST-List-store-on-amazon.php#unique-entry-id-37</guid><content:encoded><![CDATA[<a href="http://astore.amazon.com/linguistlist-20" rel="external">The LINGUIST List store on amazon.com</a>&hellip;<br />]]></content:encoded></item><item><title>LINGUIST List Fund Drive 2012 has started</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-02-27T10:58:14-05:00</dc:date><link>http://cavar.me/damir/blog/files/LINGUIST-List-Fund-Drive-2012.php#unique-entry-id-35</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/LINGUIST-List-Fund-Drive-2012.php#unique-entry-id-35</guid><content:encoded><![CDATA[Please consider supporting <a href="http://linguistlist.org/" rel="external">LINGUIST List</a>, just go to the <a href="http://linguistlist.org/" rel="external">Fund Drive 2012 pages</a> and donate!<br />]]></content:encoded></item><item><title>Text analyzed and parsed to TEI XML wrapper</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-02-24T21:23:24-05:00</dc:date><link>http://cavar.me/damir/blog/files/TEI-XML-parser-output-wrapper-txt2tei.php#unique-entry-id-34</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/TEI-XML-parser-output-wrapper-txt2tei.php#unique-entry-id-34</guid><content:encoded><![CDATA[I set up a simple testing page for a wrapper of raw text to <a href="http://www.tei-c.org/Guidelines/P5/" rel="external">TEI XML</a>.  It uses in this version just the <a href="http://nlp.stanford.edu/software/corenlp.shtml" rel="external">Stanford CoreNLP</a> tools to tokenize, recognize sentences, part of speech annotate and lemmatize the input. Just paste a paragraph of text in there. In the next version this will be expanded with NLP tools for a couple of more languages, as well as other analysis components and tools for English.<br /><br />]]></content:encoded></item><item><title>Charty in JavaScript...</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-02-23T11:02:17-05:00</dc:date><link>http://cavar.me/damir/blog/files/Charty-in-JavaScript.php#unique-entry-id-33</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Charty-in-JavaScript.php#unique-entry-id-33</guid><content:encoded><![CDATA[Ben Cool ported <a href="http://cavar.me/damir/charty/" rel="external">Charty</a> (CFG-based Chart parser) to JavaScript for a class project and added in one version feature augmentation and unification to it.  You can test it online.  This is running on mobile devices like iPad or iPhone in Safari and on Android with a browser that has JavaScript support without any server-based component.  See the <a href="http://ltl.emich.edu/links/jscharty/" rel="external">documentation and test site here</a>&hellip;<br /><br />]]></content:encoded></item><item><title>Stanford-CoreNLP corenlp.sh script on Mac OS X Lion</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-02-13T19:09:03-05:00</dc:date><link>http://cavar.me/damir/blog/files/Stanford-CoreNLP-script-on-Mac-OS-X-Lion.php#unique-entry-id-32</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Stanford-CoreNLP-script-on-Mac-OS-X-Lion.php#unique-entry-id-32</guid><content:encoded><![CDATA[To make the <a href="http://nlp.stanford.edu/software/corenlp.shtml" rel="external">Stanford CoreNLP</a> tools work on your Mac OS X 10.7.x (Lion) distribution with the included bash script do this...<br /><br />]]></content:encoded></item><item><title>LREC 2012 workshop on Challenges in the management of large corpora</title><dc:creator>dcavar@me.com</dc:creator><category>Call</category><dc:date>2012-02-13T15:15:59-05:00</dc:date><link>http://cavar.me/damir/blog/files/LREC-2012-Challenges-in-the-management-of-large-corpora.php#unique-entry-id-31</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/LREC-2012-Challenges-in-the-management-of-large-corpora.php#unique-entry-id-31</guid><content:encoded><![CDATA[You should really consider joining this <a href="http://www.lrec-conf.org/lrec2012/" rel="external">LREC 2012</a> workshop on <a href="http://corpora.ids-mannheim.de/cmlc.html" rel="external">Challenges in the management of large corpora</a>!<br /><br />]]></content:encoded></item><item><title>Changed Privacy Policy</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-02-13T14:02:42-05:00</dc:date><link>http://cavar.me/damir/blog/files/Changed_Privacy_Policy.php#unique-entry-id-30</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Changed_Privacy_Policy.php#unique-entry-id-30</guid><content:encoded><![CDATA[Since privacy policy changes seem to be all around now, here is one by me for the pages here:<br /><br />If you want to make your web-experience somewhat more private, and prevent me from being able to read out something from the apache log files about you, here are some hints about how you could configure your browser to reduce the amount of personal bits you leave on your way on this page or anywhere else on the web:<br /><br />]]></content:encoded></item><item><title>Language Technology Lab (LTL) up</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-02-07T13:54:13-05:00</dc:date><link>http://cavar.me/damir/blog/files/Language-Technology-Lab-LTL-up.php#unique-entry-id-29</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Language-Technology-Lab-LTL-up.php#unique-entry-id-29</guid><content:encoded><![CDATA[The <a href="http://ltl.emich.edu/" rel="external">Language Technology Lab</a> (<a href="http://ltl.emich.edu/" rel="external">LTL</a>) (<a href="http://linguistlist.org/ilit/" rel="external">ILIT</a> and <a href="http://www.emich.edu/" rel="external">EMU</a>) is up, check it out:<br /><br /><a href="http://ltl.emich.edu/" rel="external">http://ltl.emich.edu/</a><br /><br />More content to come in the next days and weeks&hellip; stay tuned!<br /><br />]]></content:encoded></item><item><title>Using Antconc: Notes 1</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-02-02T20:44:21-05:00</dc:date><link>http://cavar.me/damir/blog/files/Using-Antconc-Notes-1.php#unique-entry-id-28</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Using-Antconc-Notes-1.php#unique-entry-id-28</guid><content:encoded><![CDATA[Here is a short instruction on using Antconc for simple statistical analysis.<br /><br />]]></content:encoded></item><item><title>Dictionaries for Mac OS X</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-01-28T15:31:12-05:00</dc:date><link>http://cavar.me/damir/blog/files/Mac-Dictionary-Dicts.php#unique-entry-id-27</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Mac-Dictionary-Dicts.php#unique-entry-id-27</guid><content:encoded><![CDATA[Here are some of the dictionaries for the OS X Dictionary.app:<br /><br /><ul class="disc"><li><a href="http://lipflip.org/articles/dictcc-dictionary-plugin" rel="external">The dict.cc dictionary plugin English-German, German-English</a></li><li>Tekl.de <a href="http://www.tekl.de/deutsch/Lexikon-Plugins.html" rel="external">German Thesaurus and English-German dictionary</a></li></ul><br />]]></content:encoded></item><item><title>TikZ-dependency graph LaTeX library</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-01-23T13:18:18-05:00</dc:date><link>http://cavar.me/damir/blog/files/Latex-Dependency-Graph.php#unique-entry-id-25</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Latex-Dependency-Graph.php#unique-entry-id-25</guid><content:encoded><![CDATA[The TikZ-dependency graph library for LaTeX <a href="http://sourceforge.net/projects/tikz-dependency/" rel="external">can be found here</a>&hellip;<br />]]></content:encoded></item><item><title>Online tool for IPA transcription</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-01-23T11:27:49-05:00</dc:date><link>http://cavar.me/damir/blog/files/Online-tool-for-IPA-transcription.php#unique-entry-id-24</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Online-tool-for-IPA-transcription.php#unique-entry-id-24</guid><content:encoded><![CDATA[Here is an online tool for IPA transcription, <a href="http://www.i2speak.com/" rel="external">i2speak</a>:<br /><br /><a href="http://www.i2speak.com/" rel="external">http://www.i2speak.com/</a><br />]]></content:encoded></item><item><title>just restored the pages from backups...</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-01-18T14:14:19-05:00</dc:date><link>http://cavar.me/damir/blog/files/SLS2009-JSSECL2006-CLS2010-CPALA2005.php#unique-entry-id-23</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/SLS2009-JSSECL2006-CLS2010-CPALA2005.php#unique-entry-id-23</guid><content:encoded><![CDATA[I just restored a bunch of web pages of summer schools and workshops. Some had interesting material on them, in particular pictures. Check out the JSSECL 2006 event&hellip;<br /><ul class="disc"><li><a href="http://cavar.me/sls2009/" rel="external">Fourth Annual Meeting of the Slavic Linguistic Society SLS 2009</a></li><li><a href="http://cavar.me/jsseclws2006/" rel="external">Student Conference on Empirical and Computational Linguistics</a> (<a href="http://cavar.me/jsseclws2006/" rel="external">JSSECL WS CECL 2006</a>), Zadar, Croatia</li><li><a href="http://cavar.me/cpala05/index.html" rel="external">Workshop on Computational Modeling of Lexical Acquisition</a> (<a href="http://cavar.me/cpala05/index.html" rel="external">CPALA 2005</a>), Split, Croatia</li><li><a href="http://www.cavar.me/damir/BOOT-LA/" rel="external">BOOT-LA</a> workshop at Indiana University</li></ul><ul class="disc"><li><a href="http://cavar.me/cls2010/" rel="external">Computational Linguistics Summer School 2010 at the University of Zadar</a> (<a href="http://cavar.me/cls2010/" rel="external">CLS2010</a>) (see on <a href="https://www.facebook.com/group.php?gid=130289183661223" rel="external">Facebook</a>)</li><li><a href="http://cavar.me/jssecl2006/" rel="external">Jadertina Summer School in Empirical and Computational Linguistics</a> (<a href="http://cavar.me/jssecl2006/" rel="external">JSSECL 2006</a>)</li></ul><br /><br />]]></content:encoded></item><item><title>the linguistic Wolfram Demonstrations Projects</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-01-16T16:09:04-05:00</dc:date><link>http://cavar.me/damir/blog/files/the-linguistic-Wolfram-Demonstrations-Projects.php#unique-entry-id-22</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/the-linguistic-Wolfram-Demonstrations-Projects.php#unique-entry-id-22</guid><content:encoded><![CDATA[Check out these demonstrations from the Wolfram Demonstrations Project:<br /><ul class="disc"><li><a href="http://demonstrations.wolfram.com/CollocationByChiSquare/" rel="external">Collocation by Chi Square</a></li><li><a href="http://demonstrations.wolfram.com/CollocationBySymmetricConditionalProbability/" rel="external">Collocation by Symmetric Conditional Probability</a></li></ul><ul class="disc"><li><a href="http://demonstrations.wolfram.com/MultilanguageWordLengths/" rel="external">Multilanguage Word Lengths</a></li></ul><ul class="disc"><li><a href="http://demonstrations.wolfram.com/ZipfsLawAppliedToWordAndLetterFrequencies/" rel="external">Zipf's Law Applied to Word and Letter Frequencies</a></li></ul><ul class="disc"><li>and all the other <a href="http://demonstrations.wolfram.com/topic.html?topic=Linguistics&limit=20" rel="external">Linguistic Demonstrations there</a>...</li></ul><br />]]></content:encoded></item><item><title>C-FASL 2012&#x2c; you should join it...</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2012-01-15T01:51:32-05:00</dc:date><link>http://cavar.me/damir/blog/files/c9fd4a3a83a9541049b174bcdc936184-21.php#unique-entry-id-21</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/c9fd4a3a83a9541049b174bcdc936184-21.php#unique-entry-id-21</guid><content:encoded><![CDATA[You should submit a paper to <a href="http://cl.indiana.edu/~cfasl/" rel="external">Computational Formal Approaches to Slavic Languages</a> (<a href="http://cl.indiana.edu/~cfasl/" rel="external">C-FASL</a>) 2012:<br /><br /><a href="http://cl.indiana.edu/~cfasl/" rel="external">http://cl.indiana.edu/~cfasl/</a><br /><br />]]></content:encoded></item><item><title>Computational Approaches to Slavic Languages 2012</title><dc:creator>dcavar@me.com</dc:creator><category>Call</category><dc:date>2011-12-23T01:33:09-05:00</dc:date><link>http://cavar.me/damir/blog/files/a80907be412af8968b7a12406c0ceb95-20.php#unique-entry-id-20</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/a80907be412af8968b7a12406c0ceb95-20.php#unique-entry-id-20</guid><content:encoded><![CDATA[<a href="http://cl.indiana.edu/~cfasl/" rel="external">Computational Formal Approaches to Slavic Languages</a> 2012<br />Slavic Computational Linguistics: <a href="http://cl.indiana.edu/~cfasl/" rel="external">Computational Formal Approaches to Slavic Languages</a> (10-11 May 2012, Bloomington, Indiana); Co-located with: Formal Approaches to Slavic Linguistics (FASL 21), 11-13 May 2012 and the Workshop in Slavic Linguistics, 14-17 May 2012.<br />]]></content:encoded></item><item><title>Scheme and Racket implementation of a parser</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2011-12-02T23:38:55-05:00</dc:date><link>http://cavar.me/damir/blog/files/Scheme-Chart-Parser-Charty-SNLTK.php#unique-entry-id-19</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Scheme-Chart-Parser-Charty-SNLTK.php#unique-entry-id-19</guid><content:encoded><![CDATA[The GUI-based <a href="http://www.cavar.me/damir/charty/" rel="external">Charty</a> implementation (agenda-based chart parser for CFGs) is finally available on the <a href="http://www.snltk.org/examples/guicharty/index.html" rel="external">SNLTK pages</a>.<br />]]></content:encoded></item><item><title>Scheme and Racket meeting at ILIT (Cooper building)</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2011-11-30T12:00:21-05:00</dc:date><link>http://cavar.me/damir/blog/files/Scheme-Racket-Meeting-EMU.php#unique-entry-id-18</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Scheme-Racket-Meeting-EMU.php#unique-entry-id-18</guid><content:encoded><![CDATA[The Schemers at EMU meet on Thursday 31st of Nov. at 3 PM Eastern Time in the Cooper building for an initial 1.5 hours intro and coordination meeting.<br /><br />If you would like to participate, bring your computational hardware with <a href="http://racket-lang.org/" rel="external">DrRacket</a> with you, and maybe have a look at my previous blog entry and also the <a href="http://www.snltk.org/" rel="external">Scheme Natural Language Toolkit</a> (<a href="http://www.snltk.org/" rel="external">SNLTK</a>).<br /><br />We hope that some others will join us.  I can open up a communication channel, <a href="http://www.skype.com/" rel="external">Skype</a> and Desktop Sharing, maybe soon we can have collaborative editing going (using maybe <a href="http://www.eclipse.org/" rel="external">Eclipse</a> (Did anybody test <a href="http://wiki.eclipse.org/ECF" rel="external">ECF</a> and the <a href="http://wiki.eclipse.org/DocShare_Plugin" rel="external">DocShare</a> component in it?)).  Just let me know, if you would be interested in joining this session.<br />]]></content:encoded></item><item><title>Some DrRacket videos...</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2011-11-18T23:01:20-05:00</dc:date><link>http://cavar.me/damir/blog/files/DrRacket-Scheme-Videos.php#unique-entry-id-17</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/DrRacket-Scheme-Videos.php#unique-entry-id-17</guid><content:encoded><![CDATA[Here are some introductory video clips for DrRacket:<br /><br /><a href="http://www.youtube.com/playlist?list=PLD0EB7BC8D7CF739A" rel="external">http://www.youtube.com/playlist?list=PLD0EB7BC8D7CF739A</a><br /><br />Thanks to John Clements.<br /><br />DC<br />]]></content:encoded></item><item><title>Intensive Python class for Linguists (for corpuslinguistics&#x2c; language data processing and manipulation etc.)</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2011-11-16T17:55:22-05:00</dc:date><link>http://cavar.me/damir/blog/files/Python-3-Py3k-for-Linguists.php#unique-entry-id-16</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Python-3-Py3k-for-Linguists.php#unique-entry-id-16</guid><content:encoded><![CDATA[I am offering an intensive class for the LING519 students, all the Linguist List people, and whoever might be interested, this Saturday 19th of Nov. 2011 at 10 AM Eastern Time in Cooper, the LinguistList Suite.  We plan to meet for 4 hours or more, depending on speed and interest.  Let me know, if you are interested.  If you want to join us, let me know.  I will share the screen and the audio already with Zadar, we can include you, if you cannot come.  The topics covered might be:<br /><br />Intro to <a href="http://python.org/download/releases/3.2.2/" rel="external">Python 3</a><br />Using <a href="http://www.activestate.com/komodo-edit" rel="external">Komodo Edit 6.x</a><br />Processing corpora like the <a href="http://en.wikipedia.org/wiki/Brown_Corpus" rel="external">Brown corpus</a> (raw text with slash-pos, or TEI XML), the <a href="http://en.wikipedia.org/wiki/Treebank" rel="external">Penn Treebank</a>, the <a href="http://en.wikipedia.org/wiki/Croatian_Language_Corpus" rel="external">Croatian Language Corpus</a> etc.<br />Generating statistical models and profiles: frequency profiles, <a href="http://en.wikipedia.org/wiki/N-gram" rel="external">N-gram models</a><br />Calculating significance, mutual information, relative entropy, &hellip;<br />Simple Finite State Machines<br />Simple Parsers<br />Generating outputs of analyses: CSV, HTML, XML, etc.<br />&hellip;<br /><br />DC<br />]]></content:encoded></item><item><title>Building the Google V8 JavaScript engine as a Shell interpreter for Mac OS X</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2011-11-04T23:19:48-04:00</dc:date><link>http://cavar.me/damir/blog/files/Google-V8-JavaScript-Mac.php#unique-entry-id-15</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Google-V8-JavaScript-Mac.php#unique-entry-id-15</guid><content:encoded><![CDATA[Here is an instruction for building the Google V8 JavaScript engine on Mac OS X as a shell tool for testing:<br /><br /><a href="http://kourge.net/node/123">http://kourge.net/node/123</a><br /><br />Just keep in mind, when you want to build it for Mac OS X Lion, the SCons call should be:<br /><br />scons arch=x64<br /><br />The rest of the build instructions goes unchanged. Also for the comment at the bottom of the page above, you would add:<br /><br />scons arch=x64 sample=shell<br /><br />and get a &ldquo;shell&rdquo; binary, which is the JavaScript standalone engine.<br /><br />If you build the v8 binary, you can copy it for example to /usr/local/bin, to make it available in general.<br /><br />]]></content:encoded></item><item><title>The Schemers become active again...</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2011-10-31T16:03:30-04:00</dc:date><link>http://cavar.me/damir/blog/files/Schemers-Michigan.php#unique-entry-id-14</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Schemers-Michigan.php#unique-entry-id-14</guid><content:encoded><![CDATA[The Schemers and Racketeers are meeting again, join us, see the <a href="http://www.snltk.org/" rel="external">SNLTK pages</a>&hellip;<br /><br />]]></content:encoded></item><item><title>Ilse Lehiste Memorial Symposium: Melody and Meter</title><dc:creator>dcavar@me.com</dc:creator><category>Talk</category><dc:date>2011-10-20T14:10:27-04:00</dc:date><link>http://cavar.me/damir/blog/files/Ilse-Lehiste-Memorial-Symposium.php#unique-entry-id-13</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Ilse-Lehiste-Memorial-Symposium.php#unique-entry-id-13</guid><content:encoded><![CDATA[I&rsquo;ll be at the <a href="http://www.ling.ohio-state.edu/LehisteSymposium/program.html" rel="external">Ilse Lehiste Memorial Symposium: Melody and Meter</a> at the Ohio State University on the 11th of November 2011.<br />]]></content:encoded></item><item><title>SNLTK</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2011-10-20T14:09:43-04:00</dc:date><link>http://cavar.me/damir/blog/files/Scheme-Natural-Language-Toolkit-SNLTK.php#unique-entry-id-12</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Scheme-Natural-Language-Toolkit-SNLTK.php#unique-entry-id-12</guid><content:encoded><![CDATA[There is an update to be expected on the <a href="http://www.snltk.org/" rel="external">Scheme Natural Language Toolkit</a> (<a href="http://www.snltk.org/" rel="external">SNLTK</a>) (and there is soon an <a href="http://www.r7rs.org/" rel="external">update of Scheme coming</a> as well), and the <a href="http://www.snltk.org/" rel="external">SNLTK</a> is also being ported to common <a href="http://racket-lang.org/" rel="external">Racket</a>.<br />]]></content:encoded></item><item><title>ELS 2012</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2011-10-20T14:08:24-04:00</dc:date><link>http://cavar.me/damir/blog/files/European-Lisp-Symposium-2012-University-of-Zadar.php#unique-entry-id-11</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/European-Lisp-Symposium-2012-University-of-Zadar.php#unique-entry-id-11</guid><content:encoded><![CDATA[The <a href="http://ozk.unizd.hr/els2012/" rel="external">European List Symposium</a> in 2012 will be organized at the <a href="http://www.unizd.hr/" rel="external">University of Zadar</a>, and I am on the organizing committee, and participating as well.  Stay tuned&hellip;<br />]]></content:encoded></item><item><title>It took a while...</title><dc:creator>dcavar@me.com</dc:creator><category>Info</category><dc:date>2011-10-20T14:00:36-04:00</dc:date><link>http://cavar.me/damir/blog/files/ef231d0ad205fd1eb8a6d48a870bcf7b-10.php#unique-entry-id-10</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/ef231d0ad205fd1eb8a6d48a870bcf7b-10.php#unique-entry-id-10</guid><content:encoded><![CDATA[to settle down in Ann Arbor and start teaching at EMU, but now we will get back to the project work&hellip;<br />]]></content:encoded></item><item><title>Updated Python code and tools</title><dc:creator>dcavar@me.com</dc:creator><category>Computational Linguistics</category><dc:date>2011-06-21T06:39:19-04:00</dc:date><link>http://cavar.me/damir/blog/files/Python-Chart-Parser-TextStat.php#unique-entry-id-9</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Python-Chart-Parser-TextStat.php#unique-entry-id-9</guid><content:encoded><![CDATA[The <a href="http://www.cavar.me/damir/charty/" rel="external">Charty parser code</a> is updated to Python 3.x (implementing an Earley parser for context-free grammars), and a compact module, <a href="http://www.cavar.me/damir/textstat/" rel="external">TextStat.py</a>, with some useful functions for N-gram models, frequency profiles, vector space models, statistical analyses, information theoretic measures (entropy, mutual information, etc.). If you have comments, or you find some bug or error, let me know.<br />]]></content:encoded></item><item><title>Yet another comment related to Lexc&#x2c; XFST and compilation</title><dc:creator>dcavar@me.com</dc:creator><category>Computational Linguistics</category><dc:date>2011-06-12T18:00:45-04:00</dc:date><link>http://cavar.me/damir/blog/files/Lexc-XFST-Foma.php#unique-entry-id-8</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Lexc-XFST-Foma.php#unique-entry-id-8</guid><content:encoded><![CDATA[You can use <a href="http://www.ling.helsinki.fi/kieliteknologia/tutkimus/hfst/" rel="external">Helsinki Finite-State Transducer Technology HFST3</a> and Foma to compile XFST or Lexc defined morphologies and transducers&hellip;<br />]]></content:encoded></item><item><title>Setting up Aquamacs for XLE and XFST</title><dc:creator>dcavar@me.com</dc:creator><category>Computational Linguistics</category><dc:date>2011-05-13T03:34:48-04:00</dc:date><link>http://cavar.me/damir/blog/files/aquameacs-for-xle-and-xfst.php#unique-entry-id-7</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/aquameacs-for-xle-and-xfst.php#unique-entry-id-7</guid><content:encoded><![CDATA[Here is a small introduction about my working environment setting for grammar and morphology development using Aquamacs, XLE, XFST and just scripting with Python and Bash in the OS X Terminal.app...<br />]]></content:encoded></item><item><title>Drawing syntactic trees...</title><dc:creator>dcavar@me.com</dc:creator><category>Syntax</category><dc:date>2012-03-21T14:39:41-04:00</dc:date><link>http://cavar.me/damir/blog/files/drawing-syntactic-trees.php#unique-entry-id-5</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/drawing-syntactic-trees.php#unique-entry-id-5</guid><content:encoded><![CDATA[I have been asked by many students and colleagues, how to generate nice looking trees for presentations, assignments, papers etc. Here is a small summary of tools I have tried or seen.<br /><br />If you want to generate a graph of a syntactic relation, a syntactic tree, there are various ways to do that, without manually drawing it on paper and scanning the manual work... here is a small summary of ways and tools for generating syntactic trees...<br />]]></content:encoded></item><item><title>SNLTK at the ELS2011 in Hamburg</title><dc:creator>dcavar@me.com</dc:creator><category>Computational Linguistics</category><dc:date>2011-03-09T07:13:30-05:00</dc:date><link>http://cavar.me/damir/blog/files/snltk-at-els-2011-hamburg.php#unique-entry-id-4</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/snltk-at-els-2011-hamburg.php#unique-entry-id-4</guid><content:encoded><![CDATA[We will be at the 4th European Lisp Symposium with the <a href="http://www.snltk.org/" rel="external">SNLTK</a> end of March 2011...<br />]]></content:encoded></item><item><title>Apple Mail&#x2c; Snow Leopard and GnuPG (GPG)</title><dc:creator>dcavar@me.com</dc:creator><category>GnuPG</category><dc:date>2011-02-27T11:24:48-05:00</dc:date><link>http://cavar.me/damir/blog/files/Apple-Mail-GPG-Tools.php#unique-entry-id-3</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Apple-Mail-GPG-Tools.php#unique-entry-id-3</guid><content:encoded><![CDATA[If you want to set up GnuPG for encryption of files and emails on Mac OS X 10.6 here are the links, very briefly...<br />]]></content:encoded></item><item><title>Summer School&#x2c; Round Table and Workshop in Computational Linguistics&#x2c; Cognitive Science&#x2c; Machine Learning</title><dc:creator>dcavar@me.com</dc:creator><category>Computational Linguistics</category><dc:date>2010-07-05T06:14:31-04:00</dc:date><link>http://cavar.me/damir/blog/files/Summer-School-Round-Table-Workshop-Computational-Linguistics-Cognitive-Science-Machine-Learning.php#unique-entry-id-2</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Summer-School-Round-Table-Workshop-Computational-Linguistics-Cognitive-Science-Machine-Learning.php#unique-entry-id-2</guid><content:encoded><![CDATA[From the 22nd of August till the 3rd of September at the University of Zadar there is a summer school, workshop, round table on computational linguistics, cognitive science, and computer science related to language...<br />]]></content:encoded></item><item><title>TEI XML export in OpenOffice again...</title><dc:creator>dcavar@me.com</dc:creator><category>TEI</category><dc:date>2010-07-29T07:45:55-04:00</dc:date><link>http://cavar.me/damir/blog/files/Conversion-to-TEI-using-OpenOffice.php#unique-entry-id-1</link><guid isPermaLink="true">http://cavar.me/damir/blog/files/Conversion-to-TEI-using-OpenOffice.php#unique-entry-id-1</guid><content:encoded><![CDATA[Since the course pages went away somewhere, here again a summary of how to export some document in for example the Word, <a href="http://www.openoffice.org/" rel="external">OpenOffice</a> (<a href="http://www.libreoffice.org/" rel="external">LibreOffice</a>, <a href="http://www.neooffice.org/" rel="external">NeoOffice</a>), RTF or other type of document quickly to <a href="http://www.tei-c.org/Guidelines/P5/" rel="external">TEI XML P5</a>.<br />]]></content:encoded></item></channel>
</rss>