Midwest Speech and Language Days 2013

The Midwest Speech and Language Days 2013 at the Toyota Technological Institute at Chicago are happening on the 2nd and 3rd of May 2013.

Read More...

Python 3 for Linguists at the LSA Summer Institute 2013 Course Material

The course material for the LSA Summer Institute 2013 course Python 3 for Linguists will be made available at:

Python for Linguists Wiki (LTL, EMU)
Python 3 for Linguists (Dropbox)

There is a (currently not so full) Github repository Py3L with the (future) source code.

We are using the Komodo Edit 8.x (the free editor) and Python 3.3 in the course. We will be able to help you installing the necessary software components.
Read More...

AARDVARC Workshop May 2013

AARDVARC - Automatically Annotated Repository of Digital Audio and Video Resources Community

NSF sponsored workshops at ILIT/EMU and CUNY.

Read More...

Moving projects and code to GitHub

I am moving code and project folders to GitHub. I don’t know, whether this is a good idea, it just turns out to be easier to use… Happy

This port includes the SNLTK code, all kinds of Python 3 projects, Java code, some of the C(++) code for FSTs and some NLP tasks, corpus and TEI XML utils. Some of that I limited to pull-only and push-access exclusively for collaborators. If you were involved in some of that, let me know, send me your GitHub-ID is and I can add you to the collaborators group of the particular repos.

In particular, my course material will be migrated to GitHub completely. For example, the course material for the LSA Summer Institute course in summer 2013 will be placed there:
Python 3 for Linguists

Read More...

Some old files about the Linguistics Program at the University of Zadar

Since I was asked many times about this MA program and the original text that went to the accreditation committee in Croatia (where we got one very nasty and absolutely irrelevant review, if I find it, I’ll post it here; but also a very good and constructive review), here are the files, the Croatian and English text about the MA program in Linguistics that we submitted for accreditation within the Bologna system back in 2008 at the University of Zadar. I think, this is the corrected version. It was not the best possible program, developed under time pressure and in a very difficult situation, and its was building on the growing wave of computational linguistics, speech and language technology, as well as theoretical linguistics. We would do a lot of things differently nowadays. If you can use any of this for your inspiration or personal attempts to apply for a program or other support, let us know. I can forward you the editable version for some Office package.

English version:

  • Croatian version:

Read More...

LibreOffice and TEI Stylesheets for file conversion

If you want to batch convert a lot of files to some more accessible format (for example ODT or DOCX to HTML or TEI XML), you can use first of all LibreOffice.

Here is a brief introduction how to batch convert files to some LibreOffice output format or TEI XML.
Read More...

XFST: Python 3 script to convert prolog file to DOT-graph

If you write out a stack (or network) in XFST to a prolog file:

write prolog > mymorph.plg

and you want to convert it to DOT and visualize it in Graphviz, here is a Python 3.x script to do so:

Download zipped Python source
View Python code

Read More...

WSU talk: info on corpora and tech that will be discussed

I’ll give a talk on corpora and relevant technologies at Wayne State University in Detroit on the 19th of October at 11 AM. Here are some links, papers and slides that might be interesting for colleagues and students to follow and post process:

Read More...

Java programming sessions for the ILIT group

We are meeting Fridays at 9 AM in the Cooper building for Java programming.

You might want to prepare your machine by installing:

1. the Java SE 7u7 JDK:
http://www.oracle.com/technetwork/java/javase/downloads/index.html

2. the NetBeans 7.2 IDE:
http://netbeans.org/downloads/index.html

and maybe reading some of the Java Tutorial:
http://docs.oracle.com/javase/tutorial/index.html

Read More...

Endangered languages is up

The Endangered Languages site has been launched today:

http://www.endangeredlanguages.com/

Read More...

Clozure CL on Mac App Store

Clozure CL, an open source and free implementation of Common Lisp for Mac is available on the App Store:

http://itunes.apple.com/us/app/clozure-cl/id489900618?mt=12
Read More...

Talk at the IDS 8th of May

Tomorrow, 8th of May 2012, I will be presenting at the Institute of German Language in Mannheim, and there is the last day of Maimarkt… I might meet U there???
Read More...

Course at LSA Institute 2013: Python 3 for Linguists

Malgosia and I will be teaching a course at the LSA Institute 2013 at the University of Michigan in Ann Arbor: Python 3 for Linguists.

Thanks to the
Institute Steering Committee for accepting our proposal!
Read More...

Talk: Piotr Banski "TEI XML for Linguists"

Please join us for a talk by:
Dr. Piotr Banski (
Institute for German Language/Institut fuer Deutsche Sprache, Mannheim, Germany)

Title: "
TEI XML for Linguists"

Time: Friday, April 20, 2012 at 2:00 pm
Location: Suite 104, Cooper Building, on the Eastern Michigan University campus (see
Google maps)
Read More...

Talk: M. Cavar "On the influence of L1 on the L2 perception: The case of tenseness contrast in American vowels"

Date: April 13th, 2012
Time: 1:30 PM
Location: Cooper Building, Suite 104, EMU, 2000 Huron River Drive, Ypsilanti

Directions: Take Washtenaw heading east from Ann Arbor toward Ypsilanti. Go past Hwy 23, turn left on Golfside, then turn right on Huron River Drive. The Cooper Building will be on the left, across from Rynearson Stadium, and there is free parking right out front. If you reach Superior St. you have gone too far.

Title: On the influence of L1 on the L2 perception: The case of tenseness contrast in American vowels
Author:
Malgorzata E. Cavar

Abstract:
One obvious difficulty in foreign language learning is the production of foreign sounds. What is less obvious is the fact that the perception of foreign categories by L2 learners differs from that of the native speakers and in itself might be and often is a hurdle in the acquisition of the phonetic/phonological system of the foreign language. In this talk, I will present the results of a series of experiments pertaining to the perception of the English vocalic contrast in high vowels by learners with different L1 backgrounds. The goal of this and similar studies is to determine how perceptual strategies of L2 learners differ from those of English native speakers and what these differences depend on. In the long run, the aim is to predict “customized” areas of difficulty for learners with different backgrounds and to help develop curricula and teaching aids that would actually respond to learners’ needs.
Read More...