Lithuanian Morphology and LFG-Grammar...

The poster for the DGfS annual meeting 2012 on a Lithuanian Morphology and LFG Grammar is done. This was the result of a grad course at the University of Konstanz on rule-based natural language processing (using XFST and XLE). I am proud of all the participants!
Here is the poster. You can test the morphology online. The coverage will improve, this is based on the morpheme numbers from the poster, without generic morphological rules. The generator will be made available there too.
Read More...

Text analyzed and parsed to TEI XML wrapper

I set up a simple testing page for a wrapper of raw text to TEI XML. It uses in this version just the Stanford CoreNLP tools to tokenize, recognize sentences, part of speech annotate and lemmatize the input. Just paste a paragraph of text in there. In the next version this will be expanded with NLP tools for a couple of more languages, as well as other analysis components and tools for English.

Read More...

Charty in JavaScript...

Ben Cool ported Charty (CFG-based Chart parser) to JavaScript for a class project and added in one version feature augmentation and unification to it. You can test it online. This is running on mobile devices like iPad or iPhone in Safari and on Android with a browser that has JavaScript support without any server-based component. See the documentation and test site here

Read More...

Scheme and Racket implementation of a parser

The GUI-based Charty implementation (agenda-based chart parser for CFGs) is finally available on the SNLTK pages.
Read More...

Updated Python code and tools

The Charty parser code is updated to Python 3.x (implementing an Earley parser for context-free grammars), and a compact module, TextStat.py, with some useful functions for N-gram models, frequency profiles, vector space models, statistical analyses, information theoretic measures (entropy, mutual information, etc.). If you have comments, or you find some bug or error, let me know.
Read More...

Setting up Aquamacs for XLE and XFST

Here is a small introduction about my working environment setting for grammar and morphology development using Aquamacs, XLE, XFST and just scripting with Python and Bash in the OS X Terminal.app...
Read More...

Drawing syntactic trees...

I have been asked by many students and colleagues, how to generate nice looking trees for presentations, assignments, papers etc. Here is a small summary of tools I have tried or seen.

If you want to generate a graph of a syntactic relation, a syntactic tree, there are various ways to do that, without manually drawing it on paper and scanning the manual work... here is a small summary of ways and tools for generating syntactic trees...
Read More...