On Recursion. Cognitive Science Colloquium, Indiana University, Nov. 16th 2016, Bloomington, Indiana.
On Split Islands. 11th Slavic Linguistic Society Annual Meeting. Sept. 24th 2016. Toronto, Canada.
Generating speech tools and corpora for unwritten endangered languages: Chatino. Paper presented at the Tools & Methods Summit, University of Melbourne, Australia, 1st-3rd of June, 2016. See also program.
Bridging between Documentary Linguistics, Computational Linguistics, Theoretical Linguistics, and NLP. Paper presented at the Tools & Methods Summit, University of Melbourne, Australia, 1st-3rd of June, 2016. See also program.
On the Role of Linked Open Language Data for Language Documentation, Linguistic Research, and Speech and Language Technologies. Keynote presentation at the LDL-2016, 5th Workshop on Linked Data in Linguistics: Managing, Building and Using Linked Language Resources, LREC 2016, Portorož, Slovenia. May 2016.
Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR. LREC 2016, Portorož, Slovenia. Together with Małgorzata E. Ćavar and Hilaria Cruz. May 2016. See also: GORILLA
Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project. LREC 2016, Portorož, Slovenia. Together with Małgorzata E. Ćavar, Dov-Ber Kerler, Anya Quilitsch. May 2016. See also: AHEYM, GORILLA
The Free Linguistic Environment. Natural Language Processing Seminar, Linguistic Engineering Group at the Institute of Computer Science, Polish Academy of Sciences (ICS PAS), Warsaw, Poland. May 23rd 2016. See also: FLE.
Bootstrapping resources for under-resourced languages: Speech corpora and technologies for the languages of the Balkans. 20th Biennial Conference on Balkan and South Slavic Linguistics. University of Utah, April 2016. See also: GORILLA
The General Ontology for Linguistic Description (GOLD)and its Role for Digital Language Resources and Language Technology. Workshop “Development of Linguistic Linked Open Data (LLOD) Resources for Collaborative Data-Intensive Research in the Language Sciences“, LSA Summer Institute 2015, Chicago, IL. Together with Malgorzata E. Cavar (Indiana). Invited presentation. July 2015. See also: GOLD.
Documenting Endangered Languages by Developing Speech-Corpora and ASRs: Crow, Hidatsa, Mandan. Together with Malgorzata E. Cavar (Indiana), Wilhelm Meya (TLC), John Boyle (CSU Fresno). Midwest Speech and Language Days (MSLD) 2015, Toyota Technological Institute at Chicago.
Enabling Resources in Digital Language Archives using Automatic Speech Recognition. Together with Malgorzata E. Cavar, Lwin Moe, Aaron Albin. Midwest Speech and Language Days (MSLD) 2015, Toyota Technological Institute at Chicago.
Automatically Annotated Repository of Digital Video and Audio Resources Community (AARDVARC). Presentation at the ClingDing meeting, Department of Linguistics, Indiana University, together with Malgosia E. Cavar, July 2014. See also: AARDVARC
Morphological grammars and computational analyzer/generators for the documentation of indigenous/endangered languages of the world. Poster presented at the Morphology Fest at Indiana University, Department of Linguistics, together with U. Kazagashewa, M. Mueller, Malgosia E. Cavar, A. Lamont, S. Fox June 20th, 2014.
Technologies for Bootstrapping of Linguistically Annotated Text Collections with TEI XML Markup. Paper presented at the University of North Texas, Computer Science and Linguistics, together with Malgosia E. Cavar, February 2014.
The LINGUIST List Now and Then. Paper presented at the University of North Texas, Computer Science and Linguistics, together with Malgosia E. Cavar, February 2014.
Online visualization of research in historical linguistics. Poster presented at the LSA Annual Meeting 2014 in Minneapolis, MN. Together with Malgosia E. Cavar, S. Couture, U. Kazagasheva, E. Benzschawel, January, 2014.
Urban fieldwork, evolution and languages in cities. Paper presented at the Association for the Study of the Arts of the Present ASAP/5: Arts of the City Conference, together with Malgosia E. Cavar. Wayne State University, October 3rd-6th, 2013.
Bootstrapping large text corpora with TEI XML markup and linguistic annotation. Together with Malgosia E. Cavar. Chicago Colloquium on Digital Humanities and Computer Science. The University of Chicago. 19th of November 2012.
Automatic Linguistic Annotation with TEI-Output. TEI 2012 Conference at Texas A&M, College Station. 9th of November 2012.
The LINGUIST List Corpus: A Large Mailing List Corpus – Management, Annotation and Repository. Together with Malgorzata E. Cavar, Helen Dry Aristar, and Anthony Aristar. TEI 2012 Conference at Texas A&M, College Station. 9th of November 2012.
Dynamic Professional Content Corpora and New Technologies. Wayne State University, 19th of October 2012.
Large Mailing List Corpora: Management, Annotation and Repository. Together with Helen Aristar-Dry and Anthony Aristar. LREC 2012 Workshop on Challenges in the management of large corpora, Istanbul, 22nd of May, 2012.
Sprachtechnologie und Sprachdokumentation: Eine Darstellung von Projekten des Heimatinstituts der LINGUIST List. Institut für Deutsche Sprache, Mannheim. 8th of May, 2012.
Bootstrapping NLP and MT Resources for under-resourced languages. Crosslingual Language Technology in service of an integrated multilingual Europe – 20 years on-. University of Hamburg, Germany, 4-5 May 2012.
Cyclicity and Opacity Effects in the Prosody of Two Different Clitic Classes in New Shtokavian Variants. (Abstract, Slides) Presented at: Ilse Lehiste Memorial Symposium (Program). Together with Malgorzata E. Cavar. 11th of November 2011.
Clitic Placement, Syntactic Discontinuity, and information structure. With Melanie Seiß. (Handout, Slides) LFG 2011, Hong Kong. 17th of July 2011.
The Scheme Natural Language Toolkit (S-NLTK): NLP Library for R6RS and Racket. 4th European Lisp Symposium, Special Focus on Parallelism & Efficiency, TUHH, Hamburg University of Technology, Hamburg, Germany, 1st of April 2011.
No Escape from Clitics in Neo-Shtokavian: Contributions to the syntax and prosody of enclitic auxiliaries and pronouns, Research Colloquium, Linguistics Dept., University of Konstanz, 13th of Jan. 2011
Some notes on TEI in Linguistics. Plenary session: TEI 2010 (Annual meeting of the Text Encoding Initiative), University of Zadar, 13th of Nov. 2010
Riznica: The Croatian Language Corpus. Presented by, and together with Dunja Brozović Rončević, SLAVICORP Workshop, University of Warsaw, Poland, 22nd–24th of November 2010
O indukciji gramatike: računalni i statistički modeli usvajanja jezika (“On grammar induction: Computational and statistical models of language acquisition”). Research colloquium “Lingvistička srida,” University of Zadar, 25th of Feb. 2010
Morfološka analiza i lematizacija hrvatskog jezika na konačnim automatima (“Morphological analysis and lemmatization of Croatian using Finite State Automata”), Research Colloquium, Zadarska lingvistička Srida, University of Zadar, 19th of Nov. 2009
Frequency correlations in processing, familiarity, and language usage data of clitics in Croatian. Together with Tomislav Frleta, SLS 2009, 4th Annual Meeting of the Slavic Linguistic Society, 3rd-6th of September 2009
Empirical evidence for the functional determiner projection in Croatian. Together with Matea Birtić, SLS 2009, 4th Annual Meeting of the Slavic Linguistic Society, 3rd-6th of September 2009
On bootstrapping of linguistic features for bootstrapping grammars. EACL 2009 workshop Computational Linguistic Aspects of Grammatical Inference, Athens (Greece) 30th of March 2009
On the induction of linguistic categories and learning grammars. 10. Workshop in Szklarska Poreba (Poland), 12th of March 2009
Red riječi u hrvatskim govorima: empirijski pristup padežima. Zadarska lingvistička srida, Zadar, 10th of March 2009
Formalni modeli i Računalna obrada jezika. PhD program in linguistics at the Philosophical Faculty of the University in Osijek. 7th of March 2009
Nova generacija računalne obrade jezika. Developers User Group Split, 34. Skup IT profesionalaca, Split, Croatia, 22nd of January 2009
Some Quantitive and Qualitative Aspects of Nominal Case in Croatian. Second Croatian Syntax Days, Osijek, Croatia, 13.-15. November 2008
Interoperability and Rapid Bootstrapping of Morphological Parsing and Annotation Automata. IS-LTC 08, Ljubljana, Slovenia. 16.-17. October 2008
CroMo – Morphological Analysis for Standard Croatian and its Synchronic and Diachronic Dialects and Variants. FSMNLP 2008, Ispra, Lago Maggiore, Italy, 11.-12. of September 2008. (slides in PDF, poster in PDF)
Struktura i razvoj baze podataka za potrebe projekta „Hrvatsko strukovno nazivlje – projekt koordinacije“, Poletna Terminološka Šola, ZRC SAZU, Ljubljana, 4.-9. of Sept. 2008.
On Grammar induction, Language Models, and Language Evolution Simulations. PsychoCompLA 2008, at the CogSci 2008, Washington DC, 23.-24. of July 2008.
Machine learning systems as models for natural language grammar acquisition? University of Nova Gorica, Slovenia. 20th of February, 2008.
The Croatian Language Repository: Quantitative and Qualitative Resources for Linguistic Research and Language Technologies. Eastern Michigan University, Institute for Language Information and Technology (ILIT). 31st of January, 2008.
The Croatian Language Repository: Quantitative and Qualitative Resources for Linguistic Research and Language Technologies. Indiana University, Linguistics Dept. 29th of January, 2008.
Modelle dynamischer Eigenschaften von Sprachen und ihre Anwendungen, University of Bielefeld, Germany, 04/27/2007.
Dynamische und lernende Sprachmodelle. University of Bochum, Germany, 04/26/2007.
Dynamic Language Models. JOTA, Ljubljana, Slovenia. 03/29/2007
Inducing Lexical Properties with Probabilistic Methods. Polish Academy of Sciences, Warsaw, Poland. 08/11/2006
Das Korpus der kroatischen Sprache: Hrvatska jezična mrežna riznica. (The Croatian Language Corpus/Hrvatski jezični korpus). University of Graz, Austria. 06/19/2006
About Clitics in Croatian: Myths, and Fairy-tales. Together with Dunja Brozović-Rončević, at the conference Hrvatski sintaktički dani (Croatian Syntax Days) at the University of Osijek, Croatia. 05/11/2006
Parsing Croatian. Institute of Croatian Language and Linguistics, Zagreb, Croatia. 04/01/2006
Inducing Syntactic Properties of Lexical Elements via Information Theoretic Measures. Together with Paul Rodrigues and Giancarlo Schrementi. Workshop on Computational Modeling of Lexical Acquisition: The Split Meeting. Split, Croatia. 07/25-28/2005.
Using Morphological and Distributional Cues for Inductive Part-of-Speech Tagging. The Second Midwest Computational Linguistics Colloquium (MCLC-2005), Columbus, OH, The Ohio State University. 05/15/2005.
Unsupervised morphology induction for part-of-speech tagging. The 29th Penn Linguistics Colloquium. 02/26/2005
Constraint-based Cue-learning and cue-based language acquisition. Workshop on Approaches to Empirical Syntax/WOTS-8, Berlin, Germany. 08/28/2004
Alignment Based Induction of Morphology Grammar and its Role for Bootstrapping. The 9th conference on Formal Grammar: FGNancy, Nancy, France. 08/08/2004
Computationale Modellierung des Spracherwerbs: Wieviel kann man über natürliche Sprache von der Sprache selbst lernen? Center for Computing Technologies (TZI), University of Bremen, Germany. 07/08/2004
Syntactic Parsing Using Mutual Information and Relative Entropy. Midwest Computational Linguistics Colloquium (MCLC), Bloomington, Indiana. 06/25/2004
Unsupervised Grammar Induction. Cognitive Science Colloquium, Psychology Department, Indiana University, Bloomington, Indiana. 06/02/2004
On unsupervised grammar induction from untagged corpora. Poznań Linguistics Meeting 2004, Poznań, Poland. 05/19/2004
Computational Modeling of Language Acquisition. Linguistics Colloquium, University of Potsdam, Germany. 05/17/2004
Computational Modeling of Language Acquisition. Psycholinguistics Suppers, CUNY Graduate Center, New York. 05/04/2004
Cognitive Aspects of Language Acquisition and Processing. Sveučilište u Splitu (University of Split), Croatian Applied Linguistics Association. 03/17/2004
Lexicology and Corpus Linguistics. Sveučilište u Splitu, Anglistika (University of Split, English Department). 03/16/2004
Computational Aspects of Language Acquisition. The University of Arizona, Tucson. 03/04/2003
Cue-based bootstrapping: Computational Aspects of Language Acquisition. Indiana University, Bloomington. 01/29/2003
AMEWS – Automatic Metatagging Engine for SemanticWeb with WebService Architecture. University of California (UCLA), Los Angeles. 10/29/2002
A Real Live Web Service using Semantic Web Technologies: Automatic Generation of Meta-Information. Conf.: “On The Move Towards Meaningful Internet Systems” (DOA, ODBASE, CoopIS’02). Irvine, California. 10/28/2002
AMEWS – Automatic Metatagging Engine for SemanticWeb with WebService Architecture. Indiana University, Bloomington: Linguistics Colloqium. 09/13/2002
Automatic Generation of Metatags for Intra-Semantic-Web. XSW 2002 Workshop (Semantic Web), Humboldt Universität Berlin. 06/26/2002
Zur Rolle von “semantischen Netzen” für das Wissensmanagement (On the role of semantic nets for knowledge management). AIK-Symposium on Semantic Web, University of Karlsruhe. 04/19/2002
Natürlichsprachliche Systeme für das Wissensmanagement (Natural language systems for knowledge management). Kolloqium, University of Potsdam. 04/16/2002
Distributed Deletion. Kolloqium (Prof.Dr. Günther Grewendorf ) Universität Frankfurt a.M. Jan. 2002
Optimizing knowledge mining in the e-back-office. Euromap e2001 conference in Venice. Session Opportunities for language technology in multilingual e-markets. Chairperson: Hanne Fersøe, Center for Sprogteknologi, Denmark. 10/19/2001
Digital Dictionary of the 20th Century German Language. Institute of Computer Science, Polish Academy of Sciences. Joint work with Alexander Geyken (BBAW). Warsaw, Poland. 02/26/2001
Digital Dictionary of the 20th Century German Language. Jezikoslovne Tehnologije za Slovenski Jezik, JS 2000. Ljubljana, Slovenia. 10/17/2000
Klitikpositionen, diskontinuierliche Konstituenten und andere Wortunordnungen im Slavischen. (Clitic positions, discontinuous constituents and other word-diss-orders in Slavic) SlavGG Meeting, University of Leipzig. 07/15/2000
Split Constituents. Workshop “Conflicting Rules in Phonology and Syntax”, University of Potsdam. Joint work with: Gisbert Fanselow. 12/17/1999
Discontinuous Constituents in Slavic and Germanic. Drugi Hrvatski Slavistički Kongres (Second Croatian Slavic Congres) in Osijeku, Croatia. Joint work with: Gisbert Fanselow. 09/18/1999
End-to-End Evaluation ’99-1. Verbmobil Projektlenkungssitzung, Daimler-Chrysler Center in Stuttgart. 05/12/1999
Discontinuous Constituents. Poznań Linguistics Meeting. Poznań, Poland. 05/01/1999
End-to-End Evaluation in Verbmobil. Verbmobil Projektlenkungssitzung in Aachen. 12/07/1998
Klitike u Hrvatskom Jeziku (Clitics in Croatian). Institut za Hrvatski Jezik i Jezikoslovlje (Institute for Croatian language and linguistics), Zagreb, Croatia. 10/09/1998
Verbmobil: A Speech-to-Speech Translation System. Institut za Hrvatski Jezik i Jezikoslovlje (Institute for Croatian language and linguistics), Zagreb, Croatia. 10/08/1998
Verbmobil: A Speech-to-Speech Translation System. Jezikovne Tehnologije za Slovenski Jezik (JS) ’98 (Language technologies for the Slovenian language), Ljubljana, Slovenia. 10/06/1998
Autonomy and Look-ahead: Interface phenomena in Croatian. University of Tübingen. 05/25/1998
End-to-End Evaluation of Verbmobil. Verbmobil Projektlenkungssitzung, Bonn (D). 05/11/1998
Split Constituents: A Comparison between Germanic and Slavic. PLM: 31st Poznań Linguistic Meeting (PL). 05/1.-2./1998
Verb Movement and Coordination. FDSL 2, University of Potsdam, joint work with: Chris Wilder. 11/20.-22./1997
Interface Phenomena in Slavic: Polish and Croatian Cliticization. FDSL 2, University of Potsdam. 11/20.-22./1997
Children’s Sensitivity to Word-Order Variations in German: Evidence for Very Early Parameter Setting. Boston University Conference on Language Development (USA), joint work with: Jürgen Weissenborn, Barbara Höhle, Dorothea Kiefer. 11/07.-09./1996
Split Constituents in Germanic and Slavic. International Conference on Pied-Piping, Universität Jena (D), joint work with: Gisbert Fanselow. 05/30/1997
On the Syntactic and Phonological Properties of Enclitic Categories in Slavic. PLM: 30th Poznań Linguistic Meeting (Poland), AG: Reductionist Approaches to Syntax and Descriptive Adequacy (Jacek Witkoś). 05/03/1997
On the Syntax and Phonology of Clitics in Slavic. 19. DGfS Jahrestagung in Düsseldorf (D), AG: Die Interaktion grammatischer Teilbereiche (Katharina Hartmann & Daniel Büring). 02/24/1997
On Clitics and Split Consituents. Zentrum für allgemeine Sprachwissenschaft (ZAS), Berlin (D), Sprachwissenschaftliches Kolloqium. 02/18/1997
Interface Phenomena in Croatian and Polish. University of Leipzig (D). 11/27/1996
Frühe Syntaktische Wissensstrukturen: Kontinuität und Ökonomie (Early syntactic knowledge structures: continuity and economy). Annual meeting of the RULE group, Groß Dölln (D), (PANNONIA Hotel Döllnsee), Berlin-Brandenburg Academy of Science. 10/11/1996
Frühe Syntaktische Wissensstrukturen: Kontinuität und Ökonomie (Early syntactic knowledge structures: continuity and economy). Workshop: TROPICS, Berlin-Brandenburgische Akademie der Wissenschaften, joint work with: Jürgen Weissenborn, Barbara Höhle, Dorothea Kiefer. 09/28/1996
Weissenborn, J., Höhle, B., Kiefer, D. & Cavar, D. On the Structure of Tacit Syntactic Knowledge: Continuity and Economy. Conference: Approaches to Bootstrapping in Early Language Development, Berlin-Brandenburgische Akademie der Wissenschaften, Berlin, 1996.
Facing the Interface: On the Properties of “Deficient Elements” in South-Slavic. Third Summer School in Generative Linguistics, Palacky Univerzitet Olomouc (Czech Republic). 08/22/1996
On Clitics in Croatian: More Syntax than Prosody! Geisteswissenschaftliche Zentren Berlin e.V., Zentrum für allgemeine Sprachwissenschaft, Typologie und Universalienforschung (ZAS), Workshop on the Syntax, Morphology and Phonology of Clitics. 05/26.-27/1996
Auxiliaries in Serbian/Croatian and English. FDSL 1, University of Leipzig (D). Dec. 1995
Minimalistische Bewegung (Minimalistic Movement). University of Regensburg (D), 12/15/1993
Triggers & Economy. Universität Wuppertal (D). 11/12/1993
Pronominal and Verbal Clitics in Croatian. University of Durham (UK), EUROTYP Clitics Meeting. 10/17/1993
Pronominal and Verbal Clitics in Croatian. SOAS, London (UK) (School of Oriental and African Studies). 10/14/1993
Verbale und pronominale Klitika (Verbal and pronominal clitics). Universität zu Köln (D), GGS. 07/10/1993
Ein Konzept für MultiMedia im Sprachunterricht am Beispiel des Japanischen (A concept for multimedia in 2nd language teaching on the basis of Japanese). München (D), MediaNet, Universität Frankfurt a.M. 07/09/1993
Wortstellungsvariation, Verbbewegung und Ökonomie-Prinzipien (Wordorder variation, verb movement and economy principles). Universität Stuttgart (D). 06/24/1993
Verbs and Clitics in Croatian. Université de Genève (CH), Clitic Workshop. 06/01/1993
Verbs and Clitics in Croatian. Rijksuniversitet Groningen (NL). 05/28/1993
X0-Bewegung und Ökonomie (X0-movement and economy). DGfS annual meeting, University of Jena (D), Workshop: Wortordnung (word order). 03/03/1993
X0-Bewegung und Ökonomie (X0-movement and economy). Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V., Arbeitsgruppe Strukturelle Grammatik an der Humboldt-Universität, Berlin. 11/17/1992
Long Head Movement? Verb-Movement and Cliticization in Croatian. University of Regensburg (D), GGS. 02/01/1992