University of Helsinki Language Technology group at NoDaLiDa 2019

The University of Helsinki Language Technology group has a number of papers at The 22nd Nordic Conference on Computational Linguistics (NoDaLiDa’19). Here’s where you can find us. See you in Turku!

Monday September 30:

NLP4CALL workshop:

  • 13:15. Mathias Creutz, Eetu Sjöblom: “Toward automatic improvement of language produced by non-native language learners”

Constraint Grammar – Methods, Tools, and Applications workshop:

  • 14:15. Anssi Yli-Jyrä: “Constraint Grammar Is a Hand-Crafted Transformer”

Tuesday 01 October:

Oral presentations:
Parallel session B: Morphology and Syntax.

  • 13:45. Ilmari Kylliäinen and Miikka Silfverberg: “Ensembles of Neural Morphological Inflection Models”

Posters:

  • 16:45. Jeff Ens, Mika Hämäläinen, Jack Rueter and Philippe Pasquier: “Morphosyntactic Disambiguation in an Endangered Language Setting”

Demos:

  • 16:45. Mikko Aulamo and Jörg Tiedemann: “The OPUS Resource Repository: An Open Package for Creating Parallel Corpora and Machine Translation Services”

Wednesday 02 October

Oral presentations:
Parallel session B: Speech.

  • 14:50. Aarne Talman, Antti Suni, Hande Celikkanat, Sofoklis Kakouros, Jörg Tiedemann and Martti Vainio: “Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations”

Meet the LT Industry 2019

  • Place: Metsätalo (Unioninkatu 40), Sali I
  • Date: Friday October 11, 2019
  • Time: 14:15 – 15:45

The purpose of this event is to arrange a meeting between students and representatives of the industry that work with language technology in one way or another. The event is open to anyone who is interested in getting information about career opportunities. We will have short presentations of relevant companies and their business and leave time for questions and discussions. There will also be the opportunity to informally speak to the industry representatives face to face.

We have invited various language service providers and LT businesses and the preliminary list of confirmed participants is listed below.

  • AlphaSense
  • Insider Solutions
  • Kielikone
  • Lingsoft
  • Sanoma
  • Silo.AI
  • Semantix
  • Utopia Analytics

 

This list is subject to change and more information about the program will be posted later

Meet the LT Industry 2018

  • Place: Porthania, Suomen Laki -hall (PIV)
  • Date: October 12, 2018
  • Time: 14:15 – 15:45

The purpose of this event is to arrange a meeting between students and representatives of the industry that works with language technology in one way or another. The event is open to anyone who is interested in getting information about career opportunities. We will have short presentations of relevant companies and their business and leave time for questions and discussions. There will also be the opportunity to informally speak to the industry representatives face to face.

We have invited various language service providers and LT businesses and the preliminary list of confirmed participants is listed below.

 

This list is subject to change and more information about the program will be posted later

Grant from KITES

KITES offers a grant for work in a research project on the collection of Finnish-Swedish parallel corpora and the development of machine translation models for the same language pair. See the announcement and all details here.

FoTran2018

We organise an event on representation learning from multilingual language data (FoTran2018). We have great invited speakers:

  • Kyunghyun Cho, NYU, New York
  • Manaal Faruqui, Google
  • André Martins, Unbabel, Lisbon
  • Ivan Vulić, University of Cambridge
  • Željko Agić, IT University of Copenhagen

Sign up if you want to participate or even present your work! Participation is free but registration is required. More info here: https://blogs.helsinki.fi/language-technology/fotran-2018/

Found in Translation project presentation at Charles University’s Fred Jelinek Seminar Series

Professor Jörg Tiedemann gave a talk at the Charles University in Czech Republic on the 18th June as part of their Fred Jelinek Seminar Series.

Abstract:

Found in Translation – Learning to understand languages with cross-lingual grounding

Translated texts are semantic mirrors of the original text and the significant variations that we can observe across languages can be used to disambiguate the meaning of a given expression using the linguistic signal that is grounded in translation. We are interested in massively parallel corpora consisting of hundreds up to a thousand different languages and how they can be applied as implicit supervision to learn abstractions that could lead to significant improvements in natural language understanding. As a side-effect, we can also see how multilingual models can pick up essential relationships between languages building a continuous space with reasonable language clusters. I will talk about some initial results and plans for the future and I would like to get your feedback about those ideas.