Language technology entries in HELDA (tags: kieliteknologia, language technology, computational linguistics, språkteknologi); information for students looking for a bachelor/master/individual project:
Recent PhD Theses
- Emily Öhman, 2021: The Language of Emotions : Building and Applying Computational Methods for Emotion Detection for English and Beyond
- Aleksi Sahala, 2021: Contributions to Computational Assyriology
- Senka Drobac, 2020: OCR and post-correction of historical newspapers and journals
- Mika Hämäläinen, 2020: Generating Creative Language : Theories, Practice and Evaluation
- Tommi Jauhiainen, 2019: Language identification in texts
- Mikka, Silfverberg, 2016: Morphological Disambiguation using Probabilistic Sequence Models
- Maarit Koponen, 2016: Machine Translation Post-editing and Effort : Empirical Studies on the Post-editing Process
- Tommi Pirinen, 2014: Weighted Finite-State Methods for Spell-Checking and Correction
- Jussi Piitulainen, 2011: Explorations in the distributional and semantic similarity of words
Recent Master Theses
- Low-resource Neural Machine Translation from Finnish to Chinese, Zhixu, Gu (2023)
- RPG-GPT: Leveling up game dialogue with creative NLG, (2023)
- Annotating multimodal discourse relations by combining crowdsourcing and natural language processing, Hotti, Helmiina (2023)
- Linguistic Feature Analysis of Real and Fake News: Human-written vs. Grover-written, (2023)
- Detecting hostility in user-created content of mobile games using machine learning models : A case study of Supercell’s Brawl Stars, (2023)
- Automatic Normalization of Finnish Social Media Text, (2022)
- Clustering of Neural Document Embeddings for Machine Generation of Search Extension Terms in Finnish in the Public Procurement Domain, Rahman, Dean (2022)
- Supervised multi-class text classification for media research: augmenting BERT with topics and structural features, Bedretdin, Ümit (2022)
- Text Normalisation of Dialectal Finnish, Koho, Tiina (2022)
-
Phonological Bias, (2022)
- Analysing Finnish Multi-Word Expressions with Word Embeddings (2020)
- Semi-Automated Methods of Direct Anglicisms Identification in Finnish Corpora, (2020)
- Using POS n-grams to detect grammatical errors in Finnish text, (2019)
- Multilingual paraphrase grammars for machine translation evaluation (2018)
- Document classification based on library catalogue metadata, (2017)
- Nothing but the Truth! : Deception Prediction on Hotel Reviews using Language Technology, (2016)