Publications

2020

Mäkelä, Eetu, Krista Lagus, Leo Lahti, Tanja Säily, Mikko Tolonen, Mika Hämäläinen, Samuli Kaislaniemi & Terttu Nevalainen. Forthcoming. Wrangling with non-standard data. Proceedings of DHN 2020 (CEUR Workshop Proceedings). Aachen: CEUR-WS.org. Authors’ version.

2019

Hämäläinen, Mika, Tanja Säily, Jack Rueter, Jörg Tiedemann & Eetu Mäkelä. 2019. Revisiting NMT for normalization of early English letters. Beatrice Alex, Stefania Degaetano-Ortlieb, Anna Kazantseva, Nils Reiter & Stan Szpakowicz (eds.), Proceedings of the 3rd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (ACL Anthology W19-25), 71–75. Stroudsburg, PA: Association for Computational Linguistics. https://www.aclweb.org/anthology/W19-2509

2018

Hämäläinen, Mika, Tanja Säily, Jack Rueter, Jörg Tiedemann & Eetu Mäkelä. 2018. Normalizing early English letters to Present-day English spelling. Beatrice Alex, Stefania Degaetano-Ortlieb, Anna Feldman, Anna Kazantseva, Nils Reiter & Stan Szpakowicz (eds.), Proceedings of the 2nd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (ACL Anthology W18-45), 87–96. Stroudsburg, PA: Association for Computational Linguistics. http://aclweb.org/anthology/W18-4510

Nevalainen, Terttu. 2018. From speaker innovation to lexical change: A sociohistorical approach to neologisms. Pragmatics & Cognition 25(1): 8–29. Special issue, The Dynamics of Lexical Innovation: Data, Methods, Models, ed. by Daphné Kerremans, Jelena Prokić, Quirin Würschinger & Hans-Jörg Schmid. doi:10.1075/pc.18008.nev

Säily, Tanja, Eetu Mäkelä & Mika Hämäläinen. 2018. Explorations into the social contexts of neologism use in early English correspondence. Pragmatics & Cognition 25(1): 30–49. Special issue, The Dynamics of Lexical Innovation: Data, Methods, Models, ed. by Daphné Kerremans, Jelena Prokić, Quirin Würschinger & Hans-Jörg Schmid. doi:10.1075/pc.18001.sai; authors’ version.

Sairio, Anni, Samuli Kaislaniemi, Anna Merikallio & Terttu Nevalainen. 2018. Charting orthographical reliability in a corpus of English historical letters. ICAME Journal 42: 29–46. doi:10.1515/icame-2018-0005

2017

Säily, Tanja, Turo Vartiainen & Harri Siirtola. 2017. Exploring part-of-speech frequencies in a sociohistorical corpus of English. Tanja Säily, Arja Nurmi, Minna Palander-Collin & Anita Auer (eds.), Exploring Future Paths for Historical Sociolinguistics (Advances in Historical Sociolinguistics 7), 23–52. Amsterdam: John Benjamins. doi:10.1075/ahs.7.02sai; authors’ version.

Siirtola, Harri, Tanja Säily & Terttu Nevalainen. 2017. Interactive Principal Component Analysis. Ebad Banissi (ed.), Proceedings of the 21st International Conference on Information Visualisation (IV 2017), 416–421. Los Alamitos, CA: IEEE Computer Society. doi:10.1109/iV.2017.39; authors’ version.

2016

Siirtola, Harri, Poika Isokoski, Tanja Säily & Terttu Nevalainen. 2016. Interactive text visualization with Text Variation Explorer. Ebad Banissi (ed.), Proceedings of the 20th International Conference on Information Visualisation (IV 2016), 330–335. Los Alamitos, CA: IEEE Computer Society. doi:10.1109/IV.2016.57; authors’ version.

Presentations

2019

Säily, Tanja & Eetu Mäkelä. 2019. The OED and historical text collections: Discovering new words. Invited talk, OED Webinar Series, Oxford University Press, Oxford, UK, July 2019. Webinar.

Säily, Tanja, Eetu Mäkelä & Mika Hämäläinen. 2019. Neologisms in early English letters: How to find them and what they can reveal. 6th International Symposium on History of English Lexicography and Lexicology (HEL-LEX 6), Gargnano, Italy, June 2019. Presentation slides.

Säily, Tanja. 2019. Variation and change in individual writers of early English letters. Invited talk, lecture series Flexible Speakers – Flexible Writers, University of Erlangen-Nuremberg, Erlangen, Germany, June 2019.

Hämäläinen, Mika, Tanja Säily & Eetu Mäkelä. 2019. Automatic normalization of historical English for neologism detection. 40th Annual Conference of the International Computer Archive of Modern and Medieval English (ICAME 40), Neuchâtel, Switzerland, June 2019. Abstract.

Nevalainen, Terttu. 2019. Why “cannot” instead of “can not”? Searching corpora for spelling variation. 40th Annual Conference of the International Computer Archive of Modern and Medieval English (ICAME 40), Neuchâtel, Switzerland, June 2019.

Säily, Tanja, Eetu Mäkelä & Terttu Nevalainen. 2019. Between the OED and historical text collections: The case of discovering new words. Invited talk, OED Lecture Series, Oxford University Press, Oxford, UK, February 2019. Presentation slides.

Säily, Tanja. 2019. Sociolinguistic variation in the history of English: Productivity, neologisms and gendered styles. Invited talk, Research Colloquium, Department of Language Science and Technology, University of Saarland, Saarbrücken, Germany, February 2019. Abstract.

2018

Säily, Tanja, Eetu Mäkelä & Mika Hämäläinen. 2018. STRATAS: Neologism detection in historical corpora. Joint Seminar of the Academy of Finland DIGIHUM Programme and the Japan Society for the Promotion of Science (JSPS), Helsinki, Finland, November 2018.

Säily, Tanja, Mika Hämäläinen & Eetu Mäkelä. 2018. Neologism detection in historical corpora. Digital Humanities Research Seminar, Helsinki, Finland, September 2018. Presentation slides.

Kaislaniemi, Samuli & Anni Sairio. 2018. Charting spelling variation and editorial reliability in English historical letters. 20th International Conference on English Historical Linguistics (ICEHL 20), Edinburgh, UK, August 2018. Presentation slides.

Säily, Tanja, Eetu Mäkelä & Mika Hämäläinen. 2018. Developing an environment for neologism detection in historical corpora. 20th International Conference on English Historical Linguistics (ICEHL 20), Edinburgh, UK, August 2018. Presentation slides.

Siirtola, Harri, Tanja Säily & Terttu Nevalainen. 2018. Fingerprinting historical texts in TVE2. 20th International Conference on English Historical Linguistics (ICEHL 20), Edinburgh, UK, August 2018.

Siirtola, Harri & Tanja Säily. 2018. Visualization of text corpora – seven years on. 39th Annual Conference of the International Computer Archive of Modern and Medieval English (ICAME 39), Tampere, Finland, May 2018.

Hämäläinen, Mika, Tanja Säily & Eetu Mäkelä. 2018. Normalizing early English letters for neologism retrieval. Poster, Digital Humanities in the Nordic Countries (DHN 2018), Helsinki, Finland, March 2018. Poster.

Litola, Katja & Johanna Marttila. 2018. Early Modern Finnish from a letter corpus perspective. Finnish Network of Nineteenth Century Studies, Helsinki, Finland, January 2018.

2017

Sairio, Anni, Samuli Kaislaniemi, Terttu Nevalainen, Tanja Säily & Anna Merikallio. 2017. ERRATAS: Charting the reliability of modern editions of English historical texts. Poster, HELDIG Digital Humanities Summit, Helsinki, Finland, October 2017. Flyer.

Säily, Tanja, Terttu Nevalainen, Anni Sairio, Samuli Kaislaniemi, Anna Merikallio, Taru Nordlund, Katja Litola, Johanna Marttila, Eetu Mäkelä, Poika Isokoski & Harri Siirtola. 2017. STRATAS: Combining texts and contextual information in historical sociolinguistics. Poster, HELDIG Digital Humanities Summit, Helsinki, Finland, October 2017. Abstract; presentation slides.

Säily, Tanja, Eetu Mäkelä & Jukka Suomela. 2017. Social embedding of neologisms in early English correspondence. The Dynamics of Lexical Innovation: Data, Methods, Models (DynLex Workshop), Munich, Germany, June 2017. Presentation slides.

Nordlund, Taru & Katja Litola. 2017. Variaationtutkimus ja kirjoitettu kieli [Variation research and written language]. XLIV Kielitieteen päivät, Jyväskylä, Finland, May 2017.

Siirtola, Harri, Tanja Säily & Terttu Nevalainen. 2017. Text Variation Explorer 2: A new tool for exploring corpora. Software demonstration, 38th Annual Conference of the International Computer Archive of Modern and Medieval English (ICAME 38), Prague, Czech Republic, May 2017.

Nordlund, Taru, Anni Sairio, Tanja Säily, Eetu Mäkelä & Harri Siirtola. 2017. Interfacing structured and unstructured data in sociolinguistic research on language change. Annual Seminar of the Academy of Finland DIGIHUM Programme, Helsinki, Finland, May 2017.

Sairio, Anni, Samuli Kaislaniemi & Terttu Nevalainen. 2017. Towards a social history of epistolary spelling: Charting orthographical reliability in editions of English historical letters. Historical Sociolinguistics Network (HiSoN 2017), New York, USA, April 2017. Abstract.

Nordlund, Taru, Katja Litola & Johanna Marttila. 2017. Compiling a corpus from scratch: 19th century Finnish as a sociolinguistic laboratory. Historical Sociolinguistics Network (HiSoN 2017), New York, USA, April 2017.

Säily, Tanja, Jukka Suomela & Eetu Mäkelä. 2017. From old-English grubbers to cheeky blighters? Variation in the productivity of -er in the history of English. 5th International Symposium on History of English Lexicography and Lexicology (HEL-LEX 5), Zurich, Switzerland, February 2017. Presentation slides.

2016

Nevalainen, Terttu, Tanja Säily & the STRATAS team. 2016. STRATAS: Interfacing structured and unstructured data in sociolinguistic research on language change. Building and Using Language Technology (BAULT 2016), Helsinki, Finland, December 2016. Poster.

Säily, Tanja. 2016. Kuinka lukea tuhansia kirjeitä yhdellä silmäyksellä: Vanhat kirjeet uudessa käyttöliittymässä [How to read thousands of letters at a glance: Old letters in a new interface]. Digital Humanities Theme Event, Think Corner, University of Helsinki, Finland, November 2016. YouTube video.

Kaislaniemi, Samuli, with Anni Sairio, Anna Merikallio, Terttu Nevalainen & Tanja Säily. 2016. ERRATAS: Charting editorial interference and orthographical reliability in editions of English historical letters. Working from Manuscript Sources: Colloquium of the VARIANTTI-network (Finnish network on textual criticism and scholarly editing), Turku, Finland, November 2016. Abstract.

Säily, Tanja, Jukka Suomela & Eetu Mäkelä. 2016. Variation in morphological productivity in the history of English: The case of -er. International Society for the Linguistics of English (ISLE 4), Poznań, Poland, September 2016. Presentation slides.

Sairio, Anni, Samuli Kaislaniemi, Tanja Säily & Terttu Nevalainen. 2016. ‘My languishinge spiritts’ or ‘my languishing spirits’? Charting editorial interference and orthographical reliability in modern editions of English historical letters. International Society for the Linguistics of English (ISLE 4), Poznań, Poland, September 2016.

Siirtola, Harri, Terttu Nevalainen & Tanja Säily. 2016. Comparing like with like? Tools for exploring families of corpora. Digital Humanities Congress (DHC 2016), Sheffield, UK, September 2016. Abstract.

Mäkelä, Eetu, Tanja Säily & Terttu Nevalainen. 2016. Developing an interface for historical sociolinguistics. Digital Humanities Congress (DHC 2016), Sheffield, UK, September 2016. Abstract.

Mäkelä, Eetu, Tanja Säily & Terttu Nevalainen. 2016. Khepri – a modular view-based tool for exploring (historical sociolinguistic) data. Digital Humanities (DH 2016), Kraków, Poland, July 2016. Abstract.

Säily, Tanja. 2016. Kielen vaihtelu ja muutos: Korpusmenetelmien ja visualisoinnin uusia tuulia [Language variation and change: New trends in corpus methods and visualization]. Guest lecture, University of Tampere, Finland, April 2016.

Litola, Katja, Taru Nordlund, Johanna Utriainen & the STRATAS team. 2016. STRATAS: Interfacing structured and unstructured data in sociolinguistic research on language change. Poster, Historical Sociolinguistics Network (HiSoN 2016), Helsinki, Finland, March 2016.

2015

Nevalainen, Terttu & Tanja Säily. 2015. Digital diversity in language studies: Is multidisciplinary collaboration the key to analysing rich data? Conference of Langnet, the Finnish Graduate School in Language Studies, Tampere, Finland, November 2015.

Säily, Tanja. 2015. Neologisms in early English letters: The case of -ity. HSHL Seminar on Neologisms, Helsinki, Finland, November 2015. Presentation slides.

Mäkelä, Eetu, Terttu Nevalainen & Tanja Säily. 2015. Developing an interface for historical sociolinguistics. Software demonstration, From Data to Evidence: Big Data, Rich Data, Uncharted Data (d2e), Helsinki, Finland, October 2015.