As of March 2016, we have started to release the revised versions of monograph items in Fenno-Ugrica collection. Based on vocabulary that gathered in the proof-reading campaign during summer 2015, we have enhanced our OCRing (optical text recognition) procedures and managed to improve the quality of monographs.
Liiketila art collective and Minority Languages Project of the National Library Finland have launched a crowdsourcing, or perhaps even a nichesourcing task for the public, aiming to collect spoken Uralic languages to be utilized in an art performance called MAA (land, soil, nation, ground, earth, country).
The National Library of Finland (NLF) has been awarded by the Kone Foundation to realize the Minority Languages Project in 2016. The project will produce digitized materials mostly in the Uralic languages, but in Yiddish and Romani too. NLF’s objective is to make sure that new corpora in these languages will be made available for the open and interactive use of both the academic community and the language societies.
Fenno-Ugrica, an online collection for materials in Uralic languages at the National Library of Finland, has reached new milestones. Since its launch in June 2013, there has been more than 250 000 downloads from this collection up to date. On Sunday, we passed by another landmark: 150 000 downloads in 2015 alone.
Since the very beginning of the Digitization Project of Kindred Languages, our objective has been the securing the wider-than-usual availability of new corpora, which do base to the digitized material. Our primary intention has to spread the Fenno-Ugrica data to such tools and environments, which are made available for both, the academic and the language communities.
Monographs written in the Uralic languages and digitised by the National Library will be available through the National Library’s Finna search portal as of this Wednesday.
The Helsinki Book Fair is about to begin in a week. The theme country of this year’s event is Russia, thus we wanted to highlight our services related to Russian studies and Uralic languages. Naturally, the Digitization Project of Kindred Languages will be present in this event at the National Library’s stand (6e71). Please, find here a brief summary of our programme at the Book Fair.
In 2015, our mission has been the improvement of the usage and usability of digitized content. During the project, we have advanced methods that will refine the raw data for further use, especially in the linguistic research. Finally, we have reached the stage, in which we may start producing the word lists out of digitized and edited content and we have now released some word lists (arranged by frequency and “all hits”) in 12 languages in Fenno-Ugrica.