Minority Languages Project – An Introduction

The National Library of Finland (NLF) has been awarded by the Kone Foundation to realize the Minority Languages Project in 2016. The project will produce digitized materials mostly in the Uralic languages, but in Yiddish and Romani too. NLF’s objective is to make sure that new corpora in these languages will be made available for the open and interactive use of both the academic community and the language societies.

Continue reading

Releasing the Komi newspapers at Fenno-Ugrica

Last year, we released a plenty of monographs in Komi languages in our online collection, Fenno-Ugrica. In addition to the monographs, we also are publishing newspapers in both, Komi-Permyak and Komi-Zyrian. All in all, there will be 23 titles and around 40 000 pages of Komi newspapers in our collection by the end of June 2015.

Continue reading

Post-prodcution of our digital content

Anis Moubarik, an information system specialist at the National Library and a member of DPKL team, will introduce you to that procedure what happens to a digitized book in our post-production processes. During the project, Anis has been in charge of creating both, OCR’ed PDFs that are available in our Fenno-Ugrica collection and Alto XML files per book, which are made available for editing in Revizor, the text editor for enhancing the data.

Continue reading

Money Well Spent

Fenno-Ugrica collection was released in June 2013 to support the research of Uralic studies in humanities. At that time, there were around 700-800 downloads on the monthly basis, which wasn’t that bad, I reckon. Actually, I was rather happy to notice that there are people who are using our stuff, even though some might consider the amount of monthly downloads as low.

Continue reading

The 10th of December – The Mari Language Day

The first Mari language grammar book (Sochineniya) was published in Saint-Petersburg in 1775. There is no noted author straight in the book but some researchers suppose that the metropolitan Veniamin Putsek-Grigorovich who was a missionary in the region of Kazan and studied local minority nations at least partly took part in the creation of this book.

The grammar book is the monument of the written Mari and Mari language literature. At the times of 1770, the Mari people were called with Russian language name Cheremis. Mari language has two variants Hill and Meadow Mari each of them could be divided into two other dialects Eastern and North-Western.

Continue reading

Stage One Completed, Second Round to Begin

The continuation phase of the Digitization Project of Kindred Languages (2014-2015) took off in January 2014. Since then, we have conducted the copyright clearance for all material that will be digitized and published by the end of this year. Also, naturally, we have signed the needed agreements with the National Library of Russia on digitization of the material. Not to mention, there’s a great deal of work done behind the scenes: the development of our OCR editor has taken a step forward, and a plenty of time has been spend on post-production of material here in Helsinki. By the mid-July, we have published exactly new 400 monographs in Khanty, Mansi, Hill and Meadow Mari, Nenets, Selkup, Komi-Permyak, Komi-Zyrian and Udmurt in our Fenno-Ugrica collection. We more than are glad that we have passed the first round of release now.

Continue reading