CIFU XII, Day 3

The day was dedicated to our symposium, Language Technology through Citizen Science, which consisted of nine fine presentations. The presentations either (1) presented open-source language technology achievements and tools for documenting minority Uralic languages through Citizen Science methods and crowdsourcing, or (2) presented and developed innovations for advancing the use of Citizen Science and crowdsourcing in open-source language technology.

I am not going to analyze every individual presentation in this blog entry; I will merely try to give a brief overview of my thoughts on some selected discussions.

As the first speaker was unable to attend, we decided to have a brief discussion with the attendees in order to put the available time to good use. Jeremy Bradley also gave a brief introduction to interface development for linguistic tools. This discussion reminded me a lot of Joris Van Zundert's paper, given at DH2015 in Sydney a few weeks ago. The key point for Van Zundert, in my interpretation, was that since a lot of scholarly decisions are made by the code, it is vital to understand what's inside it. In the DH field, it is probably hard to combine textual criticism and critical code studies. According to Van Zundert, there is a distinction between scholarly code and code for scholars, and there is a trend toward merging the two fields. But it takes two to tango: people in DH (researchers) should get more acquainted with code, and people who develop the code shouldn't define themselves as mere servants of scholars, as they have been educated to do. I firmly believe that this is the approach to take when designing interfaces, which should satisfy the needs of a hard-core linguist on the one hand and the curiosity of a layperson on the other. This issue should also be discussed in my home organization, especially as we try to provide services for research.

Due to the absence of the first speaker, I had the pleasure of being the first speaker of the symposium. As usual, I briefly introduced our project and the Fenno-Ugrica collection to the audience, but I mainly tried to focus on the concept of nichesourcing. I have been rallying for this method for the past 18 months or so, but unfortunately the results have been disappointing so far. Despite the noble aspiration of interplay with the language communities, we haven't managed to edit enough material to be useful in research. Perhaps we need to look for other methods in order to accelerate the production of edited data. The slides of my presentation can be retrieved here.

Heidi (and Tommi) Jauhiainen presented their research project The Finno-Ugric languages and the Internet, whose aim is to build an automated system that searches the Internet for text written in small Uralic languages. Heidi Jauhiainen also showcased their Wanca collection, which is currently available as a beta version. Native speakers and scholars are invited to create a personal account and help verify the language labels that the project's automatic language identifier has given to the pages. The site is mesmerizing! It is great to browse the collection and find text extracts in so many Finno-Ugric languages, like Kven, Livvi, Nganasan or Ludic. It would be nice to help them find an audience to ease their work. Ask what you could do for them and drop a line to Heidi for additional information: name.surname@helsinki.fi

Antti Kanner of the University of Helsinki gave a paper titled Multilingual terminology work and lexicography on virtual open source collaboration platforms. Kanner introduced the possibilities offered by wiki platforms for terminology and lexicography work and presented two relevant projects: the Bank of Finnish Terms in Arts and Sciences (BFT) and the lexicographical wiki platform sanat.csc.fi, which so far contains only a Ludic Karelian dictionary. What interested me in his presentation was the use of the crowd to enrich both the terminology of the BFT and the Ludic dictionary. The former reminds me of the concept of nichesourcing, since they have engaged professionals from every field to define the terms. It is a good example of qualitative crowdsourcing.

The remaining speakers of the symposium were:

  • Esther Simon (Research Institute for Linguistics, Hungarian Academy of Sciences) Language technology support for Finno-Ugric digital communities
  • Marja-Liisa Olthuis and Trond Trosterud (University of Tromsø) Aanaar Saami e-lexicography
  • Tommi Pirinen (Dublin City University) Omorfi – A free and open source lexical database for computational linguistics of Finnish through combination of expert and crowdsourced data
  • Sven-Erik Soosaar (The University of Tartu) Creating open source language technology for Tundra Nenets – development problems and future prospects
  • Jeremy Bradley (Ludwig Maximilian University of Munich) A corpus-based analysis of syntactic structures: Postpositional constructions in Mari
  • Jack Rueter (University of Helsinki) On the development of open-source morphological analyzers for Uralic minority languages

Personally, I would like to thank all the presenters and attendees for their active participation. It was a pleasure, and I hope to see you again soon.

Yours &c.,
Jussi-Pekka
