Greetings from Aalto University, Otaniemi!

Mikko Kurimo is the coordinator of Teflon and leads the research group at Aalto University. Prof. Kurimo has been the head of the automatic speech recognition (ASR) group at Aalto since 2000. He has led the group in many national and international machine learning and ASR projects. Kurimo’s work is internationally best known for unsupervised subword language modeling for morphologically complex languages such as Finnish, Estonian, Turkish and Arabic. His research interests include deep learning methods for automatic speech recognition and spoken language modeling.

Aalto’s team is in charge of the ASR technology that is required in Teflon and will also provide the game platform for Teflon. The platform is based on the previous projects of Mikko Kurimo and Sari Ylinen where English was the target language. Now the goal in Teflon is to apply it to Finnish, Swedish and Norwegian.

The Aalto team consists of:

  • Tamas Grósz is a post-doctoral researcher whose specification area is ASR and computational paralinguistics. His background lies in developing machine learning based ASR systems. 
  • Ragheb Al-Ghezi is a doctoral researcher whose main research focus lies in applying ASR and machine learning in language assessment. He has a versatile background in natural language processing and second language (L2) learning.
  • Ekaterina Voskoboinik is a doctoral researcher whose main research focus lies in applying statistical language modeling and machine learning in ASR and spoken L2 assessment. Her background is on learning and analysing the representations of words and subwords for morphologically rich languages.
  • Yaroslav Getman is a doctoral researcher whose main research focus lies in applying self-supervised learning and pre-trained models for ASR in low-resource tasks like L2 learners’ ASR for spoken language assessment.
  • Aku Rouhe is a doctoral researcher whose main research focus is in ASR. He has a versatile background speech and spoken language modeling including speaker adaptation, voice activity detection, subword language modeling using statistical morphemes, speech translation and decoding algorithms and miscue tolerant ASR in L2 reading tutoring.
  • Nhan Phan is a master’s student whose research topic is developing a mobile app to give feedback for beginner level L2 learners’ in read aloud tasks. His skills also include game programming using Unity and his task is to modify the game platform for Teflon tasks.

We are excited and looking forward to the experiment with the Nordic languages.

Aalto University is the coordinating partner of the Teflon project. Aalto University describes itself as a meeting place for the fields of science, art, technology, and business. Today the university is one of the most prestigious ones in Finland. The university started operating in 2010 and the goal was to merge together the old Helsinki School of Economics, Helsinki University of Technology and the University of Art and Design Helsinki and create a new multidisciplinary university. The Aalto university campus in Otaniemi is situated in Greater Helsinki in the city of Espoo, just a metro ride away from central Helsinki. 

The campus was originally built almost in the middle of the forest in Otaniemi. The rebuilding after the Second World War in Finland led to a need to educate more engineers – and hence to a need to build a new campus area and laboratory spaces for the Helsinki University of Technology and VTT Technical Research Centre of Finland Ltd. The Otaniemi campus is a park campus design from the 1950s and the city plan of the area is the work of the Finnish architect Alvar Aalto who has along with other esteemed Finnish architects such as Reima and Raili Pietilä and Heikki and Kaija Sirén also designed multiple buildings in the campus. The name of the university is a tribute to Alvar Aalto who himself graduated from the former Helsinki University of Technology.

List of recent publications: 

  • Anssi Moisio, Dejan Porjazovski, Aku Rouhe, Yaroslav Getman, Anja Virkkunen, Ragheb AlGhezi, Mietta Lennes, Tamás Grósz, Krister Lindén and Mikko Kurimo. Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks. Language Resources & Evaluation (2022).
  • Kathania, Hemant; Kadiri, Sudarsana; Kadyan, Virender; Kurimo, Mikko. Data augmentation using spectral warping for low resource children ASR. Journal of Signal Processing Systems. Accepted for publication.
  • Kathania, H , Kadiri, S , Alku, P & Kurimo, M. A Formant Modification Method for Improved ASR of Children’s Speech. Speech Communication (2022), vol. 136 , pp. 98 – 106.
  • Kathania, Hemant; Kadiri, Sudarsana; Alku, Paavo; Kurimo, Mikko. Using data augmentation and time-scale modification to improve ASR of children’s speech in noisy environments. Applied Sciences (Springer International Publishing AG), 2021.
  • Ylinen Sari, Smolander Anna-Riikka, Karhila Reima, Kakouros Sofoklis, Lipsanen Jari, Huotilainen Minna, Kurimo Mikko. The Effects of a Digital Articulatory Game on the Ability to Perceive Speech-Sound Contrasts in Another Language. Frontiers in Education, 2021 ; Vol 6.
  • Ragheb Al-Ghezi, Yaroslav Getman, Ekaterina Voskoboinik, Mittul Singh, Mikko Kurimo. Automatic Rating of Spontaneous Speech for Low-Resource Languages. IEEE Spoken Language Technology Workshop (SLT 2022), 2022.
  • Grósz, T., Porjazovski, D., Getman, Y., Kadiri, S., Kurimo, M. Wav2vec2-based Paralinguistic Systems to Recognise Vocalised Emotions and Stuttering. ACM Multimedia 2022 Conference : Grand Challenges.
  • Yaroslav Getman, Ragheb Al-Ghezi, Katja Voskoboinik, Tamás Grósz, Mikko Kurimo, Giampiero Salvi, Torbjørn Svendsen, and Sofia Strömbergsson. Wav2vec2-based Speech Rating System for Children with Speech Sound Disorder. Proc. Interspeech 2022. 
  • Ragheb Al-Ghezi, Yaroslav Getman, Aku Rouhe, Raili Hildén and Mikko Kurimo. Self-supervised End-to-End ASR for Low Resource L2 Swedish. Proc. Interspeech 2021.

https://www.aalto.fi/fi/aalto-yliopisto

https://research.aalto.fi/fi/persons/mikko-kurimo

Photo: Unto Rautio

Researchers of Teflon project: Sofia Strömbergsson

Author: trainee Elina Aittokallio, Tampere University

Greetings from Tampere!In the following blogpost we will briefly introduce one of the researchers working on the Teflon project, Sofia Strömbergsson from Karolinska Institutet (KI), the largest center of medical academic research university in Sweden.

Kuvan esikatselu

Strömbergsson is an associate professor of Karolinska Institutet. She is a teacher and conductor of research concerning children’s speech and language disorders. Strömbergsson earned her doctoral degree in April 2014 at the Royal Institute of Technology (KTH), Stockholm. Strömbergsson also has a medical science degree in SLP (Lund 2007) and in computational linguistics (Uppsala, 2000). 

We’ve had the honor to hold an interview on Sofia’s job as a researcher:

What sparked your interest into the field of logopedics? How did your career path form?

I’ve always been interested in language, and particularly in its spoken form – different ways of speaking, and how that’s perceived by listeners. I first started out in the field of speech technology, and I worked a few years in the industry developing text-to-speech for Swedish. I really enjoyed the task of trying to “teach” the computer how to read. After some time, though, I wanted to make use of my interest in speech and language in ways that would benefit others more directly. That’s when I decided to embark on a study program in logopedics. When I finished, an opportunity opened up for pursuing PhD studies at KTH, where I had the chance to combine my two interests into a thesis project on how children perceive automatically “corrected” versions of their own recorded misarticulation. In 2014, after I finished, I joined the Division of Speech-Language Pathology at KI, where I still am today.  

How do you find Karolinska Institutet as a place to work and do research at? 

When I joined KI and the Division of Speech and Language Pathology in 2014, it felt a little bit like coming home. Although I had really enjoyed my time at KTH, I had been a bit of an odd bird, being the only SLP among engineers (and, admittedly, also some phoneticians). At KI, I experienced that I could contribute with my background in computational linguistics as a new perspective, but to a research core that lied closer to my own interests. I thrive here, where I can focus on research centered around clinical relevance, and to increase our understanding of if, how and why certain intervention methods work, and others don’t. 

Regarding the ongoing Teflon project:

What is your role in the project?  

I lead the part of TEFLON that focuses on trying out the speech training app in a clinical population. 

What type of joys and challenges have you faced during the project? 

Up until now, I’ve been working alone at my site on the project. Although we’ve had regular meetings in the bigger group, it hasn’t always been easy to manage the KI part of the project single-handedly. Therefore, I’m thrilled to have a project assistant, Magdalena Pettersson, joining me now in October! 

Has the project raised any new questions for future research? 

As we are still in the beginning of the project, I can’t say it has. (But I might have more to say about it in 2024!)

Aside from Teflon, are you involved in other research projects at the moment?  

Yes, I run a project called SPETS (http://ki.se/spets). In SPETS, we explore and compare different types of intervention for preschool-aged children with Developmental Language Disorder (DLD). It’s a longitudinal study, following children with DLD for 2 years, to explore what intervention they receive, and how their language skills, communicative participation, and quality of life change over time.

Lastly, as you have been very active in the field of research and must have a lot of other work besides it, we’d like to know what is your way to cool down from work.

Do you have any non-work-related passion projects, hobbies, or goals to achieve? 

Although I try to guard my limits, research has its way of intruding on one’s free time. And indeed, I also let it, and often enjoy it! But that said, cooling down and doing other things is important. And there are a lot of things I enjoy doing to relax! Spending time with my family and friends is of course one and seeking wild-life experiences is another. I love to go hiking and cross-country skiing in the Norwegian mountains, and I run a lot, and most of all enjoy running in new places.

We wish Strömbergsson all the success for the project and future research. In addition, we have listed below some research Strömbergsson has been part of and also the PhD from 2014.

Recent research:

Simulating Speech Error Patterns Across Languages and Different Datasets

Strömbergsson S, Götze J, Edlund J, Nilsson Björkenstam K 

Language and speech 2022;65(1):105-142

A survey of Swedish speech-language pathologists’ practices regarding assessment of speech sound disorders

Wikse Barrow C, Körner K, Strömbergsson S 

Logopedics, phoniatrics, vocology 2021;():1-12

Canonical babbling ratio-Concurrent and predictive evaluation of the 0.15 criterion

Nyman A, Strombergsson S, Lohmander A 

Journal of communication disorders 2021;:106164-

Sofia Strömbergsson’s PhD from 2014:

Children’s perception of their synthetically corrected speech production

Strombergsson S, Wengelin A, House D 

Clinical linguistics & phonetics 2014;28(6):373-95

Sources:

https://staff.ki.se/people/sofia-strombergsson

https://nyheter.ki.se/ny-docent-i-logopedi-vid-clintec-0