Skip to main content

Annual Conference Overview |  Programme |  Registration 

Tour de CLARIN

Tour de CLARIN: Interview with Ondřej Tichý

In this Tour de CLARIN blog post, we present an in-depth interview with Ondřej Tichý, a corpus linguist who is deputy chair of the Department of English Linguistics at the Facuty of Arts at Charles University. Dr Tichý collaborates with and is a regular user of the Czech National Corpus

Tour de CLARIN: Interview with Kaja Dobrovoljc

In this Tour de CLARIN blog post, we present an in-depth interview with Kaja Dobrovoljc, a Slovenian corpus linguist who works at the Centre for Language Resources and Technologies and regularly collaborates with CLARIN.SI and uses its infrastructure.

Tour de CLARIN: Interview with Nan Bernstein Ratner

In this Tour de CLARIN blog post, we present an in-depth interview with Nan Bernstein Ratner, who is along with Brian MacWhinney one of the PIs of FluencyBank, a shared database for the study of the development of fluency in typical and disordered populations.

CLARIN.SI presents CSMTiser

Read about the CSMTiser, a supervised machine learning tool that performs word normalization by using Character-level Statistical Machine Translation.

The TalkBank CLARIN Knowledge Centre

TalkBank, which was recognized as a CLARIN Knowledge Centre in 2016, is the world’s largest open access integrated repository for spoken language data. It provides language corpora and other audio resources to support researchers in Psychology, Linguistics, Education, Computer Science, and Speech Pathology.

CLARIN.SI presents Emoji Sentiment Ranking 1.0

In 2015, researchers from the Jožef Stefan Institute in Ljubljana, Slovenia released the first emoji sentiment lexicon, called Emoji Sentiment Ranking 1.0, and published it as a resource in the public language resource repository CLARIN.SI. With 78,500 downloads to date, the lexicon is the most downloaded resource in the CLARIN.SI repository.