Tweets as a Challenge for the Automatic Linguistic Processing
Authors:
Petya
Osenova
St. Kliment Ohridski University of Sofia
Pages:
205-
216
DOI: https://doi.org/10.54664/NZIM1170
Abstract:
The paper focuses on the specificities of the written colloquial speech in tweets as a challenge for the automatic linguistic analysis. Such an analysis includes: text segmentation into words; morphological analysis in parts-of-speech and related grammatical characteristics; dependency syntactic analysis; named entity recognition of people, locations and organizations; handling abbreviations. The problems are of the following kinds: out-of-vocabulary words; word blending; colloquial variants that have not been normalized, etc. The survey explores 630 tweets that discuss the crisis of two banks in Bulgaria in 2014
Keywords:
tweet, Bulgarian language, automatic linguistic processing; written colloquial speech
Download
674 downloads since 12.10.2020 г.
NA