Summarizers – Similarities, Differences and Application in the Lithuanian Language

Journal Title: Kalbu studijos / Studies about Languages - Year 2010, Vol 17, Issue 0

Abstract

A summarizer is a useful system that compresses a text and presents it in a shorter form. In Lithuania, summarizers are a new technological development and information about them is scarce, although such systems are created and developed by computational linguists in other countries. The research used English and Lithuanian texts, which made it possible to see differences among the output summaries. For the English texts, the summaries produced by all summarizers were quite informative and useful and did not miss the main information of the original texts, except when a high compression rate was used. When summarizing Lithuanian texts, some summarizers run into a language recognition problem: the output summaries in Lithuanian contain unrecognizable symbols or, in individual cases, some symbols are missing. Summaries in Lithuanian are not of the same quality as summaries in English. It was also noticed that Lithuanian summaries produced at a higher compression rate lose the meaning of the original text. To conclude, commercial summarizers give better summaries in English, whereas noncommercial summarizers give better summaries in Lithuanian. Furthermore, summaries produced at a high compression rate lose information and meaning. The analyzed summarizers are best suited to English, so using a summarizer for a language other than English causes various problems, from language recognition to rendering the meaning of the text. It is very important to adapt summarizers to different languages, as this would significantly improve them. The findings of this paper can be used in creating a Lithuanian summarizer or improving other summarizers.

Authors and Affiliations

Jurgita Lasytė, Bronius Tamulynas

Related Articles

Software Testing According to Internationalization Bugs

Bugs found in localized software may be classified into two categories: localization bugs and internationalization bugs. Localization bugs are made by the localizer and may be fixed by the localizer. Internationali...

Variants and Synonymy of Comparative Constructions in Scientific Style (Lyginamųjų konstrukcijų variantai ir sinonimika moksliniame stiliuje)

The article examines the variants and synonymy of comparative constructions in scientific style. A structural analysis of the composition of comparative constructions made it possible to distinguish three types of these constructions in scientific text, in which...

Prototype Theory and Semantics (Prototipų teorija ir semantika)

Knowing a language means knowing how to create and understand sentences that convey a certain meaning. For language to perform its communicative function, words or sentences (possibly also a text) must convey a cert...

IDUKM in Higher Education: Is There Anything to Offer? (IDUKM aukštajame moksle: ar yra ką pasiūlyti?)

and at the secondary stage of education. Researchers identify many advantages of this method, for example, increased learner motivation, a more favorable attitude toward language learning, and better learning of both the content (the non-language subject) and the language. Au...

Insight into the Latest Computer and Internet Terminology

Information technology is an exceptional field where new terms seem to be popping up every day. The rate of new words being added to the English language is about 15 words a day, whereas in the field of information techn...


How To Cite

Jurgita Lasytė, Bronius Tamulynas (2010). Summarizers – Similarities, Differences and Application in the Lithuanian Language. Kalbu studijos / Studies about Languages, 17(0), 46-52. https://europub.co.uk/articles/-A-134762