Sentiment analysis. An example of application and evaluation of RID dictionary and Bayesian classification methods in qualitative data analysis approach

Journal Title: PRZEGLĄD SOCJOLOGII JAKOŚCIOWEJ - Year 2014, Vol 10, Issue 2


The purpose of this article is to present basic methods for classifying text data. These methods draw on advances in areas such as natural language processing and the analysis of unstructured data. I introduce and compare two analytical techniques applied to text data. The first uses a thematic dictionary (sentiment analysis). The second is based on Bayesian classification and applies the so-called naive Bayes algorithm. The comparison evaluates how effective the two techniques are. I emphasize solutions for building a dictionary that is accurate for the task of text classification. I then compare the effectiveness of supervised classification with that of automated, unsupervised analysis. The results reinforce the conclusion that a dictionary that has been evaluated well as a classification tool should still be reviewed and modified if it is to be applied to new empirical material. In my proposed approach, the procedures for adapting the analytical dictionary become the basic step in the methodology of textual data analysis.
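The two techniques compared in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: the toy dictionary, the training sentences, and the names `SENTIMENT_DICT`, `dictionary_classify`, and `NaiveBayes` are all hypothetical, and the RID-based dictionary evaluated in the article is not reproduced here. The sketch assumes whitespace tokenization and a multinomial naive Bayes model with add-one (Laplace) smoothing.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy dictionary (assumption): word -> polarity score.
SENTIMENT_DICT = {
    "good": 1, "great": 1, "happy": 1,
    "bad": -1, "awful": -1, "sad": -1,
}


def tokenize(text):
    """Lowercase the text and split it on whitespace (a simplifying assumption)."""
    return text.lower().split()


def dictionary_classify(text, dictionary=SENTIMENT_DICT):
    """Dictionary approach: sum the polarity of every known word."""
    score = sum(dictionary.get(token, 0) for token in tokenize(text))
    if score > 0:
        return "pos"
    if score < 0:
        return "neg"
    return "unknown"  # no dictionary word matched, or the scores cancelled out


class NaiveBayes:
    """Multinomial naive Bayes with add-one smoothing (supervised approach)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_logp = None, -math.inf
        for label, n_docs in self.class_counts.items():
            # Start from the log prior, then add the log likelihood of each word.
            logp = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            for token in tokenize(text):
                if token not in self.vocab:
                    continue  # words never seen in training are ignored
                count = self.word_counts[label][token]
                logp += math.log((count + 1) / (total_words + len(self.vocab)))
            if logp > best_logp:
                best_label, best_logp = label, logp
        return best_label


# Hypothetical labelled sample; "boring" is deliberately absent from the dictionary.
train_texts = ["good great film", "happy good day", "boring awful film", "sad bad day"]
train_labels = ["pos", "pos", "neg", "neg"]
model = NaiveBayes().fit(train_texts, train_labels)

for sample in ["great day", "boring day"]:
    print(sample, dictionary_classify(sample), model.predict(sample))
```

In this toy setting both classifiers label "great day" as positive, but only the trained model labels "boring day" as negative, because "boring" is missing from the dictionary. This is one simple way to see why a dictionary may need review and modification before it is applied to new material.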

Authors and Affiliations

Krzysztof Tomanek


How To Cite

Krzysztof Tomanek (2014). Sentiment analysis. An example of application and evaluation of RID dictionary and Bayesian classification methods in qualitative data analysis approach. PRZEGLĄD SOCJOLOGII JAKOŚCIOWEJ, 10(2), 118-136.