Feature-based Model for Extraction and Classification of High Quality Questions in Online Forum
Journal Title: Journal of Advances in Mathematics and Computer Science - Year 2017, Vol 22, Issue 1
Abstract
Aims: To design and implement a classification-based model using specific features for identification and extraction of high quality questions in a thread. Study Design: The study design is divided into three modules: preprocessing, configuration, and question classification Place and Duration of Study: Department of Computer Science of the Federal University of Technology Akure, between June 2016 and December 2016 Methodology: This research proposes a way of identifying, extracting and classifying questions in order to enhance high quality answers in an online forum. One of the major issues in question extraction and classification in forum is the restriction on the number of categories considered such as Who, What, Where, Where, Which, Why and How which are not sufficient to capture all possible questions. In this work, a number of parameters were proposed and aggregated using fuzzy logic for context based spam detection and removal in order to enhance question identification and classification. Part of speech (POS) tagging was applied to analyse the structure of each extracted sentence based on the presence and position of predefined question tags; with this, issues like case sensitivity, grammatical construction and synonyms are addressed. Question classification is carried out with Naïve Bayes and identifying semantic relationship between extracted questions is achieved with cosine similarity model. Experiments were performed on dataset constructed from Research Gate website. Results: We presented questions extracted from researchgate website into the system. The output consists of the corresponding POS tags and the category the question is classified into. The number of questions extracted from the website is dependent on the number of questions available in a forum. We were able to achieve a successful result of 3015 correctly extracted and classified questions at 80% POS tag occurrence. Conclusion: Our approach to question identification and classification was effective and covers more question categories. This can be applied to any question answering system.
Authors and Affiliations
Bolanle Ojokoh, Tobore Igbe, Ayobami Araoye
General Version of Gauss-type Proximal Point Method and Its Uniform Convergence Analysis for Metrically Regular Mappings
We study the uniform convergence of the general version of Gauss-type proximal point algorithm (GG-PPA), introduced by Alom et al. [1], for solving the parametric generalized equations y ∈ T(x), where T : X 2Y is a set...
A Comparative Analysis of Jellyfish Attacks and Black Hole Attack with Selfish Behavior Attack under AODV Routing Protocol
The applications of mobile adhoc network (MANET) are increasing day-by-day due to the flexibility they provide to seamless communication. However MANETS are vulnerable to number of attacks because of properties like non-...
Magnetic Curves According to Bishop Frame and Type-2 Bishop Frame in Euclidean 3-Space
In this paper, we dene the notions of T-magnetic, N1-magnetic, N2-magnetic curves according to Bishop frame and 1-magnetic, 2-magnetic, B-magnetic curves according to type-2 Bishop frame in Euclidean 3-space. Also, we...
Design of DNA Based Biometric Security System for Examination Conduct
Biometrics is a technique of using characteristics and behavioral traits for identification of people this is more effective that common personal identification number(PIN) for an improved security technique. The basic p...
Variable Viscosity and Thermal Conductivity Effect of Soret and Dufour on Inclined Magnetic Field in Non-Darcy Permeable Medium with Dissipation
The analysis of thermal-diffusion (Soret) and diffusion-thermo (Dufour) effects on variable thermal conductivity and viscosity in a dissipative heat and mass transfer of an inclined magnetic field in a permeable medium p...