Identification of Fake Contents Using Text-mining Techniques

Abstract

In recent years, social media users have become increasingly concerned about sharing content that may be unpleasant or harmful, and the widespread use of platforms such as Facebook and Twitter has contributed significantly to this awareness. The primary objective of our approach is to automate and speed up the detection of offensive content posted on these platforms, making it easier to filter harmful communications and take the necessary action. The task is supported by a publicly available benchmark dataset, OLID 2019 (Offensive Language Identification Dataset). Our study focuses on identifying whether a tweet is offensive. We compared several feature extraction methods and model-building algorithms, and the comparative analysis showed decision trees to be the most effective model. Applied to the normalized dataset, decision trees resulted in an 84% improvement in the Macro F1 score, which is consistent with previous research. In conclusion, a real-time system could be developed across multiple social media platforms to detect and evaluate objectionable posts, enabling timely interventions that promote healthier online behavior and a positive societal impact.
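The abstract reports results as a Macro F1 score. As a minimal sketch of how that metric is computed for an offensive-versus-not classification, the Python below averages the per-class F1 scores without weighting by class size. The function name `macro_f1`, the toy label lists, and the `OFF`/`NOT` label strings are illustrative assumptions; none of them is taken from the paper's code or data.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (hypothetical helper)."""
    labels = sorted(set(y_true) | set(y_pred))
    if not labels:
        return 0.0  # no examples, nothing to score

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0.
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy labels for illustration only (assumed label names, not real data).
y_true = ["OFF", "NOT", "NOT", "OFF", "NOT", "NOT"]
y_pred = ["OFF", "NOT", "OFF", "NOT", "NOT", "NOT"]
score = macro_f1(y_true, y_pred)
```

Because every class contributes equally to the average, Macro F1 penalizes a model that performs well only on the majority class, which matters when offensive tweets are the minority.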

Authors and Affiliations

Saqlain Sajjad, Hafiz Muhammad Ghazi, Muhammad Asgher Nadeem, Muhammad Irfan Habib, Muhammad Salman Saeed, Syed Ali Hasnain Naqvi, Zeeshan Ahmad Arfeen, Isheeaq Naeem, Muhammad Irfan

How To Cite

Saqlain Sajjad, Hafiz Muhammad Ghazi, Muhammad Asgher Nadeem, Muhammad Irfan Habib, Muhammad Salman Saeed, Syed Ali Hasnain Naqvi, Zeeshan Ahmad Arfeen, Isheeaq Naeem, Muhammad Irfan (2024). Identification of Fake Contents Using Text-mining Techniques. International Journal of Innovations in Science and Technology, 6(4), -. https://europub.co.uk/articles/-A-760590