An Improved TextRank Keyword Extraction Method Based on the Watts-Strogatz Model

Journal Title: Information Dynamics and Applications - Year 2024, Vol 3, Issue 2

Abstract

Traditional methods for keyword extraction predominantly rely on statistical relationships between words, neglecting the cohesive structure of the extracted keyword set. This study introduces an enhanced method for keyword extraction, utilizing the Watts-Strogatz model to construct a word network graph from candidate words within the text. By leveraging the characteristics of small-world networks (SWNs), i.e., short average path lengths and high clustering coefficients, the method ascertains the relevance between words and their impact on sentence cohesion. A comprehensive weight for each word is calculated through a linear weighting of features including part of speech, position, and Term Frequency-Inverse Document Frequency (TF-IDF), subsequently improving the impact factors of the TextRank algorithm for obtaining the final weight of candidate words. This approach facilitates the extraction of keywords based on the final weight outcomes. Through uncovering the deep hidden structures of feature words, the method effectively reveals the connectivity within the word network graph. Experiments demonstrate superiority over existing methods in terms of precision, recall, and F1-measure.

Authors and Affiliations

Aofan Li, Lin Zhang, Ashim Khadka

Keywords

Related Articles

ECO-LEACH: A Blockchain-Based Distributed Routing Protocol for Energy-Efficient Wireless Sensor Networks

This paper proposes a novel architecture based on blockchain technology to enhance the dependability and safety of wireless sensor networks (WSN) by authenticating WSN nodes. In a WSN, sensor nodes collect and transmit d...

Enhanced Channel Estimation in Multiple-Input Multiple-Output Systems: A Dual Quadratic Decomposition Algorithm Approach for Interference Cancellation

In Multiple-Input Multiple-Output (MIMO) systems, a considerable number of antennas are deployed at each base station, utilizing Time-shifted pilot contamination strategies. It was observed that Time-shifted pilot contam...

Cryptocurrency Investigations in Digital Forensics: Contemporary Challenges and Methodological Advances

Digital forensics, a crucial subset of cybersecurity, encompasses sophisticated tools and methodologies for the interpretation, analysis, and investigation of digital evidence, facilitating the identification and mitigat...

MR Image Feature Analysis for Alzheimer’s Disease Detection Using Machine Learning Approaches

Alzheimer’s disease (AD), a progressive neurological disorder, predominantly impacts cognitive functions, manifesting as memory loss and deteriorating thinking abilities. Recognized as the primary form of dementia, this...

An Optimized Algorithm for Peak to Average Power Ratio Reduction in Orthogonal Frequency Division Multiplexing Communication Systems: An Integrated Approach

The impact of the peak to Average Power Ratio (PAPR) on the efficiency of an Orthogonal Frequency Division Multiplexing (OFDM) communication system is significantly mitigated through an innovative Reconfigurable Integrat...

Download PDF file
  • EP ID EP744287
  • DOI https://doi.org/10.56578/ida030201
  • Views 13
  • Downloads 1

How To Cite

Aofan Li, Lin Zhang, Ashim Khadka (2024). An Improved TextRank Keyword Extraction Method Based on the Watts-Strogatz Model. Information Dynamics and Applications, 3(2), -. https://europub.co.uk/articles/-A-744287