An Improved TextRank Keyword Extraction Method Based on the Watts-Strogatz Model
Journal Title: Information Dynamics and Applications - Year 2024, Vol 3, Issue 2
Abstract
Traditional methods for keyword extraction predominantly rely on statistical relationships between words, neglecting the cohesive structure of the extracted keyword set. This study introduces an enhanced method for keyword extraction, utilizing the Watts-Strogatz model to construct a word network graph from candidate words within the text. By leveraging the characteristics of small-world networks (SWNs), i.e., short average path lengths and high clustering coefficients, the method ascertains the relevance between words and their impact on sentence cohesion. A comprehensive weight for each word is calculated through a linear weighting of features including part of speech, position, and Term Frequency-Inverse Document Frequency (TF-IDF), subsequently improving the impact factors of the TextRank algorithm for obtaining the final weight of candidate words. This approach facilitates the extraction of keywords based on the final weight outcomes. Through uncovering the deep hidden structures of feature words, the method effectively reveals the connectivity within the word network graph. Experiments demonstrate superiority over existing methods in terms of precision, recall, and F1-measure.
Authors and Affiliations
Aofan Li, Lin Zhang, Ashim Khadka
Advancements in Image Recognition: A Siamese Network Approach
In the realm of computer vision, image recognition serves as a pivotal task with extensive applications in intelligent security, autonomous driving, and robotics. Traditional methodologies for image recognition often gra...
Enhancing Image Captioning and Auto-Tagging Through a FCLN with Faster R-CNN Integration
In the realm of automated image captioning, which entails generating descriptive text for images, the fusion of Natural Language Processing (NLP) and computer vision techniques is paramount. This study introduces the Ful...
Enhanced Method for Monitoring Internet Abnormal Traffic Based on the Improved BiLSTM Network Algorithm
The complexity and variability of Internet traffic data present significant challenges in feature extraction and selection, often resulting in ineffective abnormal traffic monitoring. To address these challenges, an impr...
An Optimized Algorithm for Peak to Average Power Ratio Reduction in Orthogonal Frequency Division Multiplexing Communication Systems: An Integrated Approach
The impact of the peak to Average Power Ratio (PAPR) on the efficiency of an Orthogonal Frequency Division Multiplexing (OFDM) communication system is significantly mitigated through an innovative Reconfigurable Integrat...
Comparative Analysis of Seizure Manifestations in Alzheimer’s and Glioma Patients via Magnetic Resonance Imaging
A notable association between Alzheimer's Disease and Epilepsy, two divergent neurological conditions, has been established through previous research, illustrating an elevated seizure development risk in individuals diag...