AMBERT-DWPM: An Adaptive Masking and Dynamic Prototype Learning Framework for Few-Shot Text Classification
Journal Title: International Journal of Knowledge and Innovation Studies - Year 2025, Vol 3, Issue 1
Abstract
Transformer-based language models have demonstrated remarkable success in few-shot text classification; however, their effectiveness is often constrained by challenges such as high intraclass diversity and interclass similarity, which hinder the extraction of discriminative features. To address these limitations, a novel framework, Adaptive Masking Bidirectional Encoder Representations from Transformers with Dynamic Weighted Prototype Module (AMBERT-DWPM), is introduced, incorporating adaptive masking and dynamic weighted prototypical learning to enhance feature representation and classification performance. The standard BERT architecture is refined by integrating an adaptive masking mechanism based on Layered Integrated Gradients (LIG), enabling the model to dynamically emphasize salient text segments and improve feature discrimination. Additionally, a DWPM is designed to assign adaptive weights to support samples, mitigating inaccuracies in prototype construction caused by intraclass variability. Extensive evaluations conducted on six publicly available benchmark datasets demonstrate the superiority of AMBERT-DWPM over existing few-shot classification approaches. Notably, under the 5-shot setting on the DBpedia14 dataset, an accuracy of 0.978±0.004 is achieved, highlighting significant advancements in feature discrimination and generalization capabilities. These findings suggest that AMBERT-DWPM provides an efficient and robust solution for few-shot text classification, particularly in scenarios characterized by limited and complex textual data.
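The idea of weighting support samples when building a class prototype can be illustrated with a minimal sketch. The abstract does not specify how DWPM computes its weights, so the scheme below is purely an assumption made for illustration: each support embedding is weighted by a softmax over its cosine similarity to the unweighted class mean. The function names (`weighted_prototype`, `classify`), the `temperature` parameter, and the toy 2-dimensional embeddings are all hypothetical; in practice the embeddings would come from the encoder.

```python
import math

def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0

def softmax(scores, temperature=1.0):
    """Numerically stable softmax with a temperature."""
    scaled = [s / temperature for s in scores]
    top = max(scaled)
    exps = [math.exp(s - top) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def weighted_prototype(support, temperature=0.1):
    """Assumed weighting scheme (not taken from the paper): samples closer
    to the plain class mean receive larger weights, so atypical support
    samples contribute less to the prototype."""
    centre = mean_vector(support)
    weights = softmax([cosine(x, centre) for x in support], temperature)
    dims = len(centre)
    proto = [sum(w * x[d] for w, x in zip(weights, support)) for d in range(dims)]
    return proto, weights

def classify(query, prototypes):
    """Assign the label of the nearest prototype (squared Euclidean distance)."""
    def sq_dist(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))
    return min(prototypes, key=lambda label: sq_dist(query, prototypes[label]))

# Toy 3-shot episode; the third "sports" sample is deliberately atypical.
sports_support = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
politics_support = [[0.0, 1.0], [0.1, 0.9], [0.2, 0.8]]

sports_proto, sports_weights = weighted_prototype(sports_support)
politics_proto, politics_weights = weighted_prototype(politics_support)

prototypes = {"sports": sports_proto, "politics": politics_proto}
predicted = classify([0.8, 0.2], prototypes)
```

In this toy episode the atypical third sample receives the smallest weight, so the "sports" prototype stays close to the two typical samples instead of being pulled toward the other class, as it would be under a plain mean.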
Authors and Affiliations
Junyu Li, Jialin Ma, Ashim Khadka