An Advanced YOLOv5s Approach for Vehicle Detection Integrating Swin Transformer and SimAM in Dense Traffic Surveillance
Journal Title: Journal of Industrial Intelligence - Year 2024, Vol 2, Issue 1
Abstract
In the realm of high-definition surveillance for dense traffic environments, the accurate detection and classification of vehicles remain paramount challenges, often hindered by missed detections and inaccuracies in vehicle type identification. Addressing these issues, an enhanced version of the You Only Look Once version v5s (YOLOv5s) algorithm is presented, wherein the conventional network structure is optimally modified through the partial integration of the Swin Transformer V2. This innovative approach leverages the convolutional neural networks' (CNNs) proficiency in local feature extraction alongside the Swin Transformer V2's capability in global representation capture, thereby creating a symbiotic system for improved vehicle detection. Furthermore, the introduction of the Similarity-based Attention Module (SimAM) within the CNN framework plays a pivotal role, dynamically refocusing the feature map to accentuate local features critical for accurate detection. An empirical evaluation of this augmented YOLOv5s algorithm demonstrates a significant uplift in performance metrics, evidencing an average detection precision (mAP@0.5:0.95) of 65.7%. Specifically, in the domain of vehicle category identification, a notable increase in the true positive rate by 4.48% is observed, alongside a reduction in the false negative rate by 4.11%. The culmination of these enhancements through the integration of Swin Transformer and SimAM within the YOLOv5s framework marks a substantial advancement in the precision of vehicle type recognition and reduction of target miss detection in densely populated traffic flows. The methodology's success underscores the efficacy of this integrated approach in overcoming the prevalent limitations of existing vehicle detection algorithms under complex surveillance scenarios.
Authors and Affiliations
Yi Zhang, Zheng Sun
Enhanced Signal Processing Through FPGA-Based Digital Downconversion via the CORDIC Algorithm
To address the rate matching issue between high-bandwidth and high-sampling-rate analog-to-digital converters (ADCs) and low-bandwidth and low-sampling-rate baseband processors, the key technology of digital downconversi...
Lifetime Extension of Wireless Sensor Networks by Perceptive Selection of Cluster Head Using K-Means and Einstein Weighted Averaging Aggregation Operator under Uncertainty
In the realm of Wireless Sensor Networks (WSNs), energy efficiency emerges as a paramount concern due to the inherent limitations in the energy capacity of sensor nodes. The extension of network lifespan is critically de...
Computational Fluid Dynamics Evaluation of Nitrogen and Hydrogen for Enhanced Air Conditioning Efficiency
This study evaluates the potential of nitrogen and hydrogen as alternative working fluids in air conditioning systems to improve thermal comfort and optimize energy efficiency, using computational fluid dynamics (CFD) si...
Strategies for Enhancing Industry 4.0 Adoption in East Africa: An Integrated Spherical Fuzzy SWARA-WASPAS Approach
Developed countries have successfully implemented various Industry 4.0 (I4.0) initiatives, showcasing their ability to reap the benefits of this new industrial revolution. Active pursuit of excellence in Industry 4.0 is...
Evaluating Free Zone Industrial Plant Proposals Using a Combined Full Consistency Method-Grey-CoCoSo Model
Libya's strategic position at the crossroads of Europe and Africa offers access to abundant raw materials, labor, and extensive land for establishing free trade zones. The primary objective of this research is to determi...