Deep Learning-Based Automated Classroom Slide Extraction

Abstract

Automated extraction of valuable content from real-time classroom lectures holds significant potential for enhancing educational accessibility and efficiency. However, capturing the spontaneous insights of live lectures is challenging because of rapid visual transitions, instructor movement, and diverse learning styles. This paper presents a novel approach that combines YOLO (You Only Look Once) with the Scale-Invariant Feature Transform (SIFT) to automatically extract slides from live classroom lectures. YOLO, a real-time object detection algorithm, is employed to identify the board area, the teacher, and other objects within the video stream. SIFT, a robust feature-based method, is used to accurately merge key points from multiple pictures of the same region. The proposed method is a multi-stage process. First, YOLO detects the likely position of the teacher, who occludes the board within the video frames. The teacher is then removed from the image. Finally, the board is divided into multiple segments, and SIFT is employed to remove and merge redundant content. Experimental results on a diverse dataset of classroom lecture videos demonstrate the effectiveness of the proposed method in extracting slides across different environments, lecture styles, and recording conditions. The potential benefits include improved note-taking, reduced manual effort in content curation, and enhanced accessibility to lecture materials. The presented approach contributes to the broader goal of leveraging computer vision and machine learning techniques to transform traditional classroom settings into modern, interactive, and adaptive learning environments.
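The abstract does not give implementation details, so the following is only a minimal sketch of the teacher-removal and segmentation stages under stated assumptions. It assumes a detector such as YOLO has already produced one teacher bounding box per frame (or `None` when no teacher is detected), divides the board into a grid, and picks for each segment the most recent frame in which the teacher does not cover it. The names `split_board`, `overlaps`, and `pick_source_frames` are hypothetical, and a plain box-overlap test stands in for the SIFT-based merging, which is not reproduced here.

```python
from typing import Dict, List, Optional, Tuple

# (x1, y1, x2, y2) in pixels; an assumed box format, not taken from the paper.
Box = Tuple[int, int, int, int]


def split_board(board: Box, rows: int, cols: int) -> List[Box]:
    """Divide the board rectangle into a rows x cols grid of segments."""
    x1, y1, x2, y2 = board
    w, h = x2 - x1, y2 - y1
    segments = []
    for r in range(rows):
        for c in range(cols):
            segments.append((
                x1 + c * w // cols,
                y1 + r * h // rows,
                x1 + (c + 1) * w // cols,
                y1 + (r + 1) * h // rows,
            ))
    return segments


def overlaps(a: Box, b: Box) -> bool:
    """Return True when the two boxes share a region of positive area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def pick_source_frames(
    segments: List[Box],
    teacher_boxes: List[Optional[Box]],
) -> Dict[int, Optional[int]]:
    """Map each segment index to the latest frame where the teacher
    does not occlude it, or to None if it is occluded in every frame."""
    chosen: Dict[int, Optional[int]] = {}
    for i, seg in enumerate(segments):
        chosen[i] = None
        # Walk backwards so the most recent clean view wins.
        for f in range(len(teacher_boxes) - 1, -1, -1):
            box = teacher_boxes[f]
            if box is None or not overlaps(seg, box):
                chosen[i] = f
                break
    return chosen


# Hypothetical detections for three frames of a 400 x 200 board.
segments = split_board((0, 0, 400, 200), rows=1, cols=4)
teacher_boxes = [(50, 20, 150, 200), (250, 20, 350, 200), (120, 20, 180, 200)]
sources = pick_source_frames(segments, teacher_boxes)
print(sources)
```

In a fuller pipeline, the chosen frame crops would then be aligned and de-duplicated with a feature matcher such as SIFT before being stitched into a single slide image.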

Authors and Affiliations

Zeeshan Azhar, Hassan Nazeer Chaudhry, Farzana Kulsoom, Sanam Narejo

  • EP ID EP760316

How To Cite

Zeeshan Azhar, Hassan Nazeer Chaudhry, Farzana Kulsoom, Sanam Narejo (2024). Deep Learning-Based Automated Classroom Slide Extraction. International Journal of Innovations in Science and Technology, 6(2), -. https://europub.co.uk/articles/-A-760316