ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Influence of Event Duration on Automatic Wheeze Classification

Bruno M Rocha, Diogo Pessoa, Alda Marques, Paulo Carvalho, Rui Pedro Paiva

Auto-TLDR; Experimental Design of the Non-wheeze Class for Wheeze Classification

Abstract Slides Poster

Patients with respiratory conditions typically exhibit adventitious respiratory sounds, such as wheezes. Wheeze events have variable duration. In this work we studied the influence of event duration on wheeze classification, namely how the creation of the non-wheeze class affected the classifiers' performance. First, we evaluated several classifiers on an open access respiratory sound database, with the best one reaching sensitivity and specificity values of 98% and 95%, respectively. Then, by changing one parameter in the design of the non-wheeze class, i.e., event duration, the best classifier only reached sensitivity and specificity values of 53% and 75%, respectively. These results demonstrate the importance of experimental design on the assessment of wheeze classification algorithms' performance.

Similar papers

Which are the factors affecting the performance of audio surveillance systems?

Antonio Greco, Antonio Roberto, Alessia Saggese, Mario Vento

Auto-TLDR; Sound Event Recognition Using Convolutional Neural Networks and Visual Representations on MIVIA Audio Events

Abstract Slides Similar

Sound event recognition systems are rapidly becoming part of our life, since they can be profitably used in several vertical markets, ranging from audio security applications to scene classification and multi-modal analysis in social robotics. In the last years, a not negligible part of the scientific community started to apply Convolutional Neural Networks (CNNs) to image-based representations of the audio stream, due to their successful adoption in almost all the computer vision tasks. In this paper, we carry out a detailed benchmark of various widely used CNN architectures and visual representations on a popular dataset, namely the MIVIA Audio Events database. Our analysis is aimed at understanding how these factors affect the sound event recognition performance with a particular focus on the false positive rate, very relevant in audio surveillance solutions. In fact, although most of the proposed solutions achieve a high recognition rate, the capability of distinguishing the events-of-interest from the background is often not yet sufficient for real systems, and prevent its usage in real applications. Our comprehensive experimental analysis investigates this aspect and allows to identify useful design guidelines for increasing the specificity of sound event recognition systems.

ESResNet: Environmental Sound Classification Based on Visual Domain Models

Andrey Guzhov, Federico Raue, Jörn Hees, Andreas Dengel

Auto-TLDR; Environmental Sound Classification with Short-Time Fourier Transform Spectrograms

Influence of Event Duration on Automatic Wheeze Classification

Similar papers

Which are the factors affecting the performance of audio surveillance systems?

ESResNet: Environmental Sound Classification Based on Visual Domain Models

Ballroom Dance Recognition from Audio Recordings

Inception Based Deep Learning Architecture for Tuberculosis Screening of Chest X-Rays

One-Shot Learning for Acoustic Identification of Bird Species in Non-Stationary Environments

Electroencephalography Signal Processing Based on Textural Features for Monitoring the Driver’s State by a Brain-Computer Interface

Epileptic Seizure Prediction: A Semi-Dilated Convolutional Neural Network Architecture

Multi-Scale and Attention Based ResNet for Heartbeat Classification

Automatic Tuberculosis Detection Using Chest X-Ray Analysis with Position Enhanced Structural Information

A Comparison of Neural Network Approaches for Melanoma Classification

Hybrid Network for End-To-End Text-Independent Speaker Identification

Using Machine Learning to Refer Patients with Chronic Kidney Disease to Secondary Care

Video Analytics Gait Trend Measurement for Fall Prevention and Health Monitoring

Audio-Based Near-Duplicate Video Retrieval with Audio Similarity Learning

The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy

Detection of Calls from Smart Speaker Devices

Automatic Classification of Human Granulosa Cells in Assisted Reproductive Technology Using Vibrational Spectroscopy Imaging

DenseRecognition of Spoken Languages

Toward Text-Independent Cross-Lingual Speaker Recognition Using English-Mandarin-Taiwanese Dataset

Feature Engineering and Stacked Echo State Networks for Musical Onset Detection

A Low-Complexity R-Peak Detection Algorithm with Adaptive Thresholding for Wearable Devices

Unsupervised Detection of Pulmonary Opacities for Computer-Aided Diagnosis of COVID-19 on CT Images

EasiECG: A Novel Inter-Patient Arrhythmia Classification Method Using ECG Waves

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

Dealing with Scarce Labelled Data: Semi-Supervised Deep Learning with Mix Match for Covid-19 Detection Using Chest X-Ray Images

A Deep Learning Approach for the Segmentation of Myocardial Diseases

Fully Convolutional Neural Networks for Raw Eye Tracking Data Segmentation, Generation, and Reconstruction

Deep Learning on Active Sonar Data Using Bayesian Optimization for Hyperparameter Tuning

Digit Recognition Applied to Reconstructed Audio Signals Using Deep Learning

Confidence Calibration for Deep Renal Biopsy Immunofluorescence Image Classification

Atmospheric Blocking Pattern Recognition in Global Climate Model Simulation Data

Fine-Tuning Convolutional Neural Networks: A Comprehensive Guide and Benchmark Analysis for Glaucoma Screening

Exploring Seismocardiogram Biometrics with Wavelet Transform

Deep Transfer Learning for Alzheimer’s Disease Detection

The Application of Capsule Neural Network Based CNN for Speech Emotion Recognition

Fall Detection by Human Pose Estimation and Kinematic Theory

Exploring Spatial-Temporal Representations for fNIRS-based Intimacy Detection via an Attention-enhanced Cascade Convolutional Recurrent Neural Network

Feasibility Study of Using MyoBand for Learning Electronic Keyboard

A Lumen Segmentation Method in Ureteroscopy Images Based on a Deep Residual U-Net Architecture

Adversarially Training for Audio Classifiers

S2I-Bird: Sound-To-Image Generation of Bird Species Using Generative Adversarial Networks

A Systematic Investigation on Deep Architectures for Automatic Skin Lesions Classification

Mood Detection Analyzing Lyrics and Audio Signal Based on Deep Learning Architectures

Anticipating Activity from Multimodal Signals

Deep Learning Based Sepsis Intervention: The Modelling and Prediction of Severe Sepsis Onset

A Novel Computer-Aided Diagnostic System for Early Assessment of Hepatocellular Carcinoma

Location Prediction in Real Homes of Older Adults based on K-Means in Low-Resolution Depth Videos

Memetic Evolution of Training Sets with Adaptive Radial Basis Kernels for Support Vector Machines