Please use this identifier to cite or link to this item:
https://hdl.handle.net/10953/3303
Title: | Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers |
Authors: | Daria Mang, Loredana González Martínez, Francisco David Martinez Muñoz, Damián García Galán, Sebastián Cortina, Raquel |
Abstract: | Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the respiratory system’s condition and identifying abnormalities. The main contribution of this study is to investigate the performance when the input data, represented by cochleogram, is used to feed the Vision Transformer (ViT) architecture, since this input–classifier combination is the first time it has been applied to adventitious sound classification to our knowledge. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by applying the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with other state-of-the-art CNN approaches using spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance combining cochleogram and ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to the ongoing efforts in developing automatic intelligent techniques with the aim to significantly augment the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field. |
Keywords: | classification adventitious sounds cochleogram vision transformers deep learning accuracy |
Issue Date: | 21-Jan-2024 |
metadata.dc.description.sponsorship: | This work was supported in part under grant PID2020-119082RB-{C21,C22} funded by MCIN/AEI/10.13039/501100011033, grant P18-RT-1994 funded by the Ministry of Economy, Knowledge and University, Junta de Andalucía, Spain. |
Publisher: | Multidisciplinary Digital Publishing Institute (MDPI) |
Citation: | Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers Mang, L.D. , González Martínez, F.D. , Martinez Muñoz, D. , García Galán, S. , Cortina, R. Sensors, 2024, 24(2), 682 |
Appears in Collections: | DIT-Artículos |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
sensors-24-00682-v2.pdf | Versión publicada del artículo | 1,28 MB | Adobe PDF | View/Open |
This item is protected by original copyright |
Items in RUJA are protected by copyright, with all rights reserved, unless otherwise indicated.