RUJA: Repositorio Institucional de Producción Científica

 

Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

dc.contributor.authorDaria Mang, Loredana
dc.contributor.authorGonzález Martínez, Francisco David
dc.contributor.authorMartinez Muñoz, Damián
dc.contributor.authorGarcía Galán, Sebastián
dc.contributor.authorCortina, Raquel
dc.date.accessioned2024-10-22T07:16:38Z
dc.date.available2024-10-22T07:16:38Z
dc.date.issued2024-01-21
dc.descriptionThis is the publish version of the following article: [Loredana Daria Mang, Francisco David González Martínez, Damian Martinez Muñoz, Sebastián García Galán and Raquel Cortina (2024). Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers. Sensors Volume 24 Issue 2], which has been published at [https://doi.org/10.3390/s24020682] This article may be used for non-commercial purposes in accordance with Sensors terms and conditions for use of self-archived versions. This article may not be enhanced, enriched or otherwise transformed into a derivative work, without express permission from Sensors or by statutory rights under applicable legislation. Copyright notices must not be removed, obscured or modified. The article must be linked to Sensors´s version of record on Sensors online library and any embedding, framing or otherwise making available the article or pages thereof by third parties from platforms, services and websites other than Sensors online library must be prohibited.es_ES
dc.description.abstractEarly identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the respiratory system’s condition and identifying abnormalities. The main contribution of this study is to investigate the performance when the input data, represented by cochleogram, is used to feed the Vision Transformer (ViT) architecture, since this input–classifier combination is the first time it has been applied to adventitious sound classification to our knowledge. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by applying the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with other state-of-the-art CNN approaches using spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance combining cochleogram and ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to the ongoing efforts in developing automatic intelligent techniques with the aim to significantly augment the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.es_ES
dc.description.sponsorshipThis work was supported in part under grant PID2020-119082RB-{C21,C22} funded by MCIN/AEI/10.13039/501100011033, grant P18-RT-1994 funded by the Ministry of Economy, Knowledge and University, Junta de Andalucía, Spain.es_ES
dc.identifier.citationClassification of Adventitious Sounds Combining Cochleogram and Vision Transformers Mang, L.D. , González Martínez, F.D. , Martinez Muñoz, D. , García Galán, S. , Cortina, R. Sensors, 2024, 24(2), 682es_ES
dc.identifier.issn1424-8220es_ES
dc.identifier.otherhttps://doi.org/10.3390/s24020682es_ES
dc.identifier.urihttps://www.mdpi.com/1424-8220/24/2/682es_ES
dc.identifier.urihttps://hdl.handle.net/10953/3303
dc.language.isoenges_ES
dc.publisherMultidisciplinary Digital Publishing Institute (MDPI)es_ES
dc.relation.ispartofSensors 2024, 24(2), 682es_ES
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses_ES
dc.subjectclassificationes_ES
dc.subjectadventitious soundses_ES
dc.subjectcochleogrames_ES
dc.subjectvision transformerses_ES
dc.subjectdeep learninges_ES
dc.subjectaccuracyes_ES
dc.titleClassification of Adventitious Sounds Combining Cochleogram and Vision Transformerses_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.type.versioninfo:eu-repo/semantics/publishedVersiones_ES

Archivos

Bloque original

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
sensors-24-00682-v2.pdf
Tamaño:
1.25 MB
Formato:
Adobe Portable Document Format
Descripción:
Versión publicada del artículo

Bloque de licencias

Mostrando 1 - 1 de 1
No hay miniatura disponible
Nombre:
license.txt
Tamaño:
1.98 KB
Formato:
Item-specific license agreed upon to submission
Descripción:

Colecciones