RUJA: Repositorio Institucional de Producción Científica

 

Attentional Mechanism Based on a Microphone Array for Embedded Devices and a Single Camera

dc.contributor.author: Martínez-Colón, Antonio
dc.contributor.author: Pérez-Lorenzo, José Manuel
dc.contributor.author: Rivas, Fernando
dc.contributor.author: Viciana-Abad, Raquel
dc.contributor.author: Reche-López, Pedro
dc.date.accessioned: 2025-01-30T07:24:15Z
dc.date.available: 2025-01-30T07:24:15Z
dc.date.issued: 2018-11-21
dc.description.abstract: This work presents an attentional mechanism that detects the location of a speaker for interaction purposes, based on audio and video information. The location is computed in terms of azimuth and elevation angles, which serve as input values for controlling mobile systems such as a pan-tilt video camera or a robotic head. For this purpose, the SRP-PHAT algorithm has been implemented with a commercial microphone array for embedded devices in order to estimate the location of a sound source in the surroundings of the array. To mitigate the limitations of the SRP-PHAT algorithm in estimating the z coordinate, the elevation angle is corrected with video information by using Haar cascade classifiers for face detection. Simulations and experiments show the accuracy of the system, as well as its application to controlling a pan-tilt video camera in a real scenario with speakers and ambient noise. (es_ES)
dc.identifier.citation: Martinez-Colon, A., Perez-Lorenzo, J.M., Rivas, F., Viciana-Abad, R., Reche-Lopez, P. (2019). Attentional Mechanism Based on a Microphone Array for Embedded Devices and a Single Camera. In: Fuentetaja Pizán, R., García Olaya, Á., Sesmero Lorente, M., Iglesias Martínez, J., Ledezma Espino, A. (eds) Advances in Physical Agents. WAF 2018. Advances in Intelligent Systems and Computing, vol 855. Springer, Cham. https://doi.org/10.1007/978-3-319-99885-5_12 (es_ES)
dc.identifier.uri: https://link.springer.com/chapter/10.1007/978-3-319-99885-5_12 (es_ES)
dc.identifier.uri: https://hdl.handle.net/10953/4559
dc.language.iso: eng (es_ES)
dc.publisher: Springer (es_ES)
dc.relation.ispartof: Workshop of Physical Agents (WAF 2018) (es_ES)
dc.rights.accessRights: info:eu-repo/semantics/openAccess (es_ES)
dc.subject: Attentional mechanism (es_ES)
dc.subject: Audio source localization (es_ES)
dc.subject: Microphone array (es_ES)
dc.subject: Face detector (es_ES)
dc.title: Attentional Mechanism Based on a Microphone Array for Embedded Devices and a Single Camera (es_ES)
dc.type: info:eu-repo/semantics/conferenceObject (es_ES)
dc.type.version: info:eu-repo/semantics/acceptedVersion (es_ES)

Files

Original bundle

Name: MColon_WAF2018_cameraready.pdf
Size: 2.2 MB
Format: Adobe Portable Document Format
Description:

License bundle

Name: license.txt
Size: 1.98 KB
Format: Item-specific license agreed upon to submission
Description: