An audio enhancement system to improve intelligibility for social-awareness in HRI
dc.contributor.author | Martínez-Colón, Antonio | |
dc.contributor.author | Viciana-Abad, Raquel | |
dc.contributor.author | Pérez-Lorenzo, José Manuel | |
dc.contributor.author | Evers, Christine | |
dc.contributor.author | Naylor, Patrick A. | |
dc.date.accessioned | 2025-01-27T18:07:48Z | |
dc.date.available | 2025-01-27T18:07:48Z | |
dc.date.issued | 2021-08-28 | |
dc.description.abstract | Improving the ability to interact through voice with a robot is still a challenge especially in real environments where multiple speakers coexist. This work has evaluated a proposal based on improving the intelligibility of the voice information that feeds an existing ASR service in the network and in conditions similar to those that could occur in a care centre for the elderly. The results indicate the feasibility and improvement of a proposal based on the use of an embedded microphone array and the use of a simple beamforming and masking technique. The system has been evaluated with 12 people and results obtained for time responsiveness indicate that the system would allow natural interaction with voice. It is shown to be necessary to incorporate a system to properly employ the masking algorithm, through the intelligent and stable estimation of the interfering signals. In addition, this approach allows to fix as sources of interest other speakers not located in the vicinity of the robot. | es_ES |
dc.description.sponsorship | This work has been funded by the National Research Project TEST-RTI2018-099522-A-C44: “Test-beds for the Evaluation of Social Awareness in Assistance Robotics” and thanks to the collaboration with CSP group at Imperial College London, funded by the Spanish Ministry of Science, Innovation and University through the lectures mobility program (Jose Castillejo’s 2018 grant). Most of the information about the typical life in a retirement house and Felipe’s robot name have been gathered from the experiences during the work developed in Vitalia Teatinos and supported by the Regional Project AT17-5509-UMA ’ROSI’. Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. | es_ES |
dc.identifier.citation | Martínez-Colón, A., Viciana-Abad, R., Perez-Lorenzo, J.M. et al. An audio enhancement system to improve intelligibility for social-awareness in HRI. Multimed Tools Appl 81, 3327–3350 (2022). https://doi.org/10.1007/s11042-021-11291-3 | es_ES |
dc.identifier.issn | 1380-7501 | es_ES |
dc.identifier.other | 10.1007/s11042-021-11291-3 | es_ES |
dc.identifier.uri | https://link.springer.com/article/10.1007/s11042-021-11291-3 | es_ES |
dc.identifier.uri | https://hdl.handle.net/10953/4434 | |
dc.language.iso | eng | es_ES |
dc.publisher | Springer | es_ES |
dc.relation.ispartof | Multimedia Tools and Applications | es_ES |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es_ES |
dc.subject | Beamforming | es_ES |
dc.subject | ASR | es_ES |
dc.subject | Array | es_ES |
dc.subject | Masking | es_ES |
dc.subject | Intelligibility | es_ES |
dc.title | An audio enhancement system to improve intelligibility for social-awareness in HRI | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.type.version | info:eu-repo/semantics/acceptedVersion | es_ES |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- s11042-021-11291-3.pdf
- Tamaño:
- 1.19 MB
- Formato:
- Adobe Portable Document Format
- Descripción:
Bloque de licencias
1 - 1 de 1
No hay miniatura disponible
- Nombre:
- license.txt
- Tamaño:
- 1.98 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción: