TechRxiv
Microphone_Manuscript.pdf (1004.49 kB)
Download file

Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data

Download (1004.49 kB)
preprint
posted on 2022-07-07, 21:33 authored by Ritujoy BiswasRitujoy Biswas, Karan Nathwani, Sunil Kopparapu

The placement of a microphone at its correct position is crucial for automatic speech recognition (ASR) in a single-microphone distant speech recognition setup. Understandably, it is a practical problem that the performance degrades due to accidental displacement of the microphone from the correct position through routine use and maintenance. In this paper, an approach is formulated which makes it possible to determine the optimal position of the microphone, given the geometry of the room, the location of the speaker, and the inexact location of the microphone. The metric used to measure the correctness of the microphone position is the average character error rate (cer) of the ASR system at that position. The proposed approach uses limited available data which is usually the case when dealing with microphone position error analysis while considering room acoustics. A neural network classifier is trained using limited data to associate the location of the microphone with the corresponding class of cer values. The obtained prediction is used to trace the microphone back to the optimal position.

History

Email Address of Submitting Author

ritujoybiswas@gmail.com

ORCID of Submitting Author

0000-0003-4108-1217

Submitting Author's Institution

Indian Institute of Technology Jammu

Submitting Author's Country

  • India

Usage metrics

    Licence

    Exports