Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data
The placement of a microphone at its correct position is crucial for automatic speech recognition (ASR) in a single-microphone distant speech recognition setup. Understandably, it is a practical problem that the performance degrades due to accidental displacement of the microphone from the correct position through routine use and maintenance. In this paper, an approach is formulated which makes it possible to determine the optimal position of the microphone, given the geometry of the room, the location of the speaker, and the inexact location of the microphone. The metric used to measure the correctness of the microphone position is the average character error rate (cer) of the ASR system at that position. The proposed approach uses limited available data which is usually the case when dealing with microphone position error analysis while considering room acoustics. A neural network classifier is trained using limited data to associate the location of the microphone with the corresponding class of cer values. The obtained prediction is used to trace the microphone back to the optimal position.
Email Address of Submitting Authorritujoybiswas@gmail.com
ORCID of Submitting Author0000-0003-4108-1217
Submitting Author's InstitutionIndian Institute of Technology Jammu
Submitting Author's Country