loading page

Learning Optimal Microphone Location for Enhanced ASR Performance using Limited Data
  • Ritujoy Biswas ,
  • Karan Nathwani ,
  • Sunil Kopparapu
Ritujoy Biswas
Indian Institute of Technology Jammu

Corresponding Author:[email protected]

Author Profile
Karan Nathwani
Author Profile
Sunil Kopparapu
Author Profile

Abstract

The placement of a microphone at its correct position is crucial for automatic speech recognition (ASR) in a single-microphone distant speech recognition setup. Understandably, it is a practical problem that the performance degrades due to accidental displacement of the microphone from the correct position through routine use and maintenance. In this paper, an approach is formulated which makes it possible to determine the optimal position of the microphone, given the geometry of the room, the location of the speaker, and the inexact location of the microphone. The metric used to measure the correctness of the microphone position is the average character error rate (cer) of the ASR system at that position. The proposed approach uses limited available data which is usually the case when dealing with microphone position error analysis while considering room acoustics. A neural network classifier is trained using limited data to associate the location of the microphone with the corresponding class of cer values. The obtained prediction is used to trace the microphone back to the optimal position.