TechRxiv

ACOUSTIC SCENE ANALYSIS AND CLASSIFICATION USING DENSENET CONVOLUTIONAL NEURAL NETWORK

preprint
Posted on 12.05.2022, 17:07; authored by Samyak Doshi, Tushar Patidar, Shubhankar Gautam, Rajkishor Kumar

In this paper we present an account of the state of the art in Acoustic Scene Classification (ASC), the task of classifying environments by the sounds they produce. Our work aims to classify 50 different outdoor and indoor scenarios using environmental sounds. We use the ESC-50 dataset from the IEEE challenge on Detection and Classification of Acoustic Scenes and Events (DCASE), from which we use 2000 different environmental audio recordings. In our method, the raw audio data is converted into a Mel-spectrogram and other features such as Tonnetz, Chroma and MFCC. The generated Mel-spectrogram is fed as input to a neural network for training. Our model follows a convolutional neural network structure built from convolution and pooling layers. With a focus on real-time environmental classification, and to overcome the problem of low generalization in the model, we introduce data augmentation that produces noisy variants of the audio by adding Gaussian white noise. Research in the audio domain is active, and the field has seen considerable progress in recent years.
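
The Gaussian white noise augmentation mentioned in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give a noise level, so the function name `add_white_noise` and its `snr_db` parameter (a target signal-to-noise ratio in decibels) are hypothetical choices made here. The sketch uses only the Python standard library and plain lists of samples; a real pipeline would more likely operate on arrays loaded by an audio library.

```python
import math
import random


def add_white_noise(samples, snr_db, seed=None):
    """Return a copy of `samples` with zero-mean Gaussian white noise added.

    `snr_db` is an assumed parameter (not taken from the paper): the desired
    signal-to-noise ratio in dB, where SNR_dB = 10 * log10(P_signal / P_noise).
    """
    if not samples:
        return []
    rng = random.Random(seed)  # seeded generator so augmentation is reproducible
    # Mean power of the clean signal.
    signal_power = sum(x * x for x in samples) / len(samples)
    # Solve the SNR definition for the noise power, then take the
    # square root to get the standard deviation of the noise.
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [x + rng.gauss(0.0, sigma) for x in samples]


# Usage: a one-second 440 Hz sine tone sampled at 8 kHz stands in for a clip.
sample_rate = 8000
clean = [math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(sample_rate)]
noisy = add_white_noise(clean, snr_db=20, seed=0)
```

Expressing the noise level as an SNR rather than a fixed amplitude keeps the perturbation proportional to each recording's loudness, which is one plausible way to apply the same augmentation across quiet and loud clips. The augmented waveform would then go through the same Mel-spectrogram conversion as the original audio.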

Email Address of Submitting Author

samyak.2018@vitstudent.ac.in

ORCID of Submitting Author

0000-0003-1046-2527

Submitting Author's Institution

VIT University

Submitting Author's Country

India