Sound-based multiple-equipment activity recognition using convolutional neural networks

Document Type

Article

Publication Title

Automation in Construction

Publication Date

3-2022

Abstract/Summary

Automatically recognizing the activities of heavy construction equipment from sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they focus only on tracking the activities of a single piece of equipment. On construction job sites, sound signals from multiple pieces of equipment are mixed in the environment; thus, there is a need for a robust method to recognize activities that take place simultaneously. To address this shortcoming, we proposed a multi-label multi-level sound classification method based on Short-Time Fourier Transform (STFT) and Convolutional Neural Network (CNN) that requires only a single-channel off-the-shelf microphone. In addition, we developed a data augmentation method to simulate real-world equipment sound mixtures. We tested the proposed method on both synthetic and real-world equipment sound mixtures. The results showed that the method was effective in identifying the activities of multiple pieces of equipment on real construction job sites without the need to separate sound signals in advance. Future studies can focus on other potential applications of sound signal processing in the construction domain, including analyzing engine abnormalities and monitoring the environmental performance of equipment.
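
The abstract does not give implementation details, so the following is only a minimal sketch of the general idea: single-equipment clips are summed into a synthetic mixture, the mixture is converted to an STFT magnitude spectrogram, and a multi-hot label vector marks which classes are present. Everything specific here is an assumption made for illustration: the function names `mix_clips` and `stft_magnitude`, the gain range, the frame and hop sizes, the four-class label space, and the sine tones standing in for recordings. The CNN itself is omitted; in a typical multi-label setup (again an assumption, not the paper's stated design) the spectrogram would feed a network with one sigmoid output per class.

```python
import numpy as np

def stft_magnitude(signal, frame_len=256, hop=128):
    """Magnitude spectrogram from a Hann-windowed short-time Fourier transform."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    # rfft over each frame, then transpose to (freq_bins, n_frames)
    return np.abs(np.fft.rfft(frames, axis=1)).T

def mix_clips(clips, labels, n_classes, rng):
    """Sum clips with random gains (assumed range) and build a multi-hot label."""
    gains = rng.uniform(0.5, 1.0, size=len(clips))
    mixture = sum(g * c for g, c in zip(gains, clips))
    mixture = mixture / (np.max(np.abs(mixture)) + 1e-12)  # peak-normalize
    target = np.zeros(n_classes, dtype=np.float32)
    target[list(labels)] = 1.0
    return mixture, target

rng = np.random.default_rng(0)
sr = 8000                      # assumed sample rate in Hz
t = np.arange(sr) / sr         # one second of audio
# Hypothetical stand-ins for two single-equipment recordings
clip_a = np.sin(2 * np.pi * 200 * t)
clip_b = np.sin(2 * np.pi * 1200 * t)

mixture, target = mix_clips([clip_a, clip_b], labels=[0, 2], n_classes=4, rng=rng)
spec = stft_magnitude(mixture)  # input a CNN classifier could consume
```

Because the label is multi-hot rather than one-hot, a classifier trained on such pairs can report several active classes for one mixture without a prior source-separation step, which is the property the abstract emphasizes.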
