Intelligent method for moving object identification based on feature clustering in the mel-frequency cepstral domain

Authors

DOI:

https://doi.org/10.34185/1562-9945-4-165-2026-11

Keywords:

moving object identification, mel-frequency cepstral coefficients, clustering, monitoring automation, acoustic signal, digital signal processing, industrial automation, motion phase segmentation

Abstract

The implementation of intelligent monitoring systems in modern industrial automation and advanced transportation networks requires the development of highly reliable methods for moving object identification. The relevance of this study is driven by the necessity to create systems capable of stable operation under conditions of intense acoustic noise, limited optical visibility, and dynamic environmental parameters. Traditional amplitude-based sound processing methods prove to be ineffective due to their high sensitivity to signal non-stationarity and the Doppler effect, which prevents accurate segmentation of an object’s dynamic states as the distance to the sensor changes.

The objective of this work is to develop and scientifically substantiate a method for identifying motion phases based on the complex application of Mel-frequency cepstral analysis and intelligent clustering. The proposed approach is based on the synthesis of Short-Time Fourier Transform) algorithms using the Hamming window to minimize spectral leakage, psychoacoustic filtering based on Mel-scale filters, and Discrete Cosine Transform for feature disintegration and decorrelation in the cepstral domain.

The research methodology is based on converting a non-stationary acoustic signal into a time-frequency representation followed by the calculation of Mel-Frequency Cepstral Coefficients. This allowed for the formation of a multidimensional feature matrix that acts as a unique "digital fingerprint" of the object. A significant advantage of this method is its invariance to amplitude (loudness) fluctuations, achieved through logarithmic energy scaling within each critical frequency band. For automatic state recognition, an unsupervised machine learning method – K-means clustering – was applied, allowing for the grouping of feature vectors into compact centroids corresponding to specific motion phases.

The scientific novelty of the results lies in the improvement of the dynamic state segmentation process for moving objects by transitioning from time-domain characteristic analysis to analysis within the cepstral feature space. It has been experimentally confirmed that distinct differentiation is possible between the phases of approach, transit (point of closest proximity), and departure of the object. The results demonstrate high computational stability and speed, ensuring the possibility of integrating the method into local microprocessor-based automated control systems, PLCs, and real-time technical diagnostic systems. The practical value of the work lies in the potential to create autonomous acoustic detectors for monitoring industrial equipment and traffic flows without the need for expensive video surveillance systems.

References

Polyvoda, O., Rudakova, H., Kondratieva, I., Rozov, Y., & Lebedenko, Y. (2020). Digital Acoustic Signal Processing Methods for Diagnosing Electromechanical Systems. In V. Lyt-vynenko, S. Babichev, W. Wójcik, O. Vynokurova, S. Vyshemyrskaya, & S. Radetskaya (Eds.), Lecture Notes in Computational Intelligence and Decision Making. ISDMCI 2019. Advances in Intelligent Systems and Computing (Vol. 1020, pp. 97–109). Springer, Cham. https://doi.org/10.1007/978-3-030-26474-1_7

Bondarenko, M. E., & Ivashchenko, H. S. (2025). Vykorystannia poslidovnosti metodiv poperednoi obrobky v systemakh holosovoi identyfikatsii [Using a sequence of preprocessing methods in voice identification systems]. Systems of Control, Navigation and Communica-tion. Collection of Scientific Works, (2), 90–96. https://doi.org/10.26906/SUNZ.2025.2.090

Rudresh, M. D., Latha, A. S., Suganya, J., & Nayana, C. G. (2017). Performance analysis of speech digit recognition using cepstrum and vector quantization. 2017 International Con-ference on Electrical, Electronics, Communication, Computer, and Optimization Techniques (ICEECCOT), 1–6. https://doi.org/10.1109/ICEECCOT.2017.8284580

Savula, A. A., & Korotynskyi, A. P. (2024). Zastosuvannia zvukovykh oznak dlia moni-torynhu tekhniko-ekspluatatsiinoho stanu promyslovoho obladnannia [Application of sound features for monitoring the technical and operational state of industrial equipment]. Scientific Notes of Taurida National V.I. Vernadsky University. Series: Technical Sciences, 35(74), 6(2), pp 215 – 220. https://doi.org/10.32782/2663-5941/2024.6.2/30

Alkhatib, B., & Kamal Eddin, M. M. W. (2020). Voice Identification Using MFCC and Vector Quantization. Baghdad Science Journal, 17(3 Suppl.), 1019–1026. https://doi.org/10.21123/bsj.2020.17.3(Suppl.).1019

Abdul, Z., & Al-Talabani, A. (2022). Mel Frequency Cepstral Coefficient and its Applica-tions: A Review. IEEE Access, 10, 122136–122158. https://doi.org/10.1109/ACCESS.2022.3223444

Published

2026-04-30