TY - GEN
T1 - Classification of audio signals using statistical features on time and wavelet transform domains
AU - Lambrou, T.
AU - Kudumakis, P.
AU - Speller, R.
AU - Sandler, M.
AU - Linney, A.
N1 - Published in: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)
cited By 82
PY - 1998
Y1 - 1998
N2 - This paper presents a study on musical signal classification, using wavelet transform analysis in conjunction with statistical pattern recognition techniques. A comparative evaluation between different wavelet analysis architectures in terms of their classification ability, as well as between different classifiers is carried out. We seek to establish which statistical measures clearly distinguish between the three different musical styles of rock, piano, and jazz. Our preliminary results suggest that the features collected by the adaptive splitting wavelet transform technique performed better compared to the other wavelet based techniques, achieving an overall classification accuracy of 91.67%, using either the minimum distance classifier or the least squares minimum distance classifier. Such a system can play a useful part in multimedia applications which require content based search, classification, and retrieval of audio signals, as defined in MPEG-7.
AB - This paper presents a study on musical signal classification, using wavelet transform analysis in conjunction with statistical pattern recognition techniques. A comparative evaluation between different wavelet analysis architectures in terms of their classification ability, as well as between different classifiers is carried out. We seek to establish which statistical measures clearly distinguish between the three different musical styles of rock, piano, and jazz. Our preliminary results suggest that the features collected by the adaptive splitting wavelet transform technique performed better compared to the other wavelet based techniques, achieving an overall classification accuracy of 91.67%, using either the minimum distance classifier or the least squares minimum distance classifier. Such a system can play a useful part in multimedia applications which require content based search, classification, and retrieval of audio signals, as defined in MPEG-7.
UR - https://ieeexplore.ieee.org/xpl/conhome/5518/proceeding
U2 - 10.1109/ICASSP.1998.679665
DO - 10.1109/ICASSP.1998.679665
M3 - Published conference contribution
SP - 3621
EP - 3624
BT - Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing
PB - IEEE Explore
ER -