Chibelushi, C.C. (2004) Fuzzy Audio-Visual Feature Maps for Speaker Identification. In: Applications and Science in Soft Computing. Springer-Verlag, Berlin, pp. 317-322.
Full text not available from this repository.

Abstract or description
Speech-based person recognition by machine has not reached the level of technological maturity required by some of its potential applications. The deficiencies revolve around sub-optimal pre-processing, feature extraction or selection, and classification, particularly under conditions of input data variability. The joint use of audible and visible manifestations of speech aims to alleviate these shortcomings, but the development of effective combination techniques is challenging. This paper proposes and evaluates a combination approach for speaker identification based on fuzzy modelling of acoustic and visual speaker characteristics. The proposed audio-visual model has been evaluated experimentally on a speaker identification task. The results show that the joint model outperforms its isolated components in terms of identification accuracy. In particular, the cross-modal coupling of audio-visual streams is shown to improve identification accuracy.
| Item Type: | Book Chapter, Section or Conference Proceeding |
|---|---|
| Faculty: | Previous Faculty of Computing, Engineering and Sciences > Computing |
| Depositing User: | Claude CHIBELUSHI |
| Date Deposited: | 13 May 2013 23:07 |
| Last Modified: | 24 Feb 2023 13:38 |
| URI: | https://eprints.staffs.ac.uk/id/eprint/1106 |