A Review of Speech-Based Bimodal Recognition

Chibelushi, C.C.; Deravi, Farzin; Mason, John S. D.

Tools

Lists

Chibelushi, C.C., Deravi, Farzin and Mason, John S. D. (2002) A Review of Speech-Based Bimodal Recognition. IEEE Transactions on Multimedia, 4 (1). pp. 23-37. ISSN 1520-9210

Full text not available from this repository. (Request a copy)

Official URL: http://dx.doi.org/10.1109/6046.985551

Abstract or description

Speech recognition and speaker recognition by
machine are crucial ingredients for many important applications
such as natural and flexible human-machine interfaces. Most
developments in speech-based automatic recognition have relied
on acoustic speech as the sole input signal, disregarding its visual
counterpart. However, recognition based on acoustic speech alone can be afflicted with deficiencies that preclude its use in many real-world applications, particularly under adverse conditions.
The combination of auditory and visual modalities promises
higher recognition accuracy and robustness than can be obtained with a single modality. Multimodal recognition is therefore acknowledged as a vital component of the next generation of spoken language systems. This paper reviews the components of bimodal recognizers, discusses the accuracy of bimodal recognition, and highlights some outstanding research issues as well as possible application domains.

Item Type:	Article
Faculty:	Previous Faculty of Computing, Engineering and Sciences > Computing
Depositing User:	Claude CHIBELUSHI
Date Deposited:	11 Dec 2012 12:22
Last Modified:	24 Feb 2023 13:35
Related URLs:	http://ieeexplore.ieee.org/stamp/stamp.j...
URI:	https://eprints.staffs.ac.uk/id/eprint/70

Actions (login required)

: View Item