Speech Processing in Computer Vision Applications