Text this: Feature-Fusion based Audio-Visual Speech Recognition using Lip Geometry Features in Noisy Environment