Text this: NN speech recognition utilizing aligned DTW local distance scores