Text this: Learning sufficient representation for spatio-temporal deep network using information filter