Text this: The impact of pre-processing and feature selection on text classification