An Analysis of Document Retrieval and Clustering Using an Effective Semantic Distance Measure