Clustering the Headlines dataset