A significant excess weight in tf–idf is arrived at by a superior term frequency (inside the offered document) and a low document frequency of your expression in the whole collection of documents; the weights consequently tend to filter out widespread terms. It was typically made use of as being a weighting factor in searches of information re