This document discusses various similarity measures that can be used to quantify the similarity between documents, queries, or a document and query in an information retrieval system. It describes classic measures like Dice coefficient, overlap coefficient, Jaccard coefficient, and cosine coefficient. It provides examples of calculating these measures and compares the relations between different measures. The document also discusses using term-document matrices and shows an example matrix.
2. Chapter 3 Similarity Measures Written by Kevin E. Heinrich Presented by Zhao Xinyou [email_address] 2007.6.7 Some materials (Examples) are taken from Website.
10. Simple Measure Technology Documents Set PP29 Retrieved A Relevant B Retrieved and Relevant A ∩B Precision = Returned Relevant Documents / Total Returned Documents Recall = Returned Relevant Documents / Total Relevant Documents P(A,B) = |A ∩B| / |A| R(A,B) = |A ∩B| / |B|