Similarity Criteria Issues in Similarity Retrieval

The wide use of the internet coupled with the steadily decreasing cost in computing and storage has led to an expansion of the data that users expect to retrieve from simple numeric and alphanumeric, to include images, audio, video, as well as more abstract data found in bioinformatics applications, where the retrieval criterion is one of similarity. An inherent difficulty with similarity retrieval is deciding on a criterion for the similarity. In this proposal we explore issues involved in retrieval that is based on different criteria of similarity.

NSF Grant IIS-08-12377

System Site:

Relevant Publications:

  1. B. E. Teitler, J. Sankaranarayanan, H. Samet
    Online document clustering using the GPU.
    Technical report, Computer Science Department, University of Maryland, College Park, MD, August 2010.[link]
    Categories: [spatio-textual search engine]

  2. J. Sankaranarayanan, H. Samet
    Images in news.
    In Proceedings of the 24th International Conference on Pattern Recognition, pages 3240-3243, Istanbul, Turkey, August 2010.[link]
    Categories: [spatio-textual search engine, Twitter]

  3. S. Nutanong, E. H. Jacox, H. Samet
    An incremental Hausdorff distance calculation algorithm.
    PVLDB, 4(8):506-517, August 2011.[link]
    Also Proceedings of the 37th International Conference on Very Large Data Bases (VLDB)
    Categories: [spatial algorithms, similarity searching]