tika-similarity - Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features

  •    Python

This project demonstrates using the Tika-Python package (Python port of Apache Tika) to compute file similarity based on metadata features. The script iterates over all files in the current directory, or over files given on the command line, derives their metadata features, and then computes the union of all features. That union becomes the "golden feature set" against which each document's features are compared via intersection. The size of that intersection for each file, divided by the size of the union, is the file's similarity score.
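The scoring described above can be sketched in a few lines of Python. This is only an illustration, not the project's actual script: the function name `similarity_scores` and the sample metadata are hypothetical, and the sketch assumes the metadata for each file has already been extracted into a dictionary, with the dictionary keys treated as the features.

```python
def similarity_scores(metadata_by_file):
    """Score each file by its share of the union of all metadata keys.

    metadata_by_file maps a file name to its metadata dictionary
    (assumed to be extracted beforehand; keys are used as features).
    """
    # Build the "golden feature set": the union of every file's features.
    golden = set()
    for metadata in metadata_by_file.values():
        golden |= set(metadata)

    # With no features at all, every score is defined as 0.0 here
    # (an assumption made to avoid dividing by zero).
    if not golden:
        return {name: 0.0 for name in metadata_by_file}

    # Score = |file features ∩ golden set| / |golden set|.
    return {
        name: len(set(metadata) & golden) / len(golden)
        for name, metadata in metadata_by_file.items()
    }


# Hypothetical sample metadata, for illustration only.
example = {
    "a.pdf": {"Content-Type": "application/pdf", "Author": "x", "title": "t"},
    "b.txt": {"Content-Type": "text/plain"},
}
scores = similarity_scores(example)
```

Here the golden set has three features, so `a.pdf` (which has all three) scores 1.0 and `b.txt` (which has one) scores one third.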

koolsla - Food recommendation tool with Machine learning.

  •    Python

koolsla (Coleslaw) is a content-based recommendation tool built on machine learning, using tf-idf and cosine similarity. The user supplies a natural number that is the ID of a unique dish name. The plot summaries of the 424508 different dishes in the dataset are analyzed and vectorized with tf-idf. A set of dishes (its size chosen by the user) is then recommended based on their cosine similarity to the vectorized input.
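A minimal sketch of the tf-idf plus cosine-similarity idea follows, using only the Python standard library. It is not koolsla's own code: the function names, the whitespace tokenization, the smoothed IDF formula, and the three sample dish texts are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse tf-idf vector (term -> weight) per document."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    # Smoothed inverse document frequency (one common variant, assumed here).
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in doc_freq.items()}

    return [
        {t: count * idf[t] for t, count in Counter(tokens).items()}
        for tokens in tokenized
    ]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def recommend(dish_id, descriptions, k):
    """Return the IDs of the k dishes most similar to dish_id."""
    vectors = tfidf_vectors(descriptions)
    query = vectors[dish_id]
    scored = [
        (cosine(query, vec), i) for i, vec in enumerate(vectors) if i != dish_id
    ]
    # Highest similarity first; ties broken by the lower ID.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in scored[:k]]


# Hypothetical dish texts, for illustration only.
dishes = [
    "creamy cabbage salad with carrot",
    "cabbage and carrot slaw with vinegar",
    "chocolate cake with cream frosting",
]
top = recommend(0, dishes, 1)
```

With these sample texts, dish 0 shares "cabbage" and "carrot" with dish 1 but only the common word "with" with dish 2, so dish 1 ranks first. A dataset of several hundred thousand dishes would call for a sparse-matrix library rather than per-query dictionaries.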