Closed
Milestone
Mar 16, 2020–Mar 20, 2020
POC
This should deliver a simple implementation of TF-IDF based on draft from the Exposé with minor improvements. Goal is to have a basic Pipeline but not much the integration in FAMER. Main aims:
-
parse csv from Benchmark datasets and represent in some form as documents -
pre-processing to calculate df -
basic TF-IDF similarity measurement -
performance measurement using ground truth
Loading
Loading
Loading
Loading