pre_processing

poi_interlinking.pre_processing.frequent_terms.extract_freqterms(fname, encoding, exp_path)

Extract all distinct terms found in the file fname, count their occurrences, and sort them in descending order of frequency.

Parameters
  • fname (str) – Input filename from which to extract the frequent terms.

  • encoding (str) – The encoding of the fname file. Valid options are latin or global.

  • exp_path (str) – Prefix to be used in naming the output file with the extracted frequent terms.
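
The counting and sorting step described above can be illustrated with a minimal sketch. This is not the implementation of extract_freqterms: the helper name count_terms, the tokenization rule (lowercased runs of word characters), and the alphabetical tie-break are assumptions made for illustration, and the sketch works on in-memory strings rather than reading fname or writing an output file.

```python
import re
from collections import Counter


def count_terms(lines):
    """Count distinct terms across an iterable of strings, most frequent first.

    Hypothetical helper for illustration only; not part of poi_interlinking.
    """
    counts = Counter()
    for line in lines:
        # Assumption: a term is a lowercased run of word characters.
        counts.update(re.findall(r"\w+", line.lower()))
    # Sort by descending count; break ties alphabetically for a stable order.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


sample = ["Saint Mary Church", "Church of Saint John", "Mary Street"]
ranked = count_terms(sample)
for term, freq in ranked:
    print(term, freq)
```

In this sample, "church", "mary" and "saint" each occur twice and are listed first; the remaining terms occur once.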
