pre_process

interlinking.pre_process.extract_freqterms(fname, encoding)[source]

Extract frequent terms found in the fname dataset and save them to a file under default_data_path.

Parameters
  • fname (str) – File name of the dataset.

  • encoding (str) – Encoding of the input dataset (global | latin).

Return Home