Documentation

find_similar’s interface

find_similar.find_similar(text_to_check, texts, language='russian', count=5, dictionary=None, remove_stopwords=True, keywords=None) list[TokenText][source]

The main function to search similar texts.

Parameters:
  • text_to_check – Text to find similars

  • texts – List of str or TokenText. In these texts we will search similars

  • language – Language, default=’russian’

  • count – Results count

  • dictionary – default = None. If you want to replace one words to others

  • keywords – default = None.

  • remove_stopwords – default = True. Remove or not stopwords

Returns:

Result list sorted by similarity percent (cos)

How to use different parameters