Documentation
find_similar’s interface
- find_similar.find_similar(text_to_check, texts, language='russian', count=5, dictionary=None, remove_stopwords=True, keywords=None) list[TokenText] [source]
The main function to search similar texts.
- Parameters:
text_to_check – Text to find similars
texts – List of str or TokenText. In these texts we will search similars
language – Language, default=’russian’
count – Results count
dictionary – default = None. If you want to replace one words to others
keywords – default = None.
remove_stopwords – default = True. Remove or not stopwords
- Returns:
Result list sorted by similarity percent (cos)