Synonyms/CHANGELOG.md
2018-03-04 10:33:13 +08:00

25 lines
731 B
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# 3.3
* 增加分词接口
# 3.2
* 将发布证书改为MIT
# 3.1
* 对空间临近词的邻居进行缓存,提高返回速度
* nearby中处理OOV返回 ([], [])
# 3.0 - 更简单的定制和配置,增加了额外的开销
* 去掉nearby words, 使用 kdtree检索空间词汇的最近临
* 增加了对sk-learn的依赖但是减少了对词向量的预处理
* 优化了分词所使用的字典,也可以使用环境变量声明主字典
* 支持自定义word2vec模型使用环境变量声明
# 2.5
* 使用空间距离近的词汇优化编辑距离计算
# 2.3
* 计算相似度时增加平滑策略
# v1.6
* use ```jieba``` instead of ```thulac``` as tokeninzer.
* refine console log for Jupyter notebook.