Commit Graph

38 Commits

Author SHA1 Message Date
Hai Liang Wang
4c97dfb7ce Check string for chinese words 2018-03-04 23:11:12 +08:00
Hai Liang Wang
7bfee1403a [compare] 更新分数计算 2018-03-04 22:56:13 +08:00
Hai Liang Wang
ddd2a97720 [nearby] Check eq before computing sim 2018-03-04 17:49:40 +08:00
Hai Liang Wang
e3d91154e6 export seg method 2018-03-04 10:27:18 +08:00
Hai Liang Wang
97b676ee82 Closed #45 enable cache for nearby words 2018-03-04 09:52:44 +08:00
Hai Liang Wang
d4f20e94a9 Refine vocab and nearby 2018-03-04 00:02:28 +08:00
Hai Liang Wang
a1af98a5a9 Support customized dict and model 2018-03-03 21:18:43 +08:00
Hai Liang Wang
4450ba836c refine format 2018-03-02 13:19:03 +08:00
Hai Liang Wang
0e5794cfff Leverage distance computing algorithm in compare API 2018-03-02 11:08:54 +08:00
Hai Liang Wang
dac98aa891 Closed #43 smoothing scores in compare API 2018-03-01 23:29:49 +08:00
AlexSun1995
129fe23cc8 将模型库中不存在词的向量值由全0替换为随机向量 2018-02-02 21:04:52 +08:00
Hain Wang
043a8ced04
Merge pull request #34 from weiyunfei/master
Resolve the ImportError when there's "utils" model.
2018-01-29 11:36:20 +08:00
wei
2db34e1479 Resolve the ImportError when there's "utils" model. 2018-01-25 16:13:44 +08:00
Hai Liang Wang
9356e62c78 #31 remove synonyms/data.py 2018-01-23 10:33:10 +08:00
cclauss
f8a2e30a17
import shutil and from six import string_types, u 2018-01-18 11:31:57 +01:00
Hai Liang Wang
e73f2a0dd3 Refine distance params, upgrade to v2 2017-12-31 19:01:05 +08:00
huyingxi
54a978e63e
Update __init__.py
update similarity_distance weights
2017-12-30 23:22:35 +08:00
bobbercheng
cdb85530f4
Update __init__.py
Use _levenshtein_distance() to replace _unigram_overlap. We still need to adjust weight for better result. Right now following two sentences are still regarded as same.

synonyms.compare('目前你用什么方法来保护朋友',  '目前你用什么方法来保护家人')
2017-11-14 09:53:28 -06:00
inuyasha2012
c41ee7397d fix python3 open stopwords file UnicodeDecodeError bug 2017-11-08 09:37:48 +08:00
huyingxi
de9c7ec748
Update __init__.py
updata sentence similarity's threshold
2017-11-04 10:45:56 -05:00
Hai Liang Wang
a1283d9926 #6 move useless code into data.py 2017-10-31 17:17:01 +08:00
Hai Liang Wang
312eae0067 #6 simplify code and support py2,3 2017-10-31 16:54:55 +08:00
huyingxi
86c188bb57
Update __init__.py 2017-10-30 22:18:54 -05:00
huyingxi
c0a4775551 inhence sentence similarity 2017-10-31 10:54:31 +08:00
Hai Liang Wang
4575f3e7c6 use jieba as tokenizer 2017-10-28 10:06:11 +08:00
jiangbo
55be4367b7 chore: reattach stdout to sys 2017-10-21 22:16:00 +08:00
Hai Liang Wang
07de9b2aef refactor with pkl, add benchmark 2017-10-21 11:45:15 +08:00
Hai Liang Wang
6132372654 update README.md 2017-10-21 10:12:17 +08:00
Hai Liang Wang
532d57b987 fix display mth 2017-10-21 09:59:26 +08:00
Hai Liang Wang
0c7be0b24c avoid overflow with similarity 2017-10-18 10:16:55 +08:00
Hai Liang Wang
2316042eea add compatible codes for py2 2017-10-18 09:49:09 +08:00
Hai Liang Wang
554f4c8f76 Add compare similarity api 2017-10-16 22:29:51 +08:00
Hai Liang Wang
24fdb36e8f Update README.md for usage 2017-09-29 10:24:34 +08:00
Hai Liang Wang
29f35ba39d Add demo as example 2017-09-28 22:48:39 +08:00
Hai Liang Wang
1251479fd8 Update README.md 2017-09-28 22:14:26 +08:00
Hai Liang Wang
76a4ff486c Enable build vocab 2017-09-28 22:01:28 +08:00
Hai Liang Wang
ea072b70c4 enable build dict 2017-09-28 21:56:24 +08:00
Hai Liang Wang
2167069153 init 2017-09-27 15:27:47 +08:00