Wang Bin
|
c7ac8a7096
|
small tweak, map stopWords' value are not used, no need to be string
|
2015-02-27 18:07:23 +08:00 |
|
Wang Bin
|
d76fbfb017
|
unify Cut method, return channel instead of array
|
2015-02-27 17:04:50 +08:00 |
|
Wang Bin
|
76b9df8511
|
change cut method to return a channel string, not []string
|
2015-02-27 11:37:55 +08:00 |
|
Wang Bin
|
87fe3a19f8
|
refactor, removed duplicated codes
|
2015-02-26 18:03:08 +08:00 |
|
Wang Bin
|
dd7e4ec802
|
refactor, removed duplicated codes
|
2015-02-26 17:59:34 +08:00 |
|
Wang Bin
|
ed0bc44921
|
removed third party dependency
|
2015-02-26 17:45:59 +08:00 |
|
Wang Bin
|
73cb677aca
|
added test for load cutomized stop words file
|
2015-02-25 12:04:09 +08:00 |
|
Wang Bin
|
e60dcd3e9e
|
fixed the bug of textrank, updated tests
|
2015-02-17 12:04:06 +08:00 |
|
Wang Bin
|
74dbb7d525
|
added simple implemention of textrank, corresponding to jieba commit #4030d8ed86dd3ff54e215ebe88c141b2a8345eda
|
2015-02-04 18:23:47 +08:00 |
|
Wang Bin
|
70a8da2687
|
ExtractTags not return words with weight, instead of words, this is correpsonding to jieba commit #f5ca87e0884ec6f013b4f9c4119aa47d44e090fe
|
2015-02-04 16:05:23 +08:00 |
|
Wang Bin
|
5e01cf47a3
|
small refactor, move stopwords to a seperate file
|
2015-01-26 18:34:19 +08:00 |
|
Wang Bin
|
b828b25f67
|
fix performance problem of extrag_tags, corresponding to jieba commit #eb98eb92484d3d302cd96049be43c224fe45414a
|
2015-01-26 18:31:10 +08:00 |
|
Wang Bin
|
2981f336d1
|
make test more readable
|
2015-01-26 16:50:45 +08:00 |
|
Wang Bin
|
052c571015
|
allow user to manually set idf file and stop words file, corresponding to jieba commit 16d626d3477c8649d42094d7d7d4ae7c619783de
|
2014-08-14 12:05:29 +08:00 |
|
Wang Bin
|
8c785ad36a
|
initial commit
|
2013-10-31 18:20:04 +08:00 |
|