Wang Bin
|
323b6714fa
|
removed cache directory, the refactor I made before was not clear
|
2015-03-24 14:06:10 +08:00 |
|
Wang Bin
|
d257da40a7
|
try to refactor, not finished yet
|
2015-03-20 18:38:08 +08:00 |
|
Wang Bin
|
16929faf57
|
removed old tokenize module, updated README
|
2015-03-18 17:31:41 +08:00 |
|
Wang Bin
|
f596ac063d
|
added more tests
|
2015-03-17 16:34:36 +08:00 |
|
Wang Bin
|
a14788addb
|
fixed a but in tokenizer under search mode, added more tests
|
2015-03-17 16:29:09 +08:00 |
|
Wang Bin
|
2c95c61d33
|
added jieba tokenizer for bleve
|
2015-03-17 15:30:13 +08:00 |
|
Wang Bin
|
1aabc4a2f3
|
removed unnecessary MarshalBinary/UnmarshalBinary method
|
2015-03-16 15:55:41 +08:00 |
|
Wang Bin
|
8bf9888a1c
|
make some public variable/function to private
|
2015-02-28 18:23:59 +08:00 |
|
Wang Bin
|
1c8d4fbf23
|
make some public variable/function to private
|
2015-02-28 18:17:48 +08:00 |
|
Wang Bin
|
d06ba85b0b
|
make some public variable/function to private
|
2015-02-28 18:08:57 +08:00 |
|
Wang Bin
|
a43924173d
|
make some public variable/function to private
|
2015-02-28 17:53:08 +08:00 |
|
Wang Bin
|
30b40c34df
|
make some public variable/function to private
|
2015-02-28 17:51:03 +08:00 |
|
Wang Bin
|
84bd2e14e2
|
make some public variable/function to private
|
2015-02-28 17:43:10 +08:00 |
|
Wang Bin
|
b23dc63eb2
|
make some public variable/function to private
|
2015-02-28 17:40:58 +08:00 |
|
Wang Bin
|
2c6a897751
|
make some public variable/function to private
|
2015-02-28 17:35:04 +08:00 |
|
Wang Bin
|
db06c66086
|
make some public variable/function to private
|
2015-02-28 17:32:40 +08:00 |
|
Wang Bin
|
4b0ddcf666
|
make some public variable/function to private
|
2015-02-28 17:27:16 +08:00 |
|
Wang Bin
|
858ceb5a0b
|
small tweaks, add docs
|
2015-02-28 17:08:04 +08:00 |
|
Wang Bin
|
142b90f76a
|
updated REAMD.md, fixed a small bug in textrank
|
2015-02-28 14:55:32 +08:00 |
|
Wang Bin
|
c7ac8a7096
|
small tweak, map stopWords' value are not used, no need to be string
|
2015-02-27 18:07:23 +08:00 |
|
Wang Bin
|
00fae2358d
|
unify Cut method, return channel instead of array
|
2015-02-27 17:56:26 +08:00 |
|
Wang Bin
|
43480db509
|
unify Cut method, return channel instead of array
|
2015-02-27 17:30:45 +08:00 |
|
Wang Bin
|
c03b3eac1c
|
unify Cut method, return channel instead of array
|
2015-02-27 17:15:23 +08:00 |
|
Wang Bin
|
d76fbfb017
|
unify Cut method, return channel instead of array
|
2015-02-27 17:04:50 +08:00 |
|
Wang Bin
|
76b9df8511
|
change cut method to return a channel string, not []string
|
2015-02-27 11:37:55 +08:00 |
|
Wang Bin
|
87fe3a19f8
|
refactor, removed duplicated codes
|
2015-02-26 18:03:08 +08:00 |
|
Wang Bin
|
dd7e4ec802
|
refactor, removed duplicated codes
|
2015-02-26 17:59:34 +08:00 |
|
Wang Bin
|
ed0bc44921
|
removed third party dependency
|
2015-02-26 17:45:59 +08:00 |
|
Wang Bin
|
f6c298fc65
|
small refactor for function names
|
2015-02-26 17:38:26 +08:00 |
|
Wang Bin
|
2bda0be7c5
|
small refactor, updated REAMDE
|
2015-02-26 17:27:04 +08:00 |
|
Wang Bin
|
f7fdb9749d
|
rename GetDAG to DAG
|
2015-02-26 16:56:18 +08:00 |
|
Wang Bin
|
aa9ad48b1c
|
refactor variable name
|
2015-02-26 16:07:08 +08:00 |
|
Wang Bin
|
ecf2da12f4
|
code refactor, newTire method now only return errors
|
2015-02-26 15:53:44 +08:00 |
|
Wang Bin
|
55751ed04d
|
tiny code refactor
|
2015-02-26 15:34:58 +08:00 |
|
Wang Bin
|
c2abc24083
|
small refactor
|
2015-02-26 14:59:14 +08:00 |
|
Wang Bin
|
35094877da
|
small refactor
|
2015-02-26 14:42:16 +08:00 |
|
Wang Bin
|
60b2c9f763
|
small refactor, put parse dictionary function in seperate gile
|
2015-02-26 14:22:29 +08:00 |
|
Wang Bin
|
95a27da5cf
|
small refactor, rename files
|
2015-02-26 11:12:05 +08:00 |
|
Wang Bin
|
c7218ce886
|
small refactor
|
2015-02-26 10:29:17 +08:00 |
|
Wang Bin
|
c076a14f76
|
fixed the tests failure issue
|
2015-02-26 10:14:35 +08:00 |
|
Wang Bin
|
67216a8a7d
|
use only one dict to store words and prefixes, this corresponding to jieba commit #f808ea0ebba7056fa1b55081b474329e556933a8
|
2015-02-25 18:27:24 +08:00 |
|
Wang Bin
|
08ac49d10b
|
small refactor, don't compile regular expression every time, corresponding to jieba commit #32a0e92a09614cf5c72f87b1a59a5c4369200516
|
2015-02-25 16:32:28 +08:00 |
|
Wang Bin
|
5702495bf6
|
removed MinFreq, correpsonding to jieba commit #caae26fbfafd75062742823a23e1cc81368b1451
|
2015-02-25 16:01:39 +08:00 |
|
Wang Bin
|
2515d2e5a0
|
removed unused idx parameter from Calc function, this is correpsonding to jieba commit #8a2e7f0e7ed205429ae545f5b875af4eaa8490d1
|
2015-02-25 12:18:24 +08:00 |
|
Wang Bin
|
73cb677aca
|
added test for load cutomized stop words file
|
2015-02-25 12:04:09 +08:00 |
|
Wang Bin
|
e60dcd3e9e
|
fixed the bug of textrank, updated tests
|
2015-02-17 12:04:06 +08:00 |
|
Wang Bin
|
74dbb7d525
|
added simple implemention of textrank, corresponding to jieba commit #4030d8ed86dd3ff54e215ebe88c141b2a8345eda
|
2015-02-04 18:23:47 +08:00 |
|
Wang Bin
|
70a8da2687
|
ExtractTags not return words with weight, instead of words, this is correpsonding to jieba commit #f5ca87e0884ec6f013b4f9c4119aa47d44e090fe
|
2015-02-04 16:05:23 +08:00 |
|
Wang Bin
|
0f7c56b4ef
|
small code refactor
|
2015-02-04 14:47:59 +08:00 |
|
Wang Bin
|
c3dce0d4a1
|
fixed the trie dump/load issue
|
2015-02-04 14:34:12 +08:00 |
|