Wang Bin
|
ecf2da12f4
|
code refactor, newTire method now only returns errors
|
2015-02-26 15:53:44 +08:00 |
|
Wang Bin
|
55751ed04d
|
tiny code refactor
|
2015-02-26 15:34:58 +08:00 |
|
Wang Bin
|
c2abc24083
|
small refactor
|
2015-02-26 14:59:14 +08:00 |
|
Wang Bin
|
35094877da
|
small refactor
|
2015-02-26 14:42:16 +08:00 |
|
Wang Bin
|
60b2c9f763
|
small refactor, put parse dictionary function in separate file
|
2015-02-26 14:22:29 +08:00 |
|
Wang Bin
|
95a27da5cf
|
small refactor, rename files
|
2015-02-26 11:12:05 +08:00 |
|
Wang Bin
|
c7218ce886
|
small refactor
|
2015-02-26 10:29:17 +08:00 |
|
Wang Bin
|
c076a14f76
|
fixed the test failure issue
|
2015-02-26 10:14:35 +08:00 |
|
Wang Bin
|
67216a8a7d
|
use only one dict to store words and prefixes, corresponding to jieba commit #f808ea0ebba7056fa1b55081b474329e556933a8
|
2015-02-25 18:27:24 +08:00 |
|
Wang Bin
|
08ac49d10b
|
small refactor, don't compile regular expression every time, corresponding to jieba commit #32a0e92a09614cf5c72f87b1a59a5c4369200516
|
2015-02-25 16:32:28 +08:00 |
|
Wang Bin
|
5702495bf6
|
removed MinFreq, corresponding to jieba commit #caae26fbfafd75062742823a23e1cc81368b1451
|
2015-02-25 16:01:39 +08:00 |
|
Wang Bin
|
2515d2e5a0
|
removed unused idx parameter from Calc function, corresponding to jieba commit #8a2e7f0e7ed205429ae545f5b875af4eaa8490d1
|
2015-02-25 12:18:24 +08:00 |
|
Wang Bin
|
73cb677aca
|
added test for loading customized stop words file
|
2015-02-25 12:04:09 +08:00 |
|
Wang Bin
|
e60dcd3e9e
|
fixed the bug of textrank, updated tests
|
2015-02-17 12:04:06 +08:00 |
|
Wang Bin
|
74dbb7d525
|
added simple implementation of textrank, corresponding to jieba commit #4030d8ed86dd3ff54e215ebe88c141b2a8345eda
|
2015-02-04 18:23:47 +08:00 |
|
Wang Bin
|
70a8da2687
|
ExtractTags now returns words with weights instead of just words, corresponding to jieba commit #f5ca87e0884ec6f013b4f9c4119aa47d44e090fe
|
2015-02-04 16:05:23 +08:00 |
|
Wang Bin
|
0f7c56b4ef
|
small code refactor
|
2015-02-04 14:47:59 +08:00 |
|
Wang Bin
|
c3dce0d4a1
|
fixed the trie dump/load issue
|
2015-02-04 14:34:12 +08:00 |
|
Wang Bin
|
9ee7ba2c13
|
use github.com/deckarep/golang-set instead of Trie to reduce memory usage and improve performance, corresponding to jieba commit #4a93f21918a26083c039970edb9457c589c3a0ab
|
2015-02-03 15:20:30 +08:00 |
|
Wang Bin
|
9b2e9d29df
|
fixed the spelling mistakes in posseg_test.go
|
2015-02-03 11:31:31 +08:00 |
|
Wang Bin
|
5e01cf47a3
|
small refactor, move stopwords to a separate file
|
2015-01-26 18:34:19 +08:00 |
|
Wang Bin
|
b828b25f67
|
fix performance problem of extract_tags, corresponding to jieba commit #eb98eb92484d3d302cd96049be43c224fe45414a
|
2015-01-26 18:31:10 +08:00 |
|
Wang Bin
|
2b15490388
|
make tests more readable
|
2015-01-26 17:10:54 +08:00 |
|
Wang Bin
|
2981f336d1
|
make test more readable
|
2015-01-26 16:50:45 +08:00 |
|
Wang Bin
|
af7fefc5fa
|
make tests more readable
|
2015-01-26 16:20:42 +08:00 |
|
Wang Bin
|
052c571015
|
allow user to manually set idf file and stop words file, corresponding to jieba commit 16d626d3477c8649d42094d7d7d4ae7c619783de
|
2014-08-14 12:05:29 +08:00 |
|
Wang Bin
|
d2acf94693
|
code refactor, simplified trie model, also added cache for dictionary file
|
2014-08-13 18:21:41 +08:00 |
|
Wang Bin
|
ad077fcb06
|
updated README.md, using bufio.Scanner instead of NewReader, added travis build status image
|
2014-08-11 16:45:34 +08:00 |
|
Wang Bin
|
c8d195143d
|
using bufio.Scanner to read files, this is the recommended way since Go v1.1
|
2014-08-11 16:44:40 +08:00 |
|
Wang Bin
|
b51ae37771
|
fixed the bug causing random test failures, the Less() function was wrong
|
2014-08-11 16:44:26 +08:00 |
|
Wang Bin
|
bd12e7682d
|
using bufio.Scanner to read files, this is the recommended way since Go v1.1
|
2014-08-11 16:43:38 +08:00 |
|
Wang Bin
|
b46a8dd73d
|
fixed the test failure problem, any tests that require loading user data must be placed after all other tests
|
2014-08-09 00:51:45 +08:00 |
|
Wang Bin
|
e2d61242c2
|
fixed jieba issue #137, added test
|
2014-08-08 18:42:58 +08:00 |
|
Wang Bin
|
385abf0a34
|
updated dict.txt, corresponding to jieba commit dafc73425e (diff-d41d8cd98f00b204e9800998ecf8427e)
|
2014-08-08 18:41:19 +08:00 |
|
Wang Bin
|
a70a2af95d
|
updated travis configuration file
set golang version to 1.3
|
2014-08-08 18:02:27 +08:00 |
|
Wang Bin
|
3d685f1980
|
fixed the bug from issue #132 in the original jieba issue tracker
- issue details from https://github.com/fxsjy/jieba/issues/132
- updated tests
- also some code refactors
|
2014-08-08 17:59:49 +08:00 |
|
Wang Bin
|
08cb04815c
|
updated .travis.yml, set go version to 1.1
|
2013-12-02 22:18:45 +08:00 |
|
Wang Bin
|
c5061d17c7
|
added .travis.yml
|
2013-12-02 22:11:56 +08:00 |
|
Wang Bin
|
0cb70d4890
|
added tokenize function
|
2013-11-04 14:28:18 +08:00 |
|
Wang Bin
|
c00cc35314
|
update README.md, added performance data
|
2013-11-04 07:53:36 +08:00 |
|
Wang Bin
|
1104d62912
|
updated README.md
Signed-off-by: Wang Bin <wangbin.zibo@gmail.com>
|
2013-11-01 17:17:06 +08:00 |
|
Wang Bin
|
3cc6dd7d1e
|
removed cache, added tests for LoadUserDict
|
2013-11-01 10:31:12 +08:00 |
|
Wang Bin
|
c214203325
|
Merge branch 'master' of github.com:wangbin/jiebago
|
2013-10-31 18:20:41 +08:00 |
|
Wang Bin
|
8c785ad36a
|
initial commit
|
2013-10-31 18:20:04 +08:00 |
|
wangbin
|
a945418202
|
Initial commit
|
2013-10-30 21:43:12 -07:00 |
|