1
0
mirror of https://github.com/fumiama/jieba.git synced 2026-06-05 00:32:51 +08:00

159 Commits

Author SHA1 Message Date
Wang Bin
142b90f76a updated REAMD.md, fixed a small bug in textrank 2015-02-28 14:55:32 +08:00
Wang Bin
c7ac8a7096 small tweak, map stopWords' value are not used, no need to be string 2015-02-27 18:07:23 +08:00
Wang Bin
00fae2358d unify Cut method, return channel instead of array 2015-02-27 17:56:26 +08:00
Wang Bin
43480db509 unify Cut method, return channel instead of array 2015-02-27 17:30:45 +08:00
Wang Bin
c03b3eac1c unify Cut method, return channel instead of array 2015-02-27 17:15:23 +08:00
Wang Bin
d76fbfb017 unify Cut method, return channel instead of array 2015-02-27 17:04:50 +08:00
Wang Bin
76b9df8511 change cut method to return a channel string, not []string 2015-02-27 11:37:55 +08:00
Wang Bin
87fe3a19f8 refactor, removed duplicated codes 2015-02-26 18:03:08 +08:00
Wang Bin
dd7e4ec802 refactor, removed duplicated codes 2015-02-26 17:59:34 +08:00
Wang Bin
ed0bc44921 removed third party dependency 2015-02-26 17:45:59 +08:00
Wang Bin
f6c298fc65 small refactor for function names 2015-02-26 17:38:26 +08:00
Wang Bin
2bda0be7c5 small refactor, updated REAMDE 2015-02-26 17:27:04 +08:00
Wang Bin
f7fdb9749d rename GetDAG to DAG 2015-02-26 16:56:18 +08:00
Wang Bin
aa9ad48b1c refactor variable name 2015-02-26 16:07:08 +08:00
Wang Bin
ecf2da12f4 code refactor, newTire method now only return errors 2015-02-26 15:53:44 +08:00
Wang Bin
55751ed04d tiny code refactor 2015-02-26 15:34:58 +08:00
Wang Bin
c2abc24083 small refactor 2015-02-26 14:59:14 +08:00
Wang Bin
35094877da small refactor 2015-02-26 14:42:16 +08:00
Wang Bin
60b2c9f763 small refactor, put parse dictionary function in seperate gile 2015-02-26 14:22:29 +08:00
Wang Bin
95a27da5cf small refactor, rename files 2015-02-26 11:12:05 +08:00
Wang Bin
c7218ce886 small refactor 2015-02-26 10:29:17 +08:00
Wang Bin
c076a14f76 fixed the tests failure issue 2015-02-26 10:14:35 +08:00
Wang Bin
67216a8a7d use only one dict to store words and prefixes, this corresponding to jieba commit #f808ea0ebba7056fa1b55081b474329e556933a8 2015-02-25 18:27:24 +08:00
Wang Bin
08ac49d10b small refactor, don't compile regular expression every time, corresponding to jieba commit #32a0e92a09614cf5c72f87b1a59a5c4369200516 2015-02-25 16:32:28 +08:00
Wang Bin
5702495bf6 removed MinFreq, correpsonding to jieba commit #caae26fbfafd75062742823a23e1cc81368b1451 2015-02-25 16:01:39 +08:00
Wang Bin
2515d2e5a0 removed unused idx parameter from Calc function, this is correpsonding to jieba commit #8a2e7f0e7ed205429ae545f5b875af4eaa8490d1 2015-02-25 12:18:24 +08:00
Wang Bin
73cb677aca added test for load cutomized stop words file 2015-02-25 12:04:09 +08:00
Wang Bin
e60dcd3e9e fixed the bug of textrank, updated tests 2015-02-17 12:04:06 +08:00
Wang Bin
74dbb7d525 added simple implemention of textrank, corresponding to jieba commit #4030d8ed86dd3ff54e215ebe88c141b2a8345eda 2015-02-04 18:23:47 +08:00
Wang Bin
70a8da2687 ExtractTags not return words with weight, instead of words, this is correpsonding to jieba commit #f5ca87e0884ec6f013b4f9c4119aa47d44e090fe 2015-02-04 16:05:23 +08:00
Wang Bin
0f7c56b4ef small code refactor 2015-02-04 14:47:59 +08:00
Wang Bin
c3dce0d4a1 fixed the trie dump/load issue 2015-02-04 14:34:12 +08:00
Wang Bin
9ee7ba2c13 use github.com/deckarep/golang-set instead of Trie, to reduce memory usage and improve performance, this is corresponding to jieba commit #4a93f21918a26083c039970edb9457c589c3a0ab 2015-02-03 15:20:30 +08:00
Wang Bin
9b2e9d29df fixed the spell mistakes in posseg_test.go 2015-02-03 11:31:31 +08:00
Wang Bin
5e01cf47a3 small refactor, move stopwords to a seperate file 2015-01-26 18:34:19 +08:00
Wang Bin
b828b25f67 fix performance problem of extrag_tags, corresponding to jieba commit #eb98eb92484d3d302cd96049be43c224fe45414a 2015-01-26 18:31:10 +08:00
Wang Bin
2b15490388 make tests more readable 2015-01-26 17:10:54 +08:00
Wang Bin
2981f336d1 make test more readable 2015-01-26 16:50:45 +08:00
Wang Bin
af7fefc5fa make tests more readable 2015-01-26 16:20:42 +08:00
Wang Bin
052c571015 allow user to manually set idf file and stop words file, corresponding to jieba commit 16d626d3477c8649d42094d7d7d4ae7c619783de 2014-08-14 12:05:29 +08:00
Wang Bin
d2acf94693 code refactor, simplfied trie model, also added cache for dictionary file 2014-08-13 18:21:41 +08:00
Wang Bin
ad077fcb06 updated README.md, using bufio.Scanner instead of NewReader, added travis build status image 2014-08-11 16:45:34 +08:00
Wang Bin
c8d195143d using bufio.Scanner to reading files, this the recommended way since Go v1.1 2014-08-11 16:44:40 +08:00
Wang Bin
b51ae37771 fixed the bug cause random testing failure, the Less() function was wrong 2014-08-11 16:44:26 +08:00
Wang Bin
bd12e7682d using bufio.Scanner to reading files, this the recommended way since Go v1.1 2014-08-11 16:43:38 +08:00
Wang Bin
b46a8dd73d fixed the test failure problem, any tests that required load user data must put to the bottom of all other tests 2014-08-09 00:51:45 +08:00
Wang Bin
e2d61242c2 fixed jieba issue #137, added test 2014-08-08 18:42:58 +08:00
Wang Bin
385abf0a34 updated dict.txt, corresponding to jieba commit dafc73425e (diff-d41d8cd98f00b204e9800998ecf8427e) 2014-08-08 18:41:19 +08:00
Wang Bin
a70a2af95d updated travis configuration file
set golang version to 1.3
2014-08-08 18:02:27 +08:00
Wang Bin
3d685f1980 fixed the bug from issue #132 from original jieba issue track
- issue details from https://github.com/fxsjy/jieba/issues/132
 - updated tests
 - also some code refactors
2014-08-08 17:59:49 +08:00