源文雨
|
36c17a10b5
|
fs.File -> io.Reader
|
2022-12-03 10:54:06 +08:00 |
|
源文雨
|
6982ead703
|
optimize jieba
|
2022-11-30 16:00:56 +08:00 |
|
源文雨
|
7c685f789e
|
optimize Segmenter
|
2022-11-30 15:27:46 +08:00 |
|
源文雨
|
6b239b5918
|
optimize analyze
|
2022-11-30 14:18:36 +08:00 |
|
源文雨
|
f3da9e6420
|
optimize dict, add fs.File support
|
2022-11-30 14:14:48 +08:00 |
|
源文雨
|
c8785c7994
|
drop chan in DictLoader
|
2022-11-30 13:49:06 +08:00 |
|
源文雨
|
d487545eb5
|
optimize tag_extracker
|
2022-11-30 13:35:21 +08:00 |
|
源文雨
|
ae85ccb20a
|
optimize TextRanker float
|
2022-11-30 13:15:12 +08:00 |
|
源文雨
|
a8d1e81f73
|
optimize TextRanker
|
2022-11-30 12:56:04 +08:00 |
|
源文雨
|
21cdb2e863
|
optimize textrank
|
2022-11-30 12:38:47 +08:00 |
|
源文雨
|
8bbc755ed4
|
optimize
|
2022-11-30 12:18:15 +08:00 |
|
Wang Bin
|
7440fa00df
|
tweak examples for analyse
|
2015-05-07 17:43:35 +08:00 |
|
Wang Bin
|
b3ea94ba47
|
tweak docs
|
2015-05-06 16:11:11 +08:00 |
|
Wang Bin
|
f96acf3e85
|
added example_test.go, rename analyse_test.go to tag_extracker_test.go, removed bench.sh
|
2015-05-06 15:57:05 +08:00 |
|
Wang Bin
|
122bad0a8d
|
code refactor, added more documents
|
2015-05-06 12:55:04 +08:00 |
|
Wang Bin
|
87caff09cb
|
small refactor according to go vet check results
|
2015-05-04 18:56:55 +08:00 |
|
Wang Bin
|
52fad00403
|
refactor analyse module
|
2015-05-04 16:39:37 +08:00 |
|
Wang Bin
|
500e6bd10e
|
tweak style
|
2015-05-04 15:11:55 +08:00 |
|
Wang Bin
|
7cf16072e6
|
updated all tests to use Fatal/Fatalf to fail tests earlier
|
2015-03-30 18:01:21 +08:00 |
|
Wang Bin
|
c397cafe8a
|
unify the API
|
2015-03-30 17:52:09 +08:00 |
|
Wang Bin
|
68fed7e250
|
make struct Jieba's fields private
|
2015-03-30 16:12:02 +08:00 |
|
Wang Bin
|
0ca4053394
|
fixed the test failure in textrank
|
2015-03-30 13:06:44 +08:00 |
|
Wang Bin
|
51c63cb9ad
|
small refactor of the interface, use constructors instead of pointers for entry
|
2015-03-30 13:00:56 +08:00 |
|
Wang Bin
|
48a0bd390b
|
fixed a typo in previous commit
|
2015-03-30 11:13:00 +08:00 |
|
Wang Bin
|
a66bf2a0bd
|
make dictPath function private
|
2015-03-30 11:02:57 +08:00 |
|
Wang Bin
|
79adffe328
|
added a new interface for caching
|
2015-03-28 15:49:32 +08:00 |
|
Wang Bin
|
59da5b5e3a
|
removed dict.go, functions move to util.go, also use interface to simplify code
|
2015-03-25 18:28:37 +08:00 |
|
Wang Bin
|
1c378c28a7
|
finished all OOP refactor
|
2015-03-24 18:34:07 +08:00 |
|
Wang Bin
|
16929faf57
|
removed old tokenize module, updated README
|
2015-03-18 17:31:41 +08:00 |
|
Wang Bin
|
f596ac063d
|
added more tests
|
2015-03-17 16:34:36 +08:00 |
|
Wang Bin
|
a14788addb
|
fixed a bug in tokenizer under search mode, added more tests
|
2015-03-17 16:29:09 +08:00 |
|
Wang Bin
|
2c95c61d33
|
added jieba tokenizer for bleve
|
2015-03-17 15:30:13 +08:00 |
|
Wang Bin
|
8bf9888a1c
|
make some public variables/functions private
|
2015-02-28 18:23:59 +08:00 |
|
Wang Bin
|
1c8d4fbf23
|
make some public variables/functions private
|
2015-02-28 18:17:48 +08:00 |
|
Wang Bin
|
d06ba85b0b
|
make some public variables/functions private
|
2015-02-28 18:08:57 +08:00 |
|
Wang Bin
|
142b90f76a
|
updated README.md, fixed a small bug in textrank
|
2015-02-28 14:55:32 +08:00 |
|
Wang Bin
|
c7ac8a7096
|
small tweak: the values of the stopWords map are not used, so they need not be strings
|
2015-02-27 18:07:23 +08:00 |
|
Wang Bin
|
d76fbfb017
|
unify Cut method, return channel instead of array
|
2015-02-27 17:04:50 +08:00 |
|
Wang Bin
|
76b9df8511
|
change cut method to return a string channel, not []string
|
2015-02-27 11:37:55 +08:00 |
|
Wang Bin
|
87fe3a19f8
|
refactor, removed duplicated code
|
2015-02-26 18:03:08 +08:00 |
|
Wang Bin
|
dd7e4ec802
|
refactor, removed duplicated code
|
2015-02-26 17:59:34 +08:00 |
|
Wang Bin
|
ed0bc44921
|
removed third party dependency
|
2015-02-26 17:45:59 +08:00 |
|
Wang Bin
|
73cb677aca
|
added test for loading a customized stop words file
|
2015-02-25 12:04:09 +08:00 |
|
Wang Bin
|
e60dcd3e9e
|
fixed the bug of textrank, updated tests
|
2015-02-17 12:04:06 +08:00 |
|
Wang Bin
|
74dbb7d525
|
added simple implementation of textrank, corresponding to jieba commit #4030d8ed86dd3ff54e215ebe88c141b2a8345eda
|
2015-02-04 18:23:47 +08:00 |
|
Wang Bin
|
70a8da2687
|
ExtractTags now returns words with weights instead of words only, corresponding to jieba commit #f5ca87e0884ec6f013b4f9c4119aa47d44e090fe
|
2015-02-04 16:05:23 +08:00 |
|
Wang Bin
|
5e01cf47a3
|
small refactor, move stopwords to a separate file
|
2015-01-26 18:34:19 +08:00 |
|
Wang Bin
|
b828b25f67
|
fix performance problem of extract_tags, corresponding to jieba commit #eb98eb92484d3d302cd96049be43c224fe45414a
|
2015-01-26 18:31:10 +08:00 |
|
Wang Bin
|
2981f336d1
|
make test more readable
|
2015-01-26 16:50:45 +08:00 |
|
Wang Bin
|
052c571015
|
allow user to manually set idf file and stop words file, corresponding to jieba commit 16d626d3477c8649d42094d7d7d4ae7c619783de
|
2014-08-14 12:05:29 +08:00 |
|