mirror of https://github.com/fumiama/jieba.git synced 2026-06-05 00:32:51 +08:00

51 Commits

Author SHA1 Message Date
源文雨
36c17a10b5 fs.File -> io.Reader 2022-12-03 10:54:06 +08:00
源文雨
6982ead703 optimize jieba 2022-11-30 16:00:56 +08:00
源文雨
7c685f789e optimize Segmenter 2022-11-30 15:27:46 +08:00
源文雨
6b239b5918 optimize analyze 2022-11-30 14:18:36 +08:00
源文雨
f3da9e6420 optimize dict, add fs.File support 2022-11-30 14:14:48 +08:00
源文雨
c8785c7994 drop chan in DictLoader 2022-11-30 13:49:06 +08:00
源文雨
d487545eb5 optimize tag_extracker 2022-11-30 13:35:21 +08:00
源文雨
ae85ccb20a optimize TextRanker float 2022-11-30 13:15:12 +08:00
源文雨
a8d1e81f73 optimize TextRanker 2022-11-30 12:56:04 +08:00
源文雨
21cdb2e863 optimize textrank 2022-11-30 12:38:47 +08:00
源文雨
8bbc755ed4 optimize 2022-11-30 12:18:15 +08:00
Wang Bin
7440fa00df tweak examples for analyse 2015-05-07 17:43:35 +08:00
Wang Bin
b3ea94ba47 tweak docs 2015-05-06 16:11:11 +08:00
Wang Bin
f96acf3e85 added example_test.go, rename analyse_test.go to tag_extracker_test.go, removed bench.sh 2015-05-06 15:57:05 +08:00
Wang Bin
122bad0a8d code refactor, added more documents 2015-05-06 12:55:04 +08:00
Wang Bin
87caff09cb small refactor according to go vet check results 2015-05-04 18:56:55 +08:00
Wang Bin
52fad00403 refactor analyse module 2015-05-04 16:39:37 +08:00
Wang Bin
500e6bd10e tweak style 2015-05-04 15:11:55 +08:00
Wang Bin
7cf16072e6 updated all tests to use Fatal/Fatalf to fail tests earlier 2015-03-30 18:01:21 +08:00
Wang Bin
c397cafe8a unify the API 2015-03-30 17:52:09 +08:00
Wang Bin
68fed7e250 make struct Jieba's fields private 2015-03-30 16:12:02 +08:00
Wang Bin
0ca4053394 fixed the test failure in textrank 2015-03-30 13:06:44 +08:00
Wang Bin
51c63cb9ad small refactor of the interface, use constructors instead of pointers for entry 2015-03-30 13:00:56 +08:00
Wang Bin
48a0bd390b fixed a typo in previous commit 2015-03-30 11:13:00 +08:00
Wang Bin
a66bf2a0bd make dictPath function private 2015-03-30 11:02:57 +08:00
Wang Bin
79adffe328 added a new interface for caching 2015-03-28 15:49:32 +08:00
Wang Bin
59da5b5e3a removed dict.go, functions move to util.go, also use interface to simplify code 2015-03-25 18:28:37 +08:00
Wang Bin
1c378c28a7 finished all OOP refactor 2015-03-24 18:34:07 +08:00
Wang Bin
16929faf57 removed old tokenize module, updated README 2015-03-18 17:31:41 +08:00
Wang Bin
f596ac063d added more tests 2015-03-17 16:34:36 +08:00
Wang Bin
a14788addb fixed a bug in tokenizer under search mode, added more tests 2015-03-17 16:29:09 +08:00
Wang Bin
2c95c61d33 added jieba tokenizer for bleve 2015-03-17 15:30:13 +08:00
Wang Bin
8bf9888a1c make some public variable/function to private 2015-02-28 18:23:59 +08:00
Wang Bin
1c8d4fbf23 make some public variable/function to private 2015-02-28 18:17:48 +08:00
Wang Bin
d06ba85b0b make some public variable/function to private 2015-02-28 18:08:57 +08:00
Wang Bin
142b90f76a updated README.md, fixed a small bug in textrank 2015-02-28 14:55:32 +08:00
Wang Bin
c7ac8a7096 small tweak, map stopWords' value are not used, no need to be string 2015-02-27 18:07:23 +08:00
Wang Bin
d76fbfb017 unify Cut method, return channel instead of array 2015-02-27 17:04:50 +08:00
Wang Bin
76b9df8511 change cut method to return a channel string, not []string 2015-02-27 11:37:55 +08:00
Wang Bin
87fe3a19f8 refactor, removed duplicated codes 2015-02-26 18:03:08 +08:00
Wang Bin
dd7e4ec802 refactor, removed duplicated codes 2015-02-26 17:59:34 +08:00
Wang Bin
ed0bc44921 removed third party dependency 2015-02-26 17:45:59 +08:00
Wang Bin
73cb677aca added test for loading a customized stop words file 2015-02-25 12:04:09 +08:00
Wang Bin
e60dcd3e9e fixed the bug of textrank, updated tests 2015-02-17 12:04:06 +08:00
Wang Bin
74dbb7d525 added simple implementation of textrank, corresponding to jieba commit #4030d8ed86dd3ff54e215ebe88c141b2a8345eda 2015-02-04 18:23:47 +08:00
Wang Bin
70a8da2687 ExtractTags now returns words with weights instead of words only, corresponding to jieba commit #f5ca87e0884ec6f013b4f9c4119aa47d44e090fe 2015-02-04 16:05:23 +08:00
Wang Bin
5e01cf47a3 small refactor, move stopwords to a separate file 2015-01-26 18:34:19 +08:00
Wang Bin
b828b25f67 fix performance problem of extract_tags, corresponding to jieba commit #eb98eb92484d3d302cd96049be43c224fe45414a 2015-01-26 18:31:10 +08:00
Wang Bin
2981f336d1 make test more readable 2015-01-26 16:50:45 +08:00
Wang Bin
052c571015 allow user to manually set idf file and stop words file, corresponding to jieba commit 16d626d3477c8649d42094d7d7d4ae7c619783de 2014-08-14 12:05:29 +08:00