mirror of https://github.com/fumiama/jieba.git synced 2026-06-05 00:32:51 +08:00

Commit Graph

  • 5c1e948fb8 optimize: improve addToken performance (#1) master 昔音幻离 2026-01-04 00:00:01 +08:00
  • 36c17a10b5 fs.File -> io.Reader 源文雨 2022-12-03 10:54:06 +08:00
  • 35ac98dc5f add compare 源文雨 2022-11-30 16:39:37 +08:00
  • 9e2fa12d75 add license 源文雨 2022-11-30 16:29:21 +08:00
  • 45c0f09d80 upd readme 源文雨 2022-11-30 16:28:27 +08:00
  • 1803c768ee pass tests & update readme 源文雨 2022-11-30 16:27:07 +08:00
  • 6982ead703 optimize jieba 源文雨 2022-11-30 16:00:56 +08:00
  • 4d76899e79 optimize tokenizer 源文雨 2022-11-30 15:39:50 +08:00
  • a433e052c5 optimize posseg 源文雨 2022-11-30 15:34:39 +08:00
  • 7c685f789e optimize Segmenter 源文雨 2022-11-30 15:27:46 +08:00
  • b2508252d5 optimize finalseg 源文雨 2022-11-30 14:47:50 +08:00
  • 6b239b5918 optimize analyze 源文雨 2022-11-30 14:18:36 +08:00
  • f3da9e6420 optimize dict, add fs.File support 源文雨 2022-11-30 14:14:48 +08:00
  • c8785c7994 drop chan in DictLoader 源文雨 2022-11-30 13:49:06 +08:00
  • d487545eb5 optimize tag_extracker 源文雨 2022-11-30 13:35:21 +08:00
  • ae85ccb20a optimize TextRanker float 源文雨 2022-11-30 13:15:12 +08:00
  • a8d1e81f73 optimize TextRanker 源文雨 2022-11-30 12:56:04 +08:00
  • 21cdb2e863 optimize textrank 源文雨 2022-11-30 12:38:47 +08:00
  • 8bbc755ed4 optimize 源文雨 2022-11-30 12:18:15 +08:00
  • ab8b95ef87 Merge branch 'release/v0.3.2' Wang Bin 2015-05-08 16:35:21 +08:00
  • 6b75cef871 tweak SuggestFrequency, added example Wang Bin 2015-05-08 16:34:28 +08:00
  • c48eb5b4a7 added AddWord/DeleteWord/SuggestFrequency functions, this corresponds to jieba commit #59aa8b69b1399569ea6b417280c993da703baba8 Wang Bin 2015-05-08 11:57:46 +08:00
  • 3d91f615cf moved tokenizers to a separate module Wang Bin 2015-05-07 18:52:29 +08:00
  • 7440fa00df tweak examples for analyse Wang Bin 2015-05-07 17:43:35 +08:00
  • 16bd618c0c fixed tests, tweak examples Wang Bin 2015-05-07 17:38:08 +08:00
  • 385514113a Merge branch 'release/v0.3' Wang Bin 2015-05-07 15:26:03 +08:00
  • d03900e757 updated README Wang Bin 2015-05-07 15:25:09 +08:00
  • 1497472b82 added example_parallel_cut_test.go Wang Bin 2015-05-07 15:10:16 +08:00
  • 939e903f31 small code refactor Wang Bin 2015-05-07 10:59:01 +08:00
  • c068670e9b added more examples Wang Bin 2015-05-06 17:50:29 +08:00
  • b3ea94ba47 tweak docs Wang Bin 2015-05-06 16:11:11 +08:00
  • f96acf3e85 added example_test.go, rename analyse_test.go to tag_extracker_test.go, removed bench.sh Wang Bin 2015-05-06 15:57:05 +08:00
  • 4e8887da5e added package documentation Wang Bin 2015-05-06 15:23:35 +08:00
  • c937cdac5d added example_test.go Wang Bin 2015-05-06 15:19:00 +08:00
  • 122bad0a8d code refactor, added more documents Wang Bin 2015-05-06 12:55:04 +08:00
  • 87caff09cb small refactor according to go vet check results Wang Bin 2015-05-04 18:56:55 +08:00
  • cfed2de1cd added tests for dictionary/dictionary.go Wang Bin 2015-05-04 18:40:50 +08:00
  • 1a9466927a move tokenizer.go to top directory Wang Bin 2015-05-04 18:20:35 +08:00
  • 52fad00403 refactor analyse module Wang Bin 2015-05-04 16:39:37 +08:00
  • 500e6bd10e tweak style Wang Bin 2015-05-04 15:11:55 +08:00
  • edef39719d move jieba to a separate module, tweak posseg module Wang Bin 2015-04-30 17:01:02 +08:00
  • d9f77563bf added util module Wang Bin 2015-04-30 15:26:34 +08:00
  • 732196127b added more tests for dictionary.go Wang Bin 2015-04-30 11:21:16 +08:00
  • ac7628edaf added tests for dictionary.go, fixed a small bug Wang Bin 2015-04-30 11:03:54 +08:00
  • ae54d82c68 added tests for dictionary.go, fixed a small bug Wang Bin 2015-04-30 11:02:01 +08:00
  • 0124ebadce put dictionary in a separate module Wang Bin 2015-04-29 18:51:38 +08:00
  • b19eb4f6fe code refactor, use uint for map key to improve performance Wang Bin 2015-04-06 20:24:07 +08:00
  • 17ab0b2cc7 small style refactor Wang Bin 2015-04-04 17:42:41 +08:00
  • 83efde1e61 small refactors, removed sort in dag, save logTotal in segmenter Wang Bin 2015-04-04 17:10:40 +08:00
  • 5c6a2eff74 Merge branch 'posseg' into develop Wang Bin 2015-04-04 15:39:21 +08:00
  • 847dae9d38 added bench.sh Wang Bin 2015-04-04 15:35:01 +08:00
  • e8cf1e9a9c small refactor Wang Bin 2015-04-04 15:30:11 +08:00
  • 188133261f small tweaks, added bench.sh for benchmark Wang Bin 2015-04-04 15:26:26 +08:00
  • bbe302a351 removed sorts to slightly improve performance Wang Bin 2015-04-03 16:48:45 +08:00
  • d22cc9b6b6 fixed a typo in jieba_test.go Wang Bin 2015-04-02 18:29:24 +08:00
  • 84ad6fe25e code refactor, updated RegexpSplit function to match Python's re.split function Wang Bin 2015-04-02 18:25:00 +08:00
  • 0ab9063f43 added benchmarks for posseg Wang Bin 2015-03-31 13:49:54 +08:00
  • 3852f660aa added benchmark for Cut related functions Wang Bin 2015-03-31 12:03:01 +08:00
  • 7cf16072e6 updated all tests to use Fatal/Fatalf to fail tests earlier Wang Bin 2015-03-30 18:01:21 +08:00
  • c397cafe8a uniform the api Wang Bin 2015-03-30 17:52:09 +08:00
  • 7a7f8af517 move DAG related functions to a separate file, rename Calc to Routes Wang Bin 2015-03-30 17:10:48 +08:00
  • 68fed7e250 make struct Jieba's fields private Wang Bin 2015-03-30 16:12:02 +08:00
  • c4c3a5f9ad refactor Cut function, make CutAll a separate function to simplify the logic of Cut Wang Bin 2015-03-30 15:18:36 +08:00
  • 556b96b137 removed unused method/property Wang Bin 2015-03-30 14:31:41 +08:00
  • 328310cfbb removed all cache load/dump related code, benchmark shows reading from the dict file is faster than loading from a gob file Wang Bin 2015-03-30 14:25:08 +08:00
  • 0ca4053394 fixed the test failure in textrank Wang Bin 2015-03-30 13:06:44 +08:00
  • 51c63cb9ad small refactor of the interface, use constructors instead of pointers for entry Wang Bin 2015-03-30 13:00:56 +08:00
  • 48a0bd390b fixed a typo in previous commit Wang Bin 2015-03-30 11:13:00 +08:00
  • a66bf2a0bd make dictPath function private Wang Bin 2015-03-30 11:02:57 +08:00
  • 79adffe328 added a new interface for caching Wang Bin 2015-03-28 15:49:32 +08:00
  • e11060513c merge trie.go into jieba.go Wang Bin 2015-03-28 12:14:11 +08:00
  • 45c7854fac finished generalization of dictionary load Wang Bin 2015-03-28 10:51:00 +08:00
  • e155fe5467 refactor to generalize set dictionary function, not finished yet Wang Bin 2015-03-25 18:46:14 +08:00
  • 59da5b5e3a removed dict.go, functions move to util.go, also use interface to simplify code Wang Bin 2015-03-25 18:28:37 +08:00
  • 7fe5e7d4c4 small refactor, replace WordTagFreq with Entry Wang Bin 2015-03-25 17:53:25 +08:00
  • 800ecaa8c9 small refactor Wang Bin 2015-03-25 16:01:05 +08:00
  • 8687ca58b8 removed unnecessary stateTag struct, using string instead Wang Bin 2015-03-25 15:13:46 +08:00
  • 1c378c28a7 finished all OOP refactor Wang Bin 2015-03-24 18:34:07 +08:00
  • 73d87e4ed6 refactor posseg, added Posseg struct Wang Bin 2015-03-24 16:54:02 +08:00
  • 0027927b6d code refactor for RegexpSplit function, moved it to util.go, add return chan string Wang Bin 2015-03-24 14:40:06 +08:00
  • 323b6714fa removed cache directory, the refactor I made before was not clear Wang Bin 2015-03-24 14:06:10 +08:00
  • d257da40a7 try to refactor, not finished yet Wang Bin 2015-03-20 18:38:08 +08:00
  • 6a38b43000 Merge branch 'release/v0.2' Wang Bin 2015-03-18 17:36:28 +08:00
  • 16929faf57 removed old tokenize module, updated README Wang Bin 2015-03-18 17:31:41 +08:00
  • f596ac063d added more tests Wang Bin 2015-03-17 16:34:36 +08:00
  • a14788addb fixed a bug in tokenizer under search mode, added more tests Wang Bin 2015-03-17 16:29:09 +08:00
  • 2c95c61d33 added jieba tokenizer for bleve Wang Bin 2015-03-17 15:30:13 +08:00
  • 1aabc4a2f3 removed unnecessary MarshalBinary/UnmarshalBinary method Wang Bin 2015-03-16 15:55:41 +08:00
  • ebf4635334 Merge branch 'release/v0.1' Wang Bin 2015-02-28 18:26:18 +08:00
  • 8bf9888a1c make some public variable/function to private Wang Bin 2015-02-28 18:23:59 +08:00
  • 1c8d4fbf23 make some public variable/function to private Wang Bin 2015-02-28 18:17:48 +08:00
  • d06ba85b0b make some public variable/function to private Wang Bin 2015-02-28 18:08:57 +08:00
  • a43924173d make some public variable/function to private Wang Bin 2015-02-28 17:53:08 +08:00
  • 30b40c34df make some public variable/function to private Wang Bin 2015-02-28 17:51:03 +08:00
  • 84bd2e14e2 make some public variable/function to private Wang Bin 2015-02-28 17:43:10 +08:00
  • b23dc63eb2 make some public variable/function to private Wang Bin 2015-02-28 17:40:58 +08:00
  • 2c6a897751 make some public variable/function to private Wang Bin 2015-02-28 17:35:04 +08:00
  • db06c66086 make some public variable/function to private Wang Bin 2015-02-28 17:32:40 +08:00
  • 4b0ddcf666 make some public variable/function to private Wang Bin 2015-02-28 17:27:16 +08:00
  • 858ceb5a0b small tweaks, add docs Wang Bin 2015-02-28 17:08:04 +08:00