1
0
mirror of https://github.com/fumiama/Retrieval-based-Voice-Conversion-WebUI.git synced 2026-06-05 17:20:25 +08:00
Commit Graph

147 Commits

Author SHA1 Message Date
github-actions[bot]
f4df04a5af chore(format): run black on dev 2026-04-18 09:31:50 +00:00
源文雨
3affc9415d fix(train): unsupported gloo device on win 2026-04-18 17:30:48 +08:00
源文雨
cc50ede4fb fix(train): extract f0 & feature hang 2026-04-18 17:03:52 +08:00
github-actions[bot]
96f7bccb12 chore(format): run black on dev (#136)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-11-26 13:06:55 +08:00
源文雨
43d19eb00e fix(dml): train extract_f0_print error
ModuleNotFoundError: No module named 'torch.privateuseone' due to new prosess
2025-11-21 16:52:17 +08:00
源文雨
53d54681cf fix: i18n & dl & tests 2025-11-21 15:06:04 +08:00
源文雨
215a3edcef fix: remove outdated codes & add weights_only=True 2025-11-18 13:57:03 +08:00
源文雨
033306439c fix(train): cannot extract feature on non-cuda devices (fix #123) 2025-07-08 16:40:38 +09:00
github-actions[bot]
3b4a546ced chore(format): run black on dev (#121)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-06-19 17:56:04 +09:00
源文雨
6223116f6b fix: no attribute 'FILE_LIKE'' & update instructions 2025-06-19 17:25:21 +09:00
github-actions[bot]
b88cc1e8fe chore(format): run black on dev (#111)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-02-08 23:10:24 +08:00
Yongkun Li
ef9c8eb656 fix: Add weight whitelist support for torch 2.6 (#110) 2025-02-07 15:26:01 +08:00
github-actions[bot]
51c85fcc49 chore(format): run black on dev (#101)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-28 23:20:17 +09:00
源文雨
5969314e8d optimize(uvr5): apply jit to spec_utils & fix flac save
also fix #85
2024-11-28 23:19:05 +09:00
github-actions[bot]
19619161be chore(format): run black on dev (#98)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-28 18:07:17 +09:00
源文雨
7befbd10d9 optimize(train): combine extract f0 together 2024-11-28 18:03:17 +09:00
github-actions[bot]
d3add81469 chore(format): run black on dev (#94)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-28 03:21:10 +09:00
源文雨
a8783c6639 optimize: some training optimizations (#95)
* optimzie(train&uvr5): rm sf & simp. AudioPre

* fix(audio): too many mallocs

* feat(audio): load_audio support stereo

* fix(audio): float32 wav saving

* fix(train): missing ckpt var
2024-11-28 03:20:14 +09:00
源文雨
4b68fb0e13 feat: update to latest torch & gradio version 2024-11-27 22:16:06 +09:00
源文雨
ba5572b8e0 typo&fix: remove unnecessary is_hp3 check 2024-11-27 16:11:53 +09:00
源文雨
9ee9f5fd81 fix: Pipeline.pipeline for empty f0_file
- Added checks for empty f0_file, skipping faulty inp_f0 creation if empty as it should
2024-10-10 01:57:04 +09:00
Tegar Bangun Suganda
277eadca9a Changing 2333333 to latest
Removing hard-coded "2333333" in latest saved checkpoint model. This make the code more logical and informative.
2024-08-14 21:02:11 +08:00
tkyaji
168616517a chore: bump librosa to version 0.10.2
There is a bug in librosa 0.9.1.
https://github.com/librosa/librosa/pull/1594

As a result, an error occurs when executing the "Vocals/Accompaniment Separation & Reverberation Removal" function.

To address this issue, librosa has been upgraded to version 0.10.2.
Additionally, torchcrepe has been upgraded due to its dependency on librosa.
2024-07-01 17:51:58 +09:00
源文雨
add4642b7e fix(train): parameter issue 2024-06-16 18:20:53 +09:00
源文雨
e936e24a91 optimize(infer): move ipex into rvc 2024-06-14 22:01:39 +09:00
源文雨
3b7d7c6d1a optimize(f0): move fcpe into rvc.f0 2024-06-14 21:33:46 +09:00
源文雨
e298fde29c optimize(crepe): move crepe into rvc.f0 2024-06-14 14:29:36 +09:00
源文雨
1e94e007d5 optimize(rvc.f0): rename inner defs 2024-06-13 00:51:22 +09:00
源文雨
8ac5597a3f optimize(rmvpe): move rmvpe into rvc.f0 2024-06-13 00:42:42 +09:00
源文雨
77b371d615 optimize(f0): move some f0s into rvc.f0 2024-06-13 00:10:22 +09:00
Alex Murkoff
1e22d468ea feat(audio): use PyAV instead of ffmpeg (#31)
* feat(audio): use PyAV instead of ffmpeg

replaced usage of ffmpeg in favor of PyAV (`av`)

* refactor(audio): store all of the audio related functions in the `infer.lib.audio`

refactors previous commit to have singular functions for each task, all located in `infer.lib.audio`

* fix(audio): remove downsample_audio from mdxnet.py

it is no longer needed, since it's imported from infer.lib.audio

* docs: remove every ffmpeg mention in the documentation to avoid confusion

* chore(requirements): remove ffmpeg-python and ffmpy from all requirements

* fix(audio): fix loading for UVR

wrapped gathering of META info from the stream into a function

fixes loading for UVR

* fix(audio): use np.frombuffer() instead of direct conversion of the resampled frames

this fixes traceback on preprocessing

* feat(audio): pre-allocate decoded_audio array in the load_audio function

this should improve performance, even if just a little

* Revert "docs: remove every ffmpeg mention in the documentation to avoid confusion"

This reverts commit 1e05bbce03.

* chore(format): run black on dev

* fix(requirements): revert removal of ffmpeg in unitest.yml and Dockerfile

* Revert "fix(requirements): revert removal of ffmpeg in unitest.yml and Dockerfile"

This reverts commit e28a0eebb2.

* feat(audio): pre-allocate numpy array to store the AudioFrame data in ndarray of dtype float32

* chore(format): run black on dev

* fix(audio): fix the decoded_audio size estimation

in estimated_total_samples we multiply by `sr` instead of `container.streams.audio[0].rate` since we want to estimate size of the OUTPUT file, not the input one. - Added dynamic resizing, in case something goes wrong and the size of decoded_audio is estimated incorrectly

Fixed function `load_audio` when the input audio's samplerate does not match the desired samplerate (`sr`)

* chore(format): run black on dev

* refactor(audio): remove `clean_path()` function as it serves no purpose anymore

* docs: remove everything related to ffmpeg

this includes everything except for formats support specification in the training_tips docs, since it has nothing to do with what ffmpeg does/did but rather what audio formats are supported (all the ones that ffmpeg supports!)

* docs: fix order of the steps in preparation in the READMEs

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-12 20:13:26 +09:00
源文雨
22715eab7c optimize(rmvpe): move mel&stft into rvc 2024-06-12 17:29:23 +09:00
github-actions[bot]
b4f7bbbe39 chore(format): run black on dev (#40)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-12 14:55:48 +09:00
源文雨
54f7ae097d optimize(jit): move hubert & synthesizer into rvc 2024-06-12 00:03:26 +09:00
源文雨
f956b333fa optimize(infer): move onnx into rvc 2024-06-11 17:21:05 +09:00
源文雨
4c4492a40e fix(i18n): missing translations 2024-06-11 16:06:08 +09:00
Alex Murkoff
70b43e8924 chore(i18n): use english as the base language for i18n (#22)
* chore(i18n): use english as the base language for i18n

rewrite all of the locale files to use english as base for translation

* fix(i18n): update rest of the scripts that rely on the chinese-base i18n translation

* chore(i18n): change some of the base strings to be more correct

* chore(i18n): sync locale on dev

* chore(format): run black on dev

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-11 12:33:56 +09:00
源文雨
7572c44911 optimize(rvc): move . into layers 2024-06-10 22:22:58 +09:00
源文雨
1a4cb9294e optimize(infer): move syns into rvc 2024-06-10 22:03:57 +09:00
github-actions[bot]
fe7a2bf41a chore(format): run black on dev (#23)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-10 21:38:19 +09:00
源文雨
e33ef19200 optimize(infer.synthesizer): all modules inherit from one 2024-06-10 21:34:35 +09:00
github-actions[bot]
b67050b2f7 chore(format): run black on dev (#19)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-10 01:14:46 +09:00
源文雨
b23ea7c6e7 optimize(train): move discriminators into rvc 2024-06-10 01:10:57 +09:00
源文雨
6bbe079100 fix(uvr5): cannot import name 'nets_61968KB' 2024-06-08 01:32:14 +09:00
源文雨
49488dcae9 optimize(rvc.utils): more type defs & rename 2024-06-07 19:33:45 +09:00
源文雨
5eed789fe7 optimize(rvc): move commons to rvc.utils
- remove redundant attentions_onnx
- shrink models_onnx
- add some type note to rvc.utils
2024-06-07 00:42:35 +09:00
源文雨
6f90ce3046 optimize(uvr5): remove redundant files 2024-06-06 21:34:45 +09:00
源文雨
53e596954c optimize(vc.hash): audio padding 2024-06-06 02:16:09 +09:00
github-actions[bot]
a322e706f9 chore(format): run black on dev (#2106)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-05 15:32:33 +09:00
源文雨
0d2189fdeb optimize(vc.modules): use general get_synthesizer 2024-06-05 15:27:20 +09:00