1
0
mirror of https://github.com/fumiama/Retrieval-based-Voice-Conversion-WebUI.git synced 2026-06-06 01:30:24 +08:00
Commit Graph

223 Commits

Author SHA1 Message Date
Yongkun Li
ef9c8eb656 fix: Add weight whitelist support for torch 2.6 (#110) 2025-02-07 15:26:01 +08:00
github-actions[bot]
b0552625f7 chore(format): run black on dev (#104)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-01-01 00:24:50 +09:00
源文雨
fe11be3c94 fix(train): matplotlib deprecation (#103) 2025-01-01 00:23:16 +09:00
github-actions[bot]
89f7fa25cc chore(format): run black on dev (#102)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-29 00:36:19 +09:00
源文雨
ef9db1fd44 fix(rt): replace with new f0 2024-11-29 00:35:10 +09:00
github-actions[bot]
51c85fcc49 chore(format): run black on dev (#101)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-28 23:20:17 +09:00
源文雨
5969314e8d optimize(uvr5): apply jit to spec_utils & fix flac save
also fix #85
2024-11-28 23:19:05 +09:00
github-actions[bot]
19619161be chore(format): run black on dev (#98)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-28 18:07:17 +09:00
源文雨
7befbd10d9 optimize(train): combine extract f0 together 2024-11-28 18:03:17 +09:00
github-actions[bot]
d3add81469 chore(format): run black on dev (#94)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-28 03:21:10 +09:00
源文雨
a8783c6639 optimize: some training optimizations (#95)
* optimzie(train&uvr5): rm sf & simp. AudioPre

* fix(audio): too many mallocs

* feat(audio): load_audio support stereo

* fix(audio): float32 wav saving

* fix(train): missing ckpt var
2024-11-28 03:20:14 +09:00
源文雨
4b68fb0e13 feat: update to latest torch & gradio version 2024-11-27 22:16:06 +09:00
源文雨
ba5572b8e0 typo&fix: remove unnecessary is_hp3 check 2024-11-27 16:11:53 +09:00
源文雨
9ee9f5fd81 fix: Pipeline.pipeline for empty f0_file
- Added checks for empty f0_file, skipping faulty inp_f0 creation if empty as it should
2024-10-10 01:57:04 +09:00
Tegar Bangun Suganda
277eadca9a Changing 2333333 to latest
Removing hard-coded "2333333" in latest saved checkpoint model. This make the code more logical and informative.
2024-08-14 21:02:11 +08:00
github-actions[bot]
8cd1fdd372 chore(format): run black on dev (#66)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-07-01 17:53:21 +09:00
tkyaji
168616517a chore: bump librosa to version 0.10.2
There is a bug in librosa 0.9.1.
https://github.com/librosa/librosa/pull/1594

As a result, an error occurs when executing the "Vocals/Accompaniment Separation & Reverberation Removal" function.

To address this issue, librosa has been upgraded to version 0.10.2.
Additionally, torchcrepe has been upgraded due to its dependency on librosa.
2024-07-01 17:51:58 +09:00
源文雨
e4138bf1e8 feat(rvcmd): update version to v0.2.5 2024-06-17 21:31:51 +09:00
源文雨
839c60e56d feat(rvcmd): update version to v0.2.5 2024-06-17 21:29:40 +09:00
github-actions[bot]
9ee26d698a chore(format): run black on dev (#60)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-16 20:08:03 +09:00
源文雨
add4642b7e fix(train): parameter issue 2024-06-16 18:20:53 +09:00
github-actions[bot]
1410bd4d15 chore(format): run black on dev (#59)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-16 17:30:03 +09:00
源文雨
3a79d81907 fix(rtrvc): parameter issue 2024-06-16 17:25:01 +09:00
源文雨
0d5cd347bc fix(rtrvc): skip head unimplemented 2024-06-16 16:46:59 +09:00
源文雨
df83554ac1 fix(rtrvc): parameter issue 2024-06-16 15:04:48 +09:00
github-actions[bot]
a246a669cd chore(format): run black on dev (#58)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-16 14:38:44 +09:00
源文雨
8ce397da9c optimize(rtrvc): impl. rvc f0s 2024-06-16 14:36:50 +09:00
源文雨
c51a73f521 optimize(infer): move jit into rvc 2024-06-14 22:44:07 +09:00
源文雨
e936e24a91 optimize(infer): move ipex into rvc 2024-06-14 22:01:39 +09:00
源文雨
3b7d7c6d1a optimize(f0): move fcpe into rvc.f0 2024-06-14 21:33:46 +09:00
github-actions[bot]
83ce9a9903 chore(format): run black on dev (#50)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-14 14:35:34 +09:00
源文雨
e298fde29c optimize(crepe): move crepe into rvc.f0 2024-06-14 14:29:36 +09:00
源文雨
1e94e007d5 optimize(rvc.f0): rename inner defs 2024-06-13 00:51:22 +09:00
源文雨
8ac5597a3f optimize(rmvpe): move rmvpe into rvc.f0 2024-06-13 00:42:42 +09:00
源文雨
77b371d615 optimize(f0): move some f0s into rvc.f0 2024-06-13 00:10:22 +09:00
github-actions[bot]
d44a942882 chore(format): run black on dev (#45)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-12 23:30:50 +09:00
Alex Murkoff
09285d5f5b perf: improve codec handling in load_filepaths_and_text function in infer.lib.train.utils (#44) 2024-06-12 12:53:56 +00:00
源文雨
e486649a91 optimize(rmvpe): move deepunet&e2e into rvc 2024-06-12 20:51:46 +09:00
Alex Murkoff
1e22d468ea feat(audio): use PyAV instead of ffmpeg (#31)
* feat(audio): use PyAV instead of ffmpeg

replaced usage of ffmpeg in favor of PyAV (`av`)

* refactor(audio): store all of the audio related functions in the `infer.lib.audio`

refactors previous commit to have singular functions for each task, all located in `infer.lib.audio`

* fix(audio): remove downsample_audio from mdxnet.py

it is no longer needed, since it's imported from infer.lib.audio

* docs: remove every ffmpeg mention in the documentation to avoid confusion

* chore(requirements): remove ffmpeg-python and ffmpy from all requirements

* fix(audio): fix loading for UVR

wrapped gathering of META info from the stream into a function

fixes loading for UVR

* fix(audio): use np.frombuffer() instead of direct conversion of the resampled frames

this fixes traceback on preprocessing

* feat(audio): pre-allocate decoded_audio array in the load_audio function

this should improve performance, even if just a little

* Revert "docs: remove every ffmpeg mention in the documentation to avoid confusion"

This reverts commit 1e05bbce03.

* chore(format): run black on dev

* fix(requirements): revert removal of ffmpeg in unitest.yml and Dockerfile

* Revert "fix(requirements): revert removal of ffmpeg in unitest.yml and Dockerfile"

This reverts commit e28a0eebb2.

* feat(audio): pre-allocate numpy array to store the AudioFrame data in ndarray of dtype float32

* chore(format): run black on dev

* fix(audio): fix the decoded_audio size estimation

in estimated_total_samples we multiply by `sr` instead of `container.streams.audio[0].rate` since we want to estimate size of the OUTPUT file, not the input one. - Added dynamic resizing, in case something goes wrong and the size of decoded_audio is estimated incorrectly

Fixed function `load_audio` when the input audio's samplerate does not match the desired samplerate (`sr`)

* chore(format): run black on dev

* refactor(audio): remove `clean_path()` function as it serves no purpose anymore

* docs: remove everything related to ffmpeg

this includes everything except for formats support specification in the training_tips docs, since it has nothing to do with what ffmpeg does/did but rather what audio formats are supported (all the ones that ffmpeg supports!)

* docs: fix order of the steps in preparation in the READMEs

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-12 20:13:26 +09:00
源文雨
22715eab7c optimize(rmvpe): move mel&stft into rvc 2024-06-12 17:29:23 +09:00
github-actions[bot]
b4f7bbbe39 chore(format): run black on dev (#40)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-12 14:55:48 +09:00
源文雨
54f7ae097d optimize(jit): move hubert & synthesizer into rvc 2024-06-12 00:03:26 +09:00
源文雨
f956b333fa optimize(infer): move onnx into rvc 2024-06-11 17:21:05 +09:00
源文雨
4c4492a40e fix(i18n): missing translations 2024-06-11 16:06:08 +09:00
github-actions[bot]
da7dee427a chore(format): run black on dev (#30) 2024-06-11 13:11:10 +09:00
Alex Murkoff
9d699b1d99 perf: use hashing to determine the format in infer/lib/audio.py (#26) 2024-06-11 13:02:54 +09:00
Alex Murkoff
70b43e8924 chore(i18n): use english as the base language for i18n (#22)
* chore(i18n): use english as the base language for i18n

rewrite all of the locale files to use english as base for translation

* fix(i18n): update rest of the scripts that rely on the chinese-base i18n translation

* chore(i18n): change some of the base strings to be more correct

* chore(i18n): sync locale on dev

* chore(format): run black on dev

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-11 12:33:56 +09:00
github-actions[bot]
a6c6262d91 chore(format): run black on dev (#25)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-06-10 22:27:48 +09:00
源文雨
7572c44911 optimize(rvc): move . into layers 2024-06-10 22:22:58 +09:00
源文雨
1a4cb9294e optimize(infer): move syns into rvc 2024-06-10 22:03:57 +09:00