There is a bug in librosa 0.9.1.
https://github.com/librosa/librosa/pull/1594
As a result, an error occurs when executing the "Vocals/Accompaniment Separation & Reverberation Removal" function.
To address this issue, librosa has been upgraded to version 0.10.2.
Additionally, torchcrepe has been upgraded due to its dependency on librosa.
* chore: remove type subscription for np.ndarray specification
this is done to avoid errors on python<3.9
* chore: remove import of `Any` from typing since it's unused
* refactor(scripts): simplify run.sh script
a lot of the contents of run.sh is either wrong, unnecessary, or both! a program runner-script that installs software on your PC is a terrible idea, since users themselves should make such decisions, not some random installer/runner scripts from a github repo. the overfixation on python3.8 makes this script practically entirely useless on any linux system that isn't based on debian, or is based on debian but is heavily modified. on archlinux this script is useless, it wont install anything for you and it wont find python3.8 since there's no such thing as python3.8 ANYWHERE except for a few distros where older versions of software are not stored in an archive, but rather in the packages repo
* refactor(scripts): apply a lot more of refactoring of run script
* fix(scripts): remove env variable setting for macOS systems
use --pycmd python instead of --pycmd python3
* feat(audio): use PyAV instead of ffmpeg
replaced usage of ffmpeg in favor of PyAV (`av`)
* refactor(audio): store all of the audio related functions in the `infer.lib.audio`
refactors previous commit to have singular functions for each task, all located in `infer.lib.audio`
* fix(audio): remove downsample_audio from mdxnet.py
it is no longer needed, since it's imported from infer.lib.audio
* docs: remove every ffmpeg mention in the documentation to avoid confusion
* chore(requirements): remove ffmpeg-python and ffmpy from all requirements
* fix(audio): fix loading for UVR
wrapped gathering of META info from the stream into a function
fixes loading for UVR
* fix(audio): use np.frombuffer() instead of direct conversion of the resampled frames
this fixes traceback on preprocessing
* feat(audio): pre-allocate decoded_audio array in the load_audio function
this should improve performance, even if just a little
* Revert "docs: remove every ffmpeg mention in the documentation to avoid confusion"
This reverts commit 1e05bbce03.
* chore(format): run black on dev
* fix(requirements): revert removal of ffmpeg in unitest.yml and Dockerfile
* Revert "fix(requirements): revert removal of ffmpeg in unitest.yml and Dockerfile"
This reverts commit e28a0eebb2.
* feat(audio): pre-allocate numpy array to store the AudioFrame data in ndarray of dtype float32
* chore(format): run black on dev
* fix(audio): fix the decoded_audio size estimation
in estimated_total_samples we multiply by `sr` instead of `container.streams.audio[0].rate` since we want to estimate size of the OUTPUT file, not the input one. - Added dynamic resizing, in case something goes wrong and the size of decoded_audio is estimated incorrectly
Fixed function `load_audio` when the input audio's samplerate does not match the desired samplerate (`sr`)
* chore(format): run black on dev
* refactor(audio): remove `clean_path()` function as it serves no purpose anymore
* docs: remove everything related to ffmpeg
this includes everything except for formats support specification in the training_tips docs, since it has nothing to do with what ffmpeg does/did but rather what audio formats are supported (all the ones that ffmpeg supports!)
* docs: fix order of the steps in preparation in the READMEs
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>