Max Morrison comments

Results 31 comments of


                                            Max Morrison

[WIP] Decoding from the categorical posterior distribution

@justinsalamon we can further narrow the dependency requirement to just two functions (librosa.sequence.viterbi and librosa.sequence._viterbi) and a custom error class (librosa.util.exceptions.ParameterError). Those 3 have no other dependencies within librosa. I...

Bug in Viterbi decoding?

Hi @jongwook. I think my discussions with @justinsalamon today prompted this issue. I used librosa.sequence.viterbi to decode discrete emissions directly from the posterior observations emitted by the network. I tried...

Bug in Viterbi decoding?

A PR has been started! Regarding the theoretical justification, yes: the output of softmax (the posterior distribution) should be used and not the raw logits. As the network was trained...

Figure out default model storage directory on Windows.

Currently, files are downloaded into `C:\tmp\nussl`. PyTorch relies on either a explicit string argument or two environment variables. [Here](https://pytorch.org/docs/stable/_modules/torch/utils/model_zoo.html#load_url) is the relevant code for PyTorch. Keras creates the folder `C:\Users\\.keras`....

DeepClustering incompatible with PyTorch on Windows

Sorry for the wait. I'm getting a `FileNotFoundError`: `FileNotFoundError: [Errno 2] No such file or directory: '/media/ext/models/deep_clustering_vocal_44k_long.model'`

DeepClustering incompatible with PyTorch on Windows

Ah, so that's why you had me run the DeepSeparation script. That makes sense to me now. For reference, that script with DeepSeparation worked just fine on Windows, whereas DeepClustering...

inference speed

The inference speed is dependent on the compute you are using. If you want comparable results to those given in the paper, make sure your compute is comparable to the...

TypeError: can't convert np.ndarray of type numpy.uint16.

You do not need to modify `torchcrepe`. `ljspeech` is not a dataset that I implemented. You should check how you setup that dataset and compare it to the examples provided...

TypeError: can't convert np.ndarray of type numpy.uint16.

Yep, you were both correct. `librosa` updated the return type of `librosa.sequence.viterbi` from `np.int64` to `np.uint16`. Fixed in `torchcrepe` version 0.0.16.

How do i train my own model？

For a new dataset, you'll make a function in [download.py](https://github.com/descriptinc/cargan/blob/master/cargan/data/download.py). This function downloads and unzips the dataset, then prepares a cache of 22.05 kHz wav files. See the other examples...