Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ONNX conversion? #1

Open
tripathiarpan20 opened this issue Jan 7, 2024 · 1 comment
Open

ONNX conversion? #1

tripathiarpan20 opened this issue Jan 7, 2024 · 1 comment

Comments

@tripathiarpan20
Copy link

Greetings!

I happened to notice you mentioning ONNX conversion in the original basic-pitch repo (spotify/basic-pitch#64 (comment)), I was hoping you could provide some insights on the same, and if possible provide some relevant code.

Was hoping to deploy it fully on the web using ONNXruntime with WebGPU and benchmark the speedup.

References:
https://onnxruntime.ai/docs/tutorials/web/
https://github.com/webonnx/wonnx

Thanks in advance

@achimmihca
Copy link
Owner

Hi,

onnx conversion is not a big deal.

However, this is only the neural network of the machine learning model.
What is not part of the model itself is

  • load samples of an audio file
  • convert the samples of the audio file to the input format for the machine learning model (typically mono channel, specific sample rate, etc.)

These steps are somewhere in the basic-pitch python code.
But I did not have enough time to understand and rewrite this for my own (C#) code. As a result, I took the shortcut of compiling Basic Pitch to an exe and just call this from my code.

But it would be great if you could write code to load audio file and create the correct inputs for the model. Good luck!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants