Replies: 2 comments
-
Thanks!
Cool idea. Neat that you've already built something for yourself that lets you use it with Khoj! We could use Whisper to let folks search and chat with audio and video content as well. But I'll have to think about how to prioritize this. The tradeoff between the number of data sources we support and having Khoj work smoothly is real 😅 Could we also try making your video/audio-to-transcript tool an external plugin for Khoj? Khoj has supported this for a few months now. A third-party Khoj plugin just needs to output its data into a structured JSONL file, and Khoj can then index it for search and chat on its own.
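To make the plugin idea a bit more concrete, here is a rough sketch of what the output step of such a tool could look like: one JSON object per line, one line per transcript. The function name `transcripts_to_jsonl` and the field names `file`, `raw` and `compiled` are assumptions for illustration only, so check the Khoj docs for the actual schema before relying on them.

```python
import json
from pathlib import Path


def transcripts_to_jsonl(transcripts, output_path):
    """Write one JSON object per line for each (source file, transcript) pair.

    The keys "file", "raw" and "compiled" are hypothetical placeholders,
    not a confirmed Khoj schema.
    """
    written = 0
    with open(output_path, "w", encoding="utf-8") as out:
        for source, text in transcripts.items():
            text = text.strip()
            if not text:
                # Skip recordings that produced no transcript
                continue
            entry = {
                "file": source,  # where the transcript came from
                "raw": text,  # transcript text as produced
                "compiled": f"{Path(source).stem}\n{text}",  # title + text
            }
            # ensure_ascii=False keeps non-English transcripts readable
            out.write(json.dumps(entry, ensure_ascii=False) + "\n")
            written += 1
    return written
```

Something like `transcripts_to_jsonl({"lecture01.mp4": "..."}, "transcripts.jsonl")` would then produce a file with one entry per non-empty transcript.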
-
It would be an honor to have my first app used. Let me know how I can help. I have a career-defining exam this coming week, so I'll be out of commission, but I'll make sure the app is current in the repository. I currently use it for med school lectures, or to summarize YT channels like "CoinBureau" or "David Shapiro ~ AI" with GPT-4 or Claude 2.
-
First, congrats on the YCombinator news!
My idea:
Include a transcription option, so users can ingest video and audio knowledge. I have tons of knowledge that is only available on video, which I have been transcribing for ingestion by Khoj.
I made my own Python app with a GUI that automatically goes all the way from video -> audio -> text, but it's nowhere near as robust and optimized as the Whisper.cpp project.
What do you think?