YingqingHe / Awesome-LLMs-meet-Multimodal-Generation Star 85 Code Issues Pull requests 🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio). text-to-speech multimodality text-to-image text-to-audio text-to-video text-to-music multimodal-models aigc large-language-models text-to-3d multimodal-generation text-to-sound large-vision-language-models multimodal-large-language-models Updated May 27, 2024 HTML
GeoHaberC / Story-to-Video Star 31 Code Issues Pull requests Create a Movie animation plus Audio plus Subtitle from a text file ffmpeg text-to-video chatgpt text-to-sound Updated Mar 23, 2023 Python
kennethleungty / Text-to-Audio-with-Bark Star 14 Code Issues Pull requests Exploring Bark, the Open-Source Text-to-Audio Generative Model data-science machine-learning text-to-speech ai deep-learning speech artificial-intelligence bark text-prompt text-to-audio text-to-music prompt-engineering generative-ai text-to-sound gen-ai Updated Oct 10, 2023 Jupyter Notebook
ericpesto / ai-sample-generator Star 9 Code Issues Pull requests Create .wav audio samples with text-to-sound generative AI python music windows macos cli ai music-composition samples wav synthesis cli-app music-generation generative electronic-music sound-synthesis sample-generation prompt-engineering generative-ai text-to-sound Updated Nov 18, 2023 Python