F5TTS-TensorRT-LLM

该项目主要受https://github.com/DakeQQ/F5-TTS-ONNX启发，前处理和后处理由于耗时较低并没有进行优化，都用的onnx方案。而backbone部分主要是onnx版的trtllm实现，重写了网络，重写一些trtllm不支持的算子和操作。最终推理性能提高了4倍左右。

目录中model和example分别对应Tensorrt-LLM源码中的tensorrt_llm/models和example。

在Tensorrt-LLM源码中的tensorrt_llm/models和example目录下分别新建f5tts/ 目录，然后将repo中的代码放入对应的目录。
example/f5tts目录如下。需要通过导出前处理以及后处理的onnx、basic_ref_zh.wav和vocab.txt可从源码repo中获取。

.  
├── F5_Decode.onnx  
├── F5_Preprocess.onnx  
├── __pycache__  
│   └── diffusion.cpython-310.pyc  
├── basic_ref_zh.wav  
├── ckpts  
│   └── model_1200000.pt  
├── convert_checkpoint.py  
├── sample_tts.py  
└── vocab.txt

tensorrt_llm/models/f5tts目录如下。

.  
├── model.py  
├── module.py

在tensorrt_llm/models/init.py导入f5tts

from .f5tts.model import F5TTS

模型并在MODEL_MAP中注册模型。

'F5TTS': F5TTS

1.convert_checkpoint

cd example/f5tts
python convert_checkpoint.py

2.build engine(支持Tensor 并行, --tp_size)

trtllm-build --checkpoint_dir ./tllm_checkpoint/ --remove_input_padding disable --bert_attention_plugin disable

3.inference

python sample.py

Reference

F5-TTS-ONNX：https://github.com/DakeQQ/F5-TTS-ONNX
F5-TTS：https://github.com/SWivid/F5-TTS

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
Export_ONNX		Export_ONNX
example		example
model		model
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

F5TTS-TensorRT-LLM

1.convert_checkpoint

2.build engine(支持Tensor 并行, --tp_size)

3.inference

Reference

About

Releases

Packages

Languages

Bigfishering/f5-tts-trtllm

Folders and files

Latest commit

History

Repository files navigation

F5TTS-TensorRT-LLM

1.convert_checkpoint

2.build engine(支持Tensor 并行, --tp_size)

3.inference

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages