
Inference Speed #391

Open
joaopedro-fg opened this issue Mar 18, 2024 · 1 comment

Comments

@joaopedro-fg

Hello!
I'm using PyABSA in an application where I have to run aspect term extraction and polarity classification on about 3,000 texts every 15 minutes. At the moment I'm using an Nvidia L4, but it still takes about 30 minutes to process all the texts. Is there any way to speed up inference?

@yangheng95
Owner

Maybe you can use a smaller max modeling length (e.g., 80) and a larger batch size (64 or 128).
You can also try fp16 precision using torch.cuda.amp.autocast().
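For reference, here is a minimal sketch of that autocast pattern. The `nn.Linear` stand-in model and the tensor shapes are purely illustrative assumptions; in practice you would wrap the PyABSA model's forward/predict call instead. It assumes a CUDA device is available.

```python
import torch
import torch.nn as nn

# Stand-in for the real model; illustrative only. In practice this would be
# the PyABSA ATEPC model's underlying torch module.
model = nn.Linear(768, 3).cuda().eval()

# A larger inference batch, as suggested above (e.g., 128 inputs at once).
batch = torch.randn(128, 768, device="cuda")

# inference_mode disables autograd bookkeeping; autocast runs eligible ops
# (linear/matmul layers) in fp16, which typically speeds up transformer
# inference on GPUs like the L4 with minimal accuracy loss.
with torch.inference_mode(), torch.cuda.amp.autocast():
    logits = model(batch)

print(logits.dtype)  # torch.float16 for ops run under autocast
```

Combining the three suggestions, the idea is: shorter sequences reduce per-token compute, larger batches improve GPU utilization, and fp16 roughly halves the arithmetic and memory cost of each forward pass.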
