Fix multi-image and 2x speed improvements (DS-VL2) #157

Blaizzy · 2024-12-23T13:54:23Z

This PR introduces critical improvements and optimizations for multi-image inference, resulting in significant performance gains.

Fixes Multi-Image Inference
Resolved issues to ensure correct and efficient handling of multi-image inputs.
Achieves ~2x Speedup
Improved processing efficiency for both prompt handling and token generation, nearly doubling performance.

Performance remains consistent and stable, as shown below:

Significant improvements are evident in multi-image inference, as illustrated below:

…eration speedup)

Blaizzy added 2 commits December 23, 2024 14:42

fix multi-image and use .tolist() for (2.16× prompt and 1.83x for gen…

05d460d

…eration speedup)

format

0483a10

Blaizzy changed the title ~~Fix multi-image and 2x speed improvements~~ Fix multi-image and 2x speed improvements (DS-VL2) Dec 23, 2024

Provide feedback