Do vtensor need 64K/128K physical memory policy? #24

nalinaly · 2024-08-08T07:48:13Z

vAttention said that: if use 2M pageSize, 128M physical memory can be wasted per-request in the worst-case in Llama-3-8B (TP-1), but if use 64KB, 128M would be only 4M
Do vtensor have the same problem？ Will vtensor integrate 64K/128K pageSize in the future?

dream110fly mentioned this issue Sep 19, 2024

question about torch 2.1.0 integration #22

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do vtensor need 64K/128K physical memory policy? #24

Do vtensor need 64K/128K physical memory policy? #24

nalinaly commented Aug 8, 2024

Do vtensor need 64K/128K physical memory policy? #24

Do vtensor need 64K/128K physical memory policy? #24

Comments

nalinaly commented Aug 8, 2024