Multi GPUs #28
Hi, if you are using accelerate to distribute your model across multiple GPUs, you should add "LlamaDecoderLayer_KIVI" to `no_split_module_classes`, like the sketch below.
In my experience, this may help.
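A minimal sketch of that setup, assuming the `LlamaForCausalLM_KIVI` class and import path from this repo (the checkpoint path is a placeholder):

```python
# Sketch: shard a KIVI model across GPUs with accelerate while keeping
# each decoder layer on a single device. The import path and checkpoint
# path are assumptions; adapt them to your checkout.
from accelerate import dispatch_model, infer_auto_device_map
from models.llama_kivi import LlamaForCausalLM_KIVI  # assumed repo layout

model = LlamaForCausalLM_KIVI.from_pretrained("path/to/llama-checkpoint")

# Listing the decoder layer class tells accelerate never to split one
# layer's weights across two devices, which is what otherwise triggers
# the "found at least two devices, cuda:0 and cuda:1" error.
device_map = infer_auto_device_map(
    model, no_split_module_classes=["LlamaDecoderLayer_KIVI"]
)
model = dispatch_model(model, device_map=device_map)
```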
Could you please provide the original code for testing memory and multi-batch speed?
I am not the paper author nor the repo owner... I am the one who opened issue #24 several months ago, and I have never encountered this error before.
I solved the problem, thank you very much for your help.
I ran mem_spd_test.py and got the following error:
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
I did not make any changes except the model path.
I manually changed the device and got the same error as in #24.
Any suggestions?