Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AssertionError: model parallel group is not initialized #104

Open
nikich340 opened this issue Feb 9, 2023 · 1 comment
Open

AssertionError: model parallel group is not initialized #104

nikich340 opened this issue Feb 9, 2023 · 1 comment

Comments

@nikich340
Copy link

Traceback (most recent call last):
  File "D:\_ruGPT3\ru-gpts\run_gpt3medium.py", line 11, in <module>
    gpt = RuGPT3XL.from_pretrained("sber-gpt3-medium", seq_len=512)
  File "D:\_ruGPT3\ru-gpts\src\xl_wrapper.py", line 178, in from_pretrained
    mpu.model_parallel_cuda_manual_seed(seed)
  File "D:\_ruGPT3\ru-gpts\src\mpu\random.py", line 217, in model_parallel_cuda_manual_seed
    model_parallel_seed = offset + get_model_parallel_rank()
  File "D:\_ruGPT3\ru-gpts\src\mpu\initialize.py", line 109, in get_model_parallel_rank
    return torch.distributed.get_rank(group=get_model_parallel_group())
  File "D:\_ruGPT3\ru-gpts\src\mpu\initialize.py", line 90, in get_model_parallel_group
    assert _MODEL_PARALLEL_GROUP is not None, \
AssertionError: model parallel group is not initialized

Used "Usage" example code but with rugpt3-medium model. Os: windows 10, py 3.9, gpu: rtx 3060 laptop.

@LEv145
Copy link

LEv145 commented Feb 14, 2023

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants