Skip to content

多卡train_full Pixtral-12B一段时间后报错: torch.distributed.elastic.multiprocessing.errors.ChildFailedError #2000

多卡train_full Pixtral-12B一段时间后报错: torch.distributed.elastic.multiprocessing.errors.ChildFailedError

多卡train_full Pixtral-12B一段时间后报错: torch.distributed.elastic.multiprocessing.errors.ChildFailedError #2000

Triggered via issue January 10, 2025 02:59
Status Success
Total duration 8s
Artifacts

label_issue.yml

on: issues
label_issue
0s
label_issue
Fit to window
Zoom out
Zoom in

Annotations

1 warning
label_issue
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636