在训练nanoGPT时出现了以下报错，请问我该怎么做 #17525

1os3 · 2023-04-29T00:11:11Z

1os3
Apr 29, 2023

(base) PS E:\GPT-2\nanoGPT> python train.py config/train_shakespeare_char.py --compile=False --block_size=64
Overriding config with config/train_shakespeare_char.py:

train a miniature character-level shakespeare model

good for debugging and playing on macbooks and such

out_dir = 'out-shakespeare-char'
eval_interval = 250 # keep frequent because we'll overfit
eval_iters = 200
log_interval = 10 # don't print too too often

we expect to overfit on this small dataset, so only save when val improves

always_save_checkpoint = False

wandb_log = False # override via command line if you like
wandb_project = 'shakespeare-char'
wandb_run_name = 'mini-gpt'

dataset = 'shakespeare_char'
gradient_accumulation_steps = 1
batch_size = 64
block_size = 256 # context of up to 256 previous characters

baby GPT model :)

n_layer = 6
n_head = 6
n_embd = 384
dropout = 0.2

learning_rate = 1e-3 # with baby networks can afford to go a bit higher
max_iters = 5000
lr_decay_iters = 5000 # make equal to max_iters usually
min_lr = 1e-4 # learning_rate / 10 usually
beta2 = 0.99 # make a bit bigger because number of tokens per iter is small

warmup_iters = 100 # not super necessary potentially

on macbook also add

device = 'cpu' # run on cpu only

compile = False # do not torch compile the model

Overriding: compile = False
Overriding: block_size = 64
tokens per iteration will be: 4,096
Traceback (most recent call last):
File "E:\GPT-2\nanoGPT\train.py", line 110, in
ctx = nullcontext() if device_type == 'cpu' else torch.amp.autocast(device_type=device_type, dtype=ptdtype)
File "D:\PSAutoRecover\lib\site-packages\torch\amp\autocast_mode.py", line 234, in init
raise RuntimeError('Current CUDA Device does not support bfloat16. Please switch dtype to float16.')
RuntimeError: Current CUDA Device does not support bfloat16. Please switch dtype to float16.
最后一行好像是错误
Uploading nanoGPT…

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

在训练nanoGPT时出现了以下报错，请问我该怎么做 #17525

{{title}}

Replies: 0 comments

Select a reply

在训练nanoGPT时出现了以下报错，请问我该怎么做 #17525

1os3 Apr 29, 2023

train a miniature character-level shakespeare model

good for debugging and playing on macbooks and such

we expect to overfit on this small dataset, so only save when val improves

baby GPT model :)

on macbook also add

device = 'cpu' # run on cpu only

compile = False # do not torch compile the model

Replies: 0 comments

1os3
Apr 29, 2023