feature(wrh): add RoPE for unizero #263

ruiheng123 · 2024-08-13T07:19:42Z

We add RoPE for unizero

puyuan1996 · 2024-08-13T07:24:30Z

lzero/model/unizero_world_models/transformer.py

@@ -55,6 +55,15 @@ def __init__(self, config: TransformerConfig) -> None:
        self.blocks = nn.ModuleList([Block(config) for _ in range(config.num_layers)])
        self.ln_f = nn.LayerNorm(config.embed_dim)

+        self.config.rope_theta = 500000
+        self.config.max_seq_len = 2048


这个参数确定一下，是否应该设置成与实际训练的长度一致

rope_theta 是影响位置编码的频率，应该就用默认的就行。max_seq_len 是最大序列长度，它决定了预计算频率张量的长度，如果我们希望在测试时支持更长的序列，应该将 max_seq_len 设置为能覆盖我们期望的最大测试序列长度，例如如果我们测试最长是2048，这个值应该设置为2048， 10有点太小了，如果测试长度>10会报错。

puyuan1996 · 2024-08-13T07:25:37Z

zoo/atari/config/atari_unizero_config.py

+max_env_step = int(5e5)
+reanalyze_ratio = 0.
+batch_size = 2
+num_unroll_steps = 10


训练的时候，不是用的debug config吧

不是，只是提交上来的是debug版的，但Training的过程里用的不是

PaParaZz1 · 2024-09-20T07:40:48Z

This PR will be updated in #266.

dyyoungg and others added 2 commits August 8, 2024 18:45

feature(pu): add rope in unizero's transformer

1f1df62

feature(wrh): add RoPE for unizero

c6ef89d

puyuan1996 added the enhancement New feature or request label Aug 13, 2024

puyuan1996 reviewed Aug 13, 2024

View reviewed changes

ruiheng123 added 2 commits August 13, 2024 07:30

feature(wrh): add RoPE for unizero

1767a4e

feature(wrh): add RoPE for unizero

b82a027

puyuan1996 mentioned this pull request Aug 15, 2024

feature(pu): add rope in unizero's transformer #261

Closed

PaParaZz1 closed this Sep 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature(wrh): add RoPE for unizero #263

feature(wrh): add RoPE for unizero #263

ruiheng123 commented Aug 13, 2024

puyuan1996 Aug 13, 2024

puyuan1996 Aug 13, 2024

puyuan1996 Aug 13, 2024

ruiheng123 Aug 13, 2024

PaParaZz1 commented Sep 20, 2024

feature(wrh): add RoPE for unizero #263

feature(wrh): add RoPE for unizero #263

Conversation

ruiheng123 commented Aug 13, 2024

puyuan1996 Aug 13, 2024

Choose a reason for hiding this comment

puyuan1996 Aug 13, 2024

Choose a reason for hiding this comment

puyuan1996 Aug 13, 2024

Choose a reason for hiding this comment

ruiheng123 Aug 13, 2024

Choose a reason for hiding this comment

PaParaZz1 commented Sep 20, 2024