UC Berkeley CS285 Deep Reinforcement Learning Fall 2022

My Solutions of Assignments for Berkeley CS 285: Deep Reinforcement Learning, Decision Making, and Control.

Disclaimer: My solutions did pass all the Gradescope tests but they may still contain errors. Also, don't copy code directly and the solutions here are meant to help you if you get trapped.

Getting Started

# requirements Python=3.7

mujoco==2.2.0
gym==0.25.2
tensorboard==2.10.0
tensorboardX==2.5.1
matplotlib==3.5.3
ipython==7.34.0
moviepy==1.0.3
pyvirtualdisplay==3.0
torch==1.12.1
opencv-python==4.6.0.66
ipdb==0.13.9
swig==4.0.2
box2d-py==2.3.8
networkx==2.5

Folder Structure

Only necessary files are displayed.

- hw[number]
    + cs285 (contains running scripts)
        + agents
        + policies
        + ...
    + report.pdf
    + requirement.txt
    + setup.py
    + cs285_hw[num].pdf (hw instructions)
    
- lectures

Homework Topic Summary

- HW 1: Imitation Learning
    + Behavioral Cloning (BC)
    + DAgger

- HW 2: Policy Gradients
    + Implementing Policy Gradients
    + Small-Scale Experiments
    + Implementing Neural Network Baselines
    + Implementing Generalized Advantage Estimation

- HW 3: Q-Learning and Actor-Critic Algorithms
    - Q-Learning
        + basic Q-learning performance (DQN)
        + double Q-learning (DDQN)
        + experimenting with hyperparameters
    - Actor-Critic
    - Soft Actor-Critic (SAC)

- HW 4: Model-Based Reinforcement Learning
    + Dynamics Model (Dyna)
    + Action Selection using MBRL / CEM
    + On-Policy Data Collection
    + Ensembles (MBPO)

- HW 5: Exploration Strategies and Offline Reinforcement Learning
    + Random Network Distillation (RND) Algorithm
    + Boltzman Exploration
    + Conservative Q-Learning (CQL) Algorithm
    + Advantage Weighted Actor Critic (AWAC) Algorithm
    + Implicit Q-Learning (IQL) Algorithm

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
hw1		hw1
hw2		hw2
hw3		hw3
hw4		hw4
hw5		hw5
lectures		lectures
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hw1

hw1

hw2

hw2

hw3

hw3

hw4

hw4

hw5

hw5

lectures

lectures

.gitignore

.gitignore

README.md

README.md

Repository files navigation

UC Berkeley CS285 Deep Reinforcement Learning Fall 2022

Getting Started

Folder Structure

Homework Topic Summary

About

Releases

Packages

Languages

xd00099/CS285-DeepReinforcementLearning-Berkeley

Folders and files

Latest commit

History

Repository files navigation

UC Berkeley CS285 Deep Reinforcement Learning Fall 2022

Getting Started

Folder Structure

Homework Topic Summary

About

Topics

Resources

Stars

Watchers

Forks

Languages