
SayCan-Extended

Extending the SayCan framework: enhancing language-driven robotic task planning and execution with improved scoring, direct control, and pipeline optimization.


Overview

This project extends the original SayCan robotic task planning framework. Key updates include:

  • Direct Control Mechanisms using PyBullet.
  • Enhanced Scoring Systems for affordance and language evaluations (see the combined-scoring sketch below).
  • Batch Processing Optimization to improve throughput and reduce latency.
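
SayCan ranks candidate steps by combining the language model's preference for each step with the skill's affordance (its estimated feasibility). Below is a minimal sketch of that combination, assuming softmax-normalized LLM scores; the option strings and numeric values are purely illustrative:

import numpy as np

def combined_scores(llm_log_scores, affordance_scores):
    # Softmax the LLM log-scores into probabilities (numerically stable).
    llm_probs = np.exp(llm_log_scores - np.max(llm_log_scores))
    llm_probs /= llm_probs.sum()
    # SayCan-style combination: usefulness (LLM) x feasibility (affordance).
    combined = llm_probs * np.asarray(affordance_scores)
    return combined / combined.sum()

# Hypothetical options and scores, for illustration only.
options = ["pick up the red block", "pick up the blue bowl", "done"]
scores = combined_scores(np.array([-1.2, -2.5, -4.0]), [0.9, 0.4, 1.0])
print(options[int(np.argmax(scores))])  # highest combined score wins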

Setup and Installation

Follow these steps to set up the environment and run the experiments:

1. Clone the Repository

git clone https://github.com/csce585-mlsystems/SayCan-Extended.git
cd SayCan-Extended

2. Create and Activate Conda Environment

conda create -n saycan python=3.8
conda activate saycan

3. Install Dependencies

Ensure all required packages are installed:

pip install --upgrade pip
pip install -r requirements.txt

4. Required Assets

On its first run, the project automatically downloads the assets it needs (a minimal download pattern is sketched after this list), including:

  • UR5e robot URDF files
  • Robotiq 2F-85 gripper files
  • Bowl assets
  • ViLD pretrained model weights
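
If you ever need to fetch assets manually, the pattern is a simple check-then-download loop. A minimal sketch; the asset path and URL below are placeholders, not the project's actual sources:

import os
import urllib.request

# Placeholder mapping; substitute the real asset URLs used by the notebook.
ASSETS = {
    "ur5e/ur5e.urdf": "https://example.com/assets/ur5e.urdf",
}

def ensure_assets(asset_map, root="assets"):
    # Download each asset only if it is not already present on disk.
    for rel_path, url in asset_map.items():
        dest = os.path.join(root, rel_path)
        if not os.path.exists(dest):
            os.makedirs(os.path.dirname(dest), exist_ok=True)
            urllib.request.urlretrieve(url, dest)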

Environment

The simulation environment is built with PyBullet (a minimal setup sketch follows this list) and includes:

  • UR5e robotic arm
  • Robotiq 2F-85 gripper
  • Manipulatable objects (blocks, bowls)
  • Cameras for top-down and perspective views
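
A minimal sketch of how such an environment is typically assembled in PyBullet; the UR5e URDF path is an assumption (it depends on where the downloaded assets land), and the camera pose is illustrative:

import pybullet as p
import pybullet_data

client = p.connect(p.DIRECT)  # p.GUI opens an interactive window instead
p.setAdditionalSearchPath(pybullet_data.getDataPath())
p.setGravity(0, 0, -9.8)
p.loadURDF("plane.urdf")
robot_id = p.loadURDF("ur5e/ur5e.urdf",  # assumed asset path
                      basePosition=[0, 0, 0], useFixedBase=True)

# Render a top-down image, as consumed by the ViLD detector.
view = p.computeViewMatrix(cameraEyePosition=[0, 0, 1.5],
                           cameraTargetPosition=[0, 0, 0],
                           cameraUpVector=[0, 1, 0])
proj = p.computeProjectionMatrixFOV(fov=60, aspect=1.0, nearVal=0.01, farVal=10)
width, height, rgb, depth, seg = p.getCameraImage(320, 320, view, proj)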

Key Components

  • Vision Module: ViLD (open-vocabulary detection via vision-and-language knowledge distillation) for zero-shot object detection.
  • Language Module: GPT-3.5-turbo-instruct for task planning and decomposition.
  • Manipulation Module: PyBullet direct control for pick-and-place actions (an IK-based control sketch follows this list).
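
Direct control here means commanding the arm in simulation rather than replaying scripted primitives. A minimal sketch of IK-based position control in PyBullet; the end-effector link index and joint indices are robot-specific assumptions:

import pybullet as p

def move_ee_to(robot_id, ee_link, target_pos, joint_indices):
    # Solve inverse kinematics for the Cartesian target; the solution covers
    # all movable joints, so we assume joint_indices are the arm joints in order.
    joint_targets = p.calculateInverseKinematics(robot_id, ee_link, target_pos)
    p.setJointMotorControlArray(robot_id, joint_indices, p.POSITION_CONTROL,
                                targetPositions=joint_targets[:len(joint_indices)])
    for _ in range(240):  # let the controller settle (~1 s at 240 Hz)
        p.stepSimulation()

A pick-and-place action is then a short sequence of such moves: above the object, down, close the gripper, up, over the target, open.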

Usage

Run the main notebook SayCanWithDirectControl.ipynb to:

  1. Set up the PyBullet simulation environment.
  2. Perform object detection using ViLD.
  3. Execute tasks with optimized scoring and batch processing (a batched-scoring sketch follows this list).
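
One of the pipeline optimizations is scoring every candidate step in a single request rather than one API call per option. A minimal sketch, assuming the OpenAI completions endpoint (which accepts a list of prompts and, with echo=True and max_tokens=0, returns per-token log-probabilities of the prompt itself); the prompt format is illustrative:

from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def batch_llm_scores(context, options):
    # One batched call: a list of prompts, scored without generating tokens.
    resp = client.completions.create(
        model="gpt-3.5-turbo-instruct",
        prompt=[context + opt for opt in options],
        max_tokens=0,
        echo=True,
        logprobs=1,
        temperature=0,
    )
    # Each choice carries an index matching its prompt's position in the batch.
    ordered = sorted(resp.choices, key=lambda c: c.index)
    return [sum(lp for lp in c.logprobs.token_logprobs if lp is not None)
            for c in ordered]

Because every option shares the same context, the context's log-probability contributes nearly equally to each sum, so whole-prompt totals still rank the options consistently.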

To start, launch Jupyter Notebook and open SayCanWithDirectControl.ipynb:

jupyter notebook

Hardware Requirements

  • CUDA-capable GPU recommended for running vision and language models.
  • Tested on Python 3.8 with PyTorch and JAX (a quick device check is sketched below).
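
To confirm that both frameworks see the GPU before running the notebook (a quick sanity check, not part of the project's code):

import torch
import jax

print("PyTorch CUDA available:", torch.cuda.is_available())
print("JAX devices:", jax.devices())  # lists GPU devices when CUDA-enabled JAX is installed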

Updating Dependencies

To regenerate requirements.txt from the packages currently installed in your conda environment (pip freeze exports the list, it does not install anything):

pip freeze > requirements.txt

Citation

If you use this work, please cite:

@misc{saycan2022,
    title={Do As I Can, Not As I Say: Grounding Language in Robotic Affordances},
    author={Michael Ahn and Anthony Brohan and Noah Brown and ... Andy Zeng},
    year={2022},
    eprint={2204.01691},
    archivePrefix={arXiv}
}

License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.

