Language is All a Graph Needs

This repo presents implementation of the InstructGLM (Instruction-finetuned Graph Language Model) and provide a natural language interface for graph machine learning:

Paper: Language is All a Graph Needs

Paper link: https://arxiv.org/abs/2308.07134

Introduction

We introduce our proposed Instruction-finetuned Graph Language Model, i.e. InstructGLM, a framework utilizing natural language to describe both graph structure and node features to a generative large language model and further addresses graph-related problems by instruction-tuning, which provides a powerful natural language processing interface for graph machine learning.

Usage

Clone this repo


git clone https://github.com/agiresearch/InstructGLM.git

Download preprocessed data from Arxiv, Cora, PubMed. If you would like to preprocess your own data, please follow the data_preprocess folder. Requiured raw data files for preprocessing can be downloaded from this raw-Arxiv, raw-Cora, raw-PubMed.
Download Llama-7b pretrained checkpoint via this Google Drive link, it has been processed by the format conversion script of HuggingFace. Please then put the ./7B folder under the same path with ./scripts folder.
Multi-task Multi-prompt Instruction Tuning


bash scripts/train_llama_arxiv.sh 8

Here 8 means using 8 GPUs to conduct parallel instruction tuning with DDP.

Validation/ Inference


bash scripts/test_llama_arxiv.sh 8

Main key points are summarized in note.txt

Checkpoints

See: Google Drive link.

Citation

Please cite the following paper corresponding to the repository:


@article{ye2023language,
  title={Language is All a Graph Needs},
  author={Ye, Ruosong and Zhang, Caiqi and Wang, Runhui and Xu, Shuyuan and Zhang, Yongfeng},
  journal={EACL},
  year={2024}
}

Acknowledgements

TAPE, GIANT, OGB, P5, OpenP5, and Planetoid

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
7B		7B
Arxiv		Arxiv
Cora		Cora
PubMed		PubMed
data_preprocess		data_preprocess
flan_arxiv_src		flan_arxiv_src
flan_cora_src		flan_cora_src
flan_pubmed_src		flan_pubmed_src
llama_arxiv_src		llama_arxiv_src
llama_cora_src		llama_cora_src
llama_pubmed_src		llama_pubmed_src
pic		pic
scripts		scripts
InstructGLM.pdf		InstructGLM.pdf
LICENSE.txt		LICENSE.txt
README.md		README.md
note.txt		note.txt
requirements.txt		requirements.txt

License

agiresearch/InstructGLM

Folders and files

Latest commit

History

Repository files navigation

Language is All a Graph Needs

Introduction

Usage

Checkpoints

Citation

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Languages