A reproduction of the paper "Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction"


AlignInc/aligner-replication

 
 


Aligner-Reproduced

Align-Inc uses the Aligner technology developed by Peking University: we trained a lightweight Aligner based on Gemma-2B and applied it to our specific business practices. Notably, the Aligner we replicated achieved strong results on AlpacaEval; see below for details.
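The weak-to-strong correction step can be sketched as follows. This is an illustrative assumption, not the repo's exact code: the prompt template and function names are hypothetical, and `aligner_generate` stands in for any wrapper around the trained Gemma-2B Aligner (e.g. a Hugging Face `pipeline("text-generation", ...)` call).

```python
# Hedged sketch of Aligner-style weak-to-strong correction: the upstream
# (strong) model produces a draft answer, and the small Aligner model
# rewrites it. The prompt template below is an assumption for illustration.

def build_aligner_prompt(question: str, draft_answer: str) -> str:
    """Pack the user query and the upstream model's draft answer into a
    single prompt; the Aligner is trained to emit a corrected answer."""
    return (
        "BEGINNING OF CONVERSATION: USER: Edit the following Question-Answer "
        f"pair to make it more helpful and harmless: {question} | {draft_answer} "
        "ASSISTANT:"
    )

def correct(question: str, draft_answer: str, aligner_generate) -> str:
    """Run one correction pass. `aligner_generate` is any callable that maps
    a prompt string to the Aligner's generated continuation."""
    return aligner_generate(build_aligner_prompt(question, draft_answer))
```

In this setup the upstream model (Qwen-72B-Chat, Claude3-Opus, or GPT-4) is never fine-tuned; only its output passes through the lightweight corrector.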

Results

Using the techniques described in the paper, we trained an Aligner based on Gemma-2B and improved the AlpacaEval performance of Qwen-72B-Chat, Claude3-Opus, and GPT-4. After correction by our Aligner model, Qwen-72B-Chat's LC win rate rose to 36.7%, with responses averaging 1812 tokens, while Claude3-Opus's LC win rate rose to 41.8%, with an average response length of 1669 tokens.

Surprisingly, GPT-4's LC win rate increased to 58.3%, making it the top performer on AlpacaEval.

Citing Aligner

This repository is a reproduction of the paper "Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction". If you find Aligner useful, please cite it in your publications:

@article{ji2024aligner,
  title={Aligner: Achieving efficient alignment through weak-to-strong correction},
  author={Ji, Jiaming and Chen, Boyuan and Lou, Hantao and Hong, Donghai and Zhang, Borong and Pan, Xuehai and Dai, Juntao and Yang, Yaodong},
  journal={arXiv preprint arXiv:2402.02416},
  year={2024}
}
