Generate updates for LMSYS Chatbot Arena Leaderboard #15497

Open
1 task done
zhoukuncheng opened this issue May 7, 2024 · 1 comment
Labels
anti-crawler The site have strict anti-crawler policies RSS proposal New RSS wanted

Comments

@zhoukuncheng
Contributor

zhoukuncheng commented May 7, 2024

Category

Programming

Website URL

https://chat.lmsys.org/?leaderboard
or its mirror on Hugging Face:
https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard

Website description

LMSYS Chatbot Arena is a crowdsourced open platform for LLM evals. We've collected over 800,000 human pairwise comparisons to rank LLMs with the Bradley-Terry model and display the model ratings in Elo-scale.
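For readers unfamiliar with Elo-scale ratings, the sketch below shows how a rating gap is usually read as a preference probability. It assumes the conventional Elo constants (base 10, scale 400); the constants the leaderboard itself uses are not stated here, and the function name and sample ratings are made up for illustration.

```javascript
// Expected probability that model A is preferred over model B,
// given Elo-scale ratings. Assumes the conventional logistic form
// with base 10 and scale 400 (an assumption, not confirmed by the site).
function expectedWinProbability(ratingA, ratingB) {
  return 1 / (1 + Math.pow(10, (ratingB - ratingA) / 400));
}

// Made-up ratings, for illustration only (not real leaderboard values).
const p = expectedWinProbability(1250, 1150);
console.log(p.toFixed(2)); // prints "0.64"
```

Under these assumptions, a 100-point gap corresponds to roughly a 64% chance that the higher-rated model wins a pairwise comparison.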

What content should be included?

Leaderboards for different categories (e.g. overall, coding, long user, English, Chinese)

Additional description

No response

This is not a duplicated rss request

@zhoukuncheng zhoukuncheng added the RSS proposal New RSS wanted label May 7, 2024
@TonyRL TonyRL added the anti-crawler The site have strict anti-crawler policies label May 7, 2024
@zhoukuncheng
Contributor Author

@TonyRL
The original page (https://chat.lmsys.org/?leaderboard) uses Cloudflare to block crawlers,
but the mirror page on Hugging Face (https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard) doesn't seem to be behind a Cloudflare challenge.

The Hugging Face Space can also be queried through the Gradio Python/JavaScript clients, for example:

import { Client } from "@gradio/client";

const app = await Client.connect("lmsys/chatbot-arena-leaderboard");
const result = await app.predict("/update_leaderboard_and_plots", [
  "Overall", // string in 'Category' Dropdown component
]);

console.log(result.data);

However, I haven't been able to successfully run this code in my environment. It might require some debugging.
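Once the call works, the rows would still need to be mapped to feed items. Here is a rough sketch of that step, not a tested implementation. It assumes the leaderboard comes back as a Gradio-style table of the form { headers, data } with the model name in the first column; I have not verified the actual shape of result.data for this Space. The helper name leaderboardToItems, the "Arena Score" header, and the sample rows are all hypothetical.

```javascript
// Hypothetical helper: turns a Gradio-style table into feed items.
// Assumption: the table looks like { headers: string[], data: any[][] }
// and the first column holds the model name. The real shape of
// result.data for this Space has not been verified.
function leaderboardToItems(table, category, link) {
  const { headers, data } = table;
  return data.map((row, index) => {
    // Pair each header with the cell in the same column.
    const fields = headers.map((header, col) => `${header}: ${row[col]}`);
    return {
      title: `[${category}] #${index + 1} ${row[0]}`,
      description: fields.join("<br>"),
      link,
    };
  });
}

// Made-up sample rows, for illustration only (not real leaderboard data).
const sample = {
  headers: ["Model", "Arena Score"],
  data: [
    ["model-a", 1250],
    ["model-b", 1190],
  ],
};

const items = leaderboardToItems(
  sample,
  "Overall",
  "https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard"
);
console.log(items);
```

Passing a different category string (e.g. "Coding") to both the predict call and this helper would give one feed per category, which matches the request above.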


2 participants