Testset generation ValueError: invalid literal for int() with base 10: #966
Comments
This seems like a specific issue with the model and how we are parsing its outputs. We are aware of it and will be fixing it in the coming weeks, but sadly it's not an easy fix. The easier fix is to use a more capable model. I was curious why you are not using models like GPT-4 and Claude for your use case?
Thank you for your reply! I want to find open-source large language models to support our private deployment scenarios. Does ragas have a list of supported open-source LLMs to choose from as a critic model, or can we pick any open-source LLM and use it for test set generation?
The recommendation is to try something as powerful as GPT-4, because at that scale models are much more steerable with prompts. Something else you can try is our custom critic model: https://docs.ragas.io/en/stable/howtos/customisations/ragas_custom_model.html. If you want help using it and setting it up, let me know; happy to help, @choshiho.
Thank you! It seems that the official ragas critic model (https://docs.ragas.io/en/stable/howtos/customisations/ragas_custom_model.html) can't handle Chinese. Is there an open-source Chinese critic model you could recommend?
Unfortunately, there is no open-source Chinese critic model at present. The best option would be to use a proprietary model that handles Chinese (GPT-4, Claude, etc.); Azure OpenAI might help. Alternatively, we might be able to help you fine-tune a model but that will be what do you think?
[ ] I have checked the documentation and related resources and couldn't resolve my bug.
Describe the bug
Ragas version: 0.1.7
Python version: 3.11.7
Code to Reproduce
First, I deployed my Qwen1.5-7B-Chat-GPTQ-Int8 using the following command:
CUDA_VISIBLE_DEVICES=1 python -m vllm.entrypoints.openai.api_server --served-model-name Qwen1.5-7B-Chat-GPTQ-Int8 --model /home/zhifeng.zhao/.cache/modelscope/hub/qwen/Qwen1___5-7B-Chat-GPTQ-Int8 --max-model-len 18576
Then, the code in the Jupyter notebook is as follows:
Error trace
Expected behavior
TestsetGenerator.generate_with_langchain_docs() returns a TestDataset object with 10 elements.
Additional context
I have edited the file "/home/zhifeng.zhao/anaconda3/lib/python3.11/site-packages/ragas/testset/evolutions.py" as suggested in issue #900:
selected_nodes = [ current_nodes.nodes[int(i) - 1] for i in relevant_context_indices if int(i) - 1 < len(current_nodes.nodes) ]
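For anyone hitting the same error: the patch above still calls int(i) on whatever the critic model returned, so a non-numeric entry would raise the ValueError from the title. A possible workaround, sketched below, is to pull out only the digit runs and bounds-check them before indexing. This assumes the error really comes from that int() call (the trace is not shown in this thread). parse_indices is a hypothetical helper, not part of ragas, and it also rejects an index of 0, which the patched line would silently map to the last node.

```python
import re


def parse_indices(raw, n_nodes):
    """Return valid 0-based indices found in model output (hypothetical helper, not a ragas API).

    `raw` may be a single string or a list of strings/numbers as produced by the model.
    Anything that is not a digit run, or that falls outside 1..n_nodes, is skipped.
    """
    if isinstance(raw, (list, tuple)):
        tokens = [str(item) for item in raw]
    else:
        tokens = [str(raw)]

    indices = []
    for token in tokens:
        # Collect every run of digits instead of calling int() on the whole token.
        for match in re.findall(r"\d+", token):
            idx = int(match) - 1  # the model output is assumed to be 1-based
            if 0 <= idx < n_nodes and idx not in indices:
                indices.append(idx)
    return indices


# Possible use inside the patched comprehension (names taken from the line above):
# selected_nodes = [
#     current_nodes.nodes[i]
#     for i in parse_indices(relevant_context_indices, len(current_nodes.nodes))
# ]
```

If the helper returns an empty list, the caller still has to decide what to do (retry the prompt, fall back to the first node, and so on); that choice is left open here.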