The paper states: "Furthermore, we also invite some expert annotators to label task planning for some complex requests (46 examples) as a high-quality human annotated dataset. We also plan to further improve the quality and quantity of this dataset to better help us to evaluate the LLM capability in planning, which leaves as future work." Are you planning to release this evaluation dataset? Or, if it is already in the repository, could you point me to the folder location?
Thanks.
@ssdasgupta We are currently working with our labeling teams to iteratively improve the quality of this dataset, and with our legal team to ensure the release is compliant. We plan to publish a work on this dataset in the future. Please be patient.
I hope you’re doing well. I wanted to kindly follow up on the status of the evaluation dataset mentioned in the previous discussion. I understand that the team has been working on improving the quality and ensuring legal compliance. Could you please provide any updates on when we might expect the release of this dataset?
This dataset would be extremely valuable for my work, and I’m sure many others in the community are also eagerly awaiting it. Your efforts are greatly appreciated.