Webdatasets for Linear Probe evaluation #127

Open
shrebox opened this issue May 27, 2024 · 2 comments

shrebox commented May 27, 2024

Hi, I noticed that only the web datasets listed below work for linear probe evaluation. It would be nice to have the following addressed:

  1. Support for the rest of the datasets in linear probe evaluation. For example, for SUN397, the train shard file is missing from the HuggingFace datasets.

  2. As with zero-shot evaluation, an automatic way to run the linear probe evaluations would help to benchmark models quickly.

Web datasets that worked for linear probe evaluation:

wds/imagenet1k
wds/fer2013
wds/voc2007
wds/cars
wds/fgvc_aircraft
wds/mnist
wds/stl10
wds/gtsrb
wds/renderedsst2
wds/vtab/caltech101
wds/vtab/cifar10
wds/vtab/cifar100
wds/vtab/clevr_count_all
wds/vtab/clevr_closest_object_distance
wds/vtab/diabetic_retinopathy
wds/vtab/dmlab
wds/vtab/dsprites_label_orientation
wds/vtab/dsprites_label_x_position
wds/vtab/dsprites_label_y_position
wds/vtab/dtd
wds/vtab/eurosat
wds/vtab/kitti_closest_vehicle_distance
wds/vtab/flowers
wds/vtab/pets
wds/vtab/pcam
wds/vtab/resisc45
wds/vtab/smallnorb_label_azimuth
wds/vtab/smallnorb_label_elevation
wds/vtab/svhn
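
For item 2, here is a minimal sketch of what such an automatic runner could look like. It is only an illustration under stated assumptions: `evaluate_fn` is a hypothetical placeholder for whatever actually runs a single linear probe evaluation (it is not part of the existing pipeline), and `LINEAR_PROBE_DATASETS` holds just a few of the names from the list above.

```python
from typing import Callable, Dict, Iterable, Tuple

# A few of the dataset names listed above; extend with the rest as needed.
LINEAR_PROBE_DATASETS = [
    "wds/imagenet1k",
    "wds/fer2013",
    "wds/vtab/cifar10",
]


def run_linear_probes(
    datasets: Iterable[str],
    evaluate_fn: Callable[[str], float],
) -> Tuple[Dict[str, float], Dict[str, str]]:
    """Run evaluate_fn on every dataset and collect scores and failures.

    evaluate_fn is a hypothetical stand-in for one linear probe run:
    it takes a dataset name and returns a score.
    """
    results: Dict[str, float] = {}
    failures: Dict[str, str] = {}
    for name in datasets:
        try:
            results[name] = evaluate_fn(name)
        except Exception as exc:  # e.g. a missing train shard
            # Record the error and continue with the remaining datasets.
            failures[name] = str(exc)
    return results, failures
```

Collecting failures instead of stopping at the first error means a dataset with a missing train shard would be reported alongside the successful runs rather than aborting the whole benchmark.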

Thanks! :)

@CAOANJIA

I encountered the same problem that the training split of SUN397 is missing. Have you solved it?

shrebox commented Jun 18, 2024

> I encountered the same problem that the training split of SUN397 is missing. Have you solved it?

Unfortunately not. The train shard file is still missing (as mentioned above in the issue), so the pipeline cannot be used with the web dataset :/
