UMAP hyperparameters question #59

Open
timofeytkachenko opened this issue Dec 26, 2023 · 1 comment

Comments

@timofeytkachenko

There are three groups of UMAP hyperparameters. When should we use the categorical and numerical dicts, and when the combined dict?

default_umap_params = {
    "categorical": {
        # jaccard is an option but only takes sparse input
        "metric": "hamming",
        "n_neighbors": 30,
        "n_components": 5,
        "min_dist": 0.0,
    },
    "numerical": {
        "metric": "l2",
        "n_neighbors": 30,
        "n_components": 5,
        "min_dist": 0.0,
    },
    "combined": {
        "n_neighbors": 30,
        "min_dist": 0.0,
        "n_components": 5,
    },
}

@momonga-ml
Contributor

@timofeytkachenko Ideally, your use case won't require going too deep into the weeds, and you can use the presets. The "combined" parameters are used when an intersection_union_mapper is created, like here, which requires fitting a third UMAP over the other two. Regardless, two UMAPs are fitted using the "numerical" and "categorical" parameters. We have another notebook coming that demonstrates these and might shed more light, but please use the first one until then. FYI @bharven @srushtii-aws
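
To make the answer above concrete, here is a rough sketch of which parameter groups come into play. It is not the project's actual code: `resolve_umap_params` and its `combine` flag are hypothetical names made up for illustration, and the sketch assumes that the "numerical" and "categorical" groups are always used while "combined" only matters when the two embeddings are merged by a third UMAP.

```python
from copy import deepcopy

# Defaults copied from the question above.
default_umap_params = {
    "categorical": {"metric": "hamming", "n_neighbors": 30,
                    "n_components": 5, "min_dist": 0.0},
    "numerical": {"metric": "l2", "n_neighbors": 30,
                  "n_components": 5, "min_dist": 0.0},
    "combined": {"n_neighbors": 30, "min_dist": 0.0, "n_components": 5},
}


def resolve_umap_params(overrides=None, combine=True):
    """Hypothetical helper: merge per-group overrides into the defaults
    and return only the groups that would be used."""
    params = deepcopy(default_umap_params)
    for group, values in (overrides or {}).items():
        if group not in params:
            raise KeyError(f"unknown UMAP parameter group: {group!r}")
        params[group].update(values)

    # Assumption: two UMAPs are always fitted, one per feature type.
    groups = ["numerical", "categorical"]
    # Assumption: a third UMAP over the other two uses "combined".
    if combine:
        groups.append("combined")
    return {group: params[group] for group in groups}


if __name__ == "__main__":
    resolved = resolve_umap_params({"numerical": {"n_neighbors": 15}})
    for group, values in resolved.items():
        print(group, values)
```

Under these assumptions, overriding only one group leaves the presets for the other groups untouched, and passing `combine=False` drops the "combined" group from the result.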
