Skip to content

chunwangpro/Query-to-Distribution

Repository files navigation

Learning Data Distribution from Queries

Basic Usage

Single Table

python SingleTable.py --dataset wine3 --query-size 10000 --min-conditions 1 --max-conditions 2

Q-error: median 1.14, 90th 3.09, 99th 17.37, max 488.

Baseline

  • LPALG (PGM)
python LPALG.py --dataset wine --query-size 5 --num-conditions 2

Code Description

  • LPALG.py ----- Baseline of LPALG (PGM)
  • SingleTable.py ----- Single table generation
  • util.py ----- query related functions (rewrite by numpy version)
  • dataset.py ----- load and sort dataframe columns by unique number
  • preprocessing.py ----- build train set
  • model.py ----- models
  • generator.py ----- generation methods, by row, by col

About

Data generation from queries.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published