- all resources' primaryKey properity will be an array in datapackage.json now
- update the code to make it work with newer version of pandas
- improved and fixed bug in the curl download function
- imporved dask datapoint reader to correctly determine dtypes
- rewrite the download() function to use libcurl
- EntityDomain class support querying entities with no sets
- ihme loader: add support for filtering rei context (#132)
- ihme loader: update link template and fix json issue (#130)
- merge procedure: fix datapoints deep merge logic
- merge_keys: option to choose aggregation method and fix a bug when merged='keep'
- more test cases
- merge/split entities: handle categorical dtype columns (#128)
- create_datapackage: check if headers match file name (#127)
- query: handle query contains columns not exist in the dataframe
- minor changes
- chef: use string for time concepts (#124)
- imporve general retry/download functions
- imporve WDI data loader
- improve logging for datapackage generation (#120)
- update dependencies version (#121)
- minor changes
- improve performance of entity loading in DDFcsv reader (#119)
- update progress bar options (#118)
- data factory: fixed issues in ihme loader and remove outdated loaders.
- exclude assets folder in
ddf cleanup
- chef: added modifier for translate_header (#94)
- more docs
- ddf cleanup: remove all files in folders
- new utils to download file with resumming ability
- fix: ddf schema missing for concepts synonyms
- fix: check etl type should take care of missing module in etl.py
- minor changes
- ddf reader: read synonyms for entity set from entity domain synonyms
- chef: imporvments on
window
procedure, improvements onto_recipe
method - datapackage: improvments on ddfschema generation process
- minor changes
- added options to modify column headers for chef output (#116)
- fixed a chef debug option issue
- added more examples/docs
- DDF data models/Chef data models are re-written
- various improvments/bug fix on chef procedures
- updated project sturcture
- improved documents and tests
- fix click version issue when install with pip
- add default fields to create_datapackage
- fix exception when creating datapackage for files with multiple indicators (issue #113)
- fix factory method issue for cdiac
- factory method for IGME no longer works. mark it in doc.
- fix wrong requirement (issue #115)
- fix installation error (issue #114)
- updated module structure, move functions to correct place
- type annotation for classes
- updated trend_bridge, adding bridge start/end detection
- updated factory methods for a few sources
- improvments and bug fix in chef
- split
procedure.py
into seperated files for each procedure - bug fix, improvments for procedures
- bug fix, improvments for datapackage loading and creation of datapackage.json
- more factory methods for data sources
- bug fix for various procedures
- improvment the running time for recipes (#98)
- update commandline options for
ddf
sub-commands - update documents
- use dask and categorical types to improve memory usage (#90)
- sorting column/row order before serving result in Chef
- add
breakpoint
option to chef procedures to stop and debug chef - add data/metadata downloading functions to various data sources
- minor changes and improvments
- new support Hy mode
- bug fixes and improvements
- add support for reading mutiple indicators in one DDF datapoints file (#76)
- add support for reading repo from github / local path (#79)
- add dry_run option in Ingredient to speed up loading speed
- get_datapoints_df() for Dataset object always return a DataFrame
- to_graph() for Chef now includes procedure names
- bug fixes and improvements
- support mongo-like queries in ingredient definition and filter procedures (#72)
- new procedure: filter (#72)
- inline ingredients in ingredients section and procedures (#36)
- external csv file as ingredient (#36)
- colourful terminal output for chef
- bug fix and minor improvements
- totally rewrite the ddf reader, make separated models for dataset and datapackage in
ddf_utils.models
. - add Chef class, which can create or load recipes interactively. Now all recipe related tasks should run under a Chef instance.
- all procedures takes a chef instance as first arguments now. Also we support writing custom procedures now.
- removed
indentity
andcopy
procedure, updatedfilter_row
andgroupby
procedure - added new
flatten
procedure - minor improvements
- bug fix: #67
- add ddfSchema creation to datapackage
- bug fix and improvements
- bug fix in datapackage generation
- performance improvement on datapackage generation
- add progress bar to datapackage generation
- bug fix in ddfSchema generation
- the chef_new module, which is to replace the chef module. But it's still WIP, not working yet.
- ddf_utils.index renamed to ddf_utils.datapackage
- new interface for DDF Dataset and Datapackage
- removed python 3.3 support, because it's not supported by latest pandas now
- add support for ddfSchema creation
- bug fix for procedures/functions
- minor improvments
- new subcommands for
ddf
:diff
for comparing 2 datasets - DDF() now accepts absolute path to datasets
- new subcommands for
ddf
:validate_recipe
andbuild_recipe
- include a recipe schema for validating recipes
- documents for new commands
- minor improvments
- bug fix in several procedures
- minor improvments
- new
insert_keys
option togroupby
procedure - new
keep_columns
option tofliter_row
procedure - new
split_datapoints_by
andsub_folder
option for serving - misc improvements and bug fixes
- new options for
extract_concepts
procedure (#40) - the
key
parameter for ingredients now only accepts string (#39)- Note: this will break recipes worked on pervious version with entities
ingredients which
key
s are lists.
- Note: this will break recipes worked on pervious version with entities
ingredients which
translate_column
can ask for user input when ambiguity found (#34)- the ingredient dataframe's dtype will set according to the concepts table (#43)
- added debug options to all procedures (#46)
- added
trend_bridge
procedure (#42) - code cleanup and formatting
- added lots of documents
- added tests for all procedures
- removed outdated procedures (align, accumulate)
- more api docs, and all docs are available in read the docs now
- new
--ddf_dir
option forddf run_recipe
#45 - add options for
serve
procedures andserving
section. Now you should provide a list of dictionaries inserving
section, instead of a list of ids as pervious version - minor improvements and bug fixes
- new proecedures:
window
(#25) - updated
groupby
procedure (#25) - updated
translate_column
procedure to include the function inalign
(#3) - minor improvements
- use DAG to model the recipe. changes are:
- procedure result can not have same id with other ingredients (can't overwrite existing ingredients)
- the
result
of procedure is mandantory field now - recipe cooking procedures can be written in any order. Chef will check dependencies
- new show-tree option to display a tree view of procedures/ingredients in recipe
- added support for serve section
- renamed procedure
add_concepts
toextract_concepts
#40