v5.0.0
AnotherSamWilson released this 15 Oct 19:33
- New main classes (ImputationKernel, ImputedData) replace (KernelDataSet, MultipleImputedKernel, ImputedDataSet, MultipleImputedDataSet).
- Data can now be referenced and imputed in place. This avoids a lot of memory allocation and is much faster.
- Data can now be completed in place. This allows for only a single copy of the dataset to be in memory at any given time, even if performing multiple imputation.
- The mean_match_subset parameter has been replaced with data_subset, which subsets the data used to build the model as well as the mean matching candidates.
- More performance improvements around when data is copied and where it is stored.
- Raw data is now stored in its original form. Both pandas DataFrame and numpy ndarray inputs are supported.
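
The in-place completion described above can be illustrated with a small conceptual sketch. This is not the package's code: plain Python lists stand in for a DataFrame or ndarray, and the helper names (missing_positions, complete_inplace) and the imputed values are hypothetical. The idea is that each imputed dataset is kept only as the values for the missing cells, and those values are written into the one shared copy of the data when needed.

```python
import math


def missing_positions(rows):
    """Return the (row, column) positions of all NaN cells."""
    return [
        (i, j)
        for i, row in enumerate(rows)
        for j, value in enumerate(row)
        if math.isnan(value)
    ]


def complete_inplace(rows, imputations):
    """Write one set of imputed values into `rows` without copying it."""
    for (i, j), value in imputations.items():
        rows[i][j] = value
    return rows


# The single copy of the data; NaN marks a missing cell.
data = [[1.0, float("nan")], [float("nan"), 4.0], [5.0, 6.0]]
holes = missing_positions(data)

# Two hypothetical imputed datasets, stored as values for the holes only.
imputed_sets = [
    {(0, 1): 2.0, (1, 0): 3.0},
    {(0, 1): 2.5, (1, 0): 3.5},
]

# Multiple imputation: complete, analyze, then overwrite with the next set.
results = []
for imputations in imputed_sets:
    completed = complete_inplace(data, imputations)
    results.append(sum(row[1] for row in completed))  # toy analysis step

same_object = completed is data  # no second copy of the dataset was made
```

Because every imputed dataset fills the same cells, writing the next set over the previous one is enough to switch datasets, so only one full copy of the data is ever held. The trade-off is that the caller's original object is modified.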