Support augment operations on multi-index columns #304

liangjh · 2024-11-26T18:12:23Z

Can pytimetk be augmented to support multi-index columns?

The pytimetk augment_* API appears to assume that columns are single-level, i.e. a column multi-index is not supported.

The date and value column parameters of the augment_* functions make this assumption. If I have a DataFrame with multi-index columns, I need to collapse the multi-index into a single level before I can use pytimetk.

Here's an example. The pool forms a second level on the column multi-index. This allows the rows to be keyed by date only (i.e. longitudinally).

import pandas as pd

df = pd.DataFrame({
    'date': pd.date_range(start='2020-01-01', periods=10, freq='D'),
    'pool': ['A','A','A','A','A','B','B','B','B','B'],
    'target': [1,-1,0,-1,1,1,0,-1,-1,1],
    'reserve': [5,20,10,1,4,30,15,18,2,9]
})

df_tdp = df.set_index(['pool', 'date']).unstack('pool')
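To make the resulting layout concrete, here is a small sketch that rebuilds the frame above and inspects its column index. It uses only standard pandas calls; the variable name `target_by_pool` is just for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    'date': pd.date_range(start='2020-01-01', periods=10, freq='D'),
    'pool': ['A','A','A','A','A','B','B','B','B','B'],
    'target': [1,-1,0,-1,1,1,0,-1,-1,1],
    'reserve': [5,20,10,1,4,30,15,18,2,9]
})

df_tdp = df.set_index(['pool', 'date']).unstack('pool')

# The columns are now a two-level MultiIndex of (measure, pool) pairs,
# and the rows are keyed by date only.
print(df_tdp.columns.nlevels)
print(df_tdp.columns.tolist())

# Selecting one measure returns a frame with one column per pool.
target_by_pool = df_tdp['target']
print(target_by_pool.columns.tolist())
```

Note that in this toy data each date belongs to only one pool, so the unstacked frame contains NaN wherever a pool has no observation for a date.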

In pandas, this is also the ideal format for dataset-wide window operations, and it preserves the dimensionality of the columns. We can then stack / unstack the column levels back into rows if we want to preserve the inbound dimensionality.

df_tdp.shift(1).rolling(window=2).mean()

pd.concat([
    df_tdp.stack('pool'),
    df_tdp.shift(1).rolling(window=2).mean().stack('pool', dropna=False)
], axis=1)
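For comparison, the collapse workaround mentioned earlier can be sketched in plain pandas roughly as follows. This is only an illustration: the `'_'` separator and the names `flat` and `restored` are arbitrary choices, it assumes no column label already contains the separator, and the pytimetk call itself is left as a placeholder comment rather than guessed at.

```python
import pandas as pd

df = pd.DataFrame({
    'date': pd.date_range(start='2020-01-01', periods=10, freq='D'),
    'pool': ['A','A','A','A','A','B','B','B','B','B'],
    'target': [1,-1,0,-1,1,1,0,-1,-1,1],
    'reserve': [5,20,10,1,4,30,15,18,2,9]
})
df_tdp = df.set_index(['pool', 'date']).unstack('pool')

# Collapse the (measure, pool) column MultiIndex into single-level
# names such as 'target_A'.
flat = df_tdp.copy()
flat.columns = ['_'.join(col) for col in df_tdp.columns]
flat = flat.reset_index()  # 'date' becomes an ordinary column

# ... single-level tooling (e.g. an augment_* call) would run on `flat` here ...

# Restore the original two-level columns afterwards.
restored = flat.set_index('date')
restored.columns = pd.MultiIndex.from_tuples(
    [tuple(name.split('_', 1)) for name in restored.columns],
    names=df_tdp.columns.names,
)
```

The round trip works, but any new columns produced in the flattened form have to be mapped back to the right level by hand, which is the overhead native multi-index support would remove.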

Assuming a single column level is a limiting factor for use cases beyond simple datasets.
I would like to hear any thoughts on this; I imagine most datasets are multi-dimensional / have multiple attributes. Thanks.
