Add a new function to compute monthly averages #94

alperaltuntas · 2019-03-13T00:22:29Z

Checklist

Enable and install pre-commit to ensure style-guides and code checks are followed.
Target master for bugfixes and doc changes.
Target devel for new features or functionality changes.
Include documentation when adding new features.
Include new tests or update existing tests when applicable.

Fixes: #55

Adds a new function to compute monthly averages from a given dataset that has more frequent data, e.g., 5 day means.

Summary of what the function does:

First, the input dataset gets "grouped by" time_bound months. (The time_bound gets expanded, i.e., gets reshaped to be 1-dimensional for the purpose of grouping. All the DataArrays in the dataset are reshaped accordingly.)
Then, a local function "weighted_monthly_mean" is applied to each group: Weights get computed for each chunk (e.g. 5-day) of each group (month), by taking into account how much of the chunk falls within the group.
Finally, time and time_bound get corrected for each group (month).

Note 1: Parts of the function may be (unnecessarily) complicated, so a code review may perhaps be helpful.
Note 2: The existing monthly climatology function (compute_mon_climatology) may be rewritten to call this new function (compute_mon_averages) to compute the averages, and then to compute the correct climatology (pretty easily).
Note 3: This is a draft pull request for now, since I haven't tested it thoroughly. Let me know any issues you notice/encounter.

To test this function, you may run the following on cheyenne:

from glob import glob
import xarray as xr
import esmlab

files = sorted(glob('/glade/scratch/altuntas/archive/g.e20.GIAF.T62_g37.test_d2m.001/ocn/hist/g.e20.GIAF.T62_g37.test_d2m.001.pop.h.0001-1*.nc'))
ds = xr.open_mfdataset(files, decode_times=False, decode_coords=False)
ds_monclim = esmlab.climatology.compute_mon_averages(ds)
ds_monclim.to_netcdf("test.nc")

remove duplicate call to _get_weights_and_dims

…oups

andersy005

@alperaltuntas, @matt-long, as a preliminary review comment, I was wondering if it would be worth exploring the usage of resample with cftime index as it's been implemented in pydata/xarray#2593?

I will add more comments for the rest of the PR tomorrow morning.

matt-long · 2019-03-18T15:09:27Z

@alperaltuntas, have you looked into @andersy005's suggestion? I think the new resample capability on CFTimeIndex could greatly simplify this type of application.

alperaltuntas · 2019-03-18T16:14:56Z

@matt-long, @andersy005, I'll look into it. Thanks.

matt-long · 2019-03-18T16:16:07Z

One think to keep in mind is an ability to correctly handled missing values that vary in time.

andersy005 · 2019-03-28T19:17:28Z

@alperaltuntas, I am sorry I deleted the devel without making sure that there were no open pull requests. When you get time, please open a PR against the master branch

sudharsana-kjl and others added 14 commits March 9, 2019 19:10

remove duplicate call to _get_weights_and_dims

17d9788

Merge pull request NCAR#88 from sudharsana-kjl/master

479fa70

remove duplicate call to _get_weights_and_dims

create tb data array

af23410

group by time_bound months

db6e275

first prototype of weighted_monthly_mean function to be applied to gr…

67993be

…oups

make weighted avging an option

6bbf2b9

compute weights properly

a8305fe

rename tb_name_mth

b289770

drop partially covered months

4f231ea

designate months with month indices

cdf35e6

split mon_climo and mon_avg functions

24ca73c

correct time and time_bound (and other improvements)

853635d

bring back white spaces

5b41fa9

define tb dims with a tuple (i.e., ordered)

c9410b4

alperaltuntas requested review from andersy005 and matt-long March 13, 2019 00:22

andersy005 reviewed Mar 13, 2019

View reviewed changes

andersy005 closed this Mar 27, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a new function to compute monthly averages #94

Add a new function to compute monthly averages #94

alperaltuntas commented Mar 13, 2019 •

edited

Loading

andersy005 left a comment

matt-long commented Mar 18, 2019

alperaltuntas commented Mar 18, 2019

matt-long commented Mar 18, 2019

andersy005 commented Mar 28, 2019

Add a new function to compute monthly averages #94

Add a new function to compute monthly averages #94

Conversation

alperaltuntas commented Mar 13, 2019 • edited Loading

andersy005 left a comment

Choose a reason for hiding this comment

matt-long commented Mar 18, 2019

alperaltuntas commented Mar 18, 2019

matt-long commented Mar 18, 2019

andersy005 commented Mar 28, 2019

alperaltuntas commented Mar 13, 2019 •

edited

Loading