time operations where time_bounds span multiple averaging periods #55

matt-long · 2019-02-09T17:54:14Z

There is an assumption within the functions in climatology.py that the time_bound of data fit concisely within the averaging period applied; this assumption is violated when computing monthly averages, say, on 5-day data. A more appropriate approach would be to compute averaging weights based on the portion of the time_bound that falls within the target averaging period.

The text was updated successfully, but these errors were encountered:

alperaltuntas · 2019-02-14T02:02:05Z

One solution for this issue is to interpolate from, say, 5-day data to 1-day data (using 'zero', i.e., piecewise polynomial interpolation), and then to compute monthly averages on daily data. This would be less efficient compared to an approach based on computing weights, but would be more general and easier to implement. The problem, however, is that xarray does not support interpolation over a chunked dimension.

When I try to interpolate a dataset that's read in using open_mfdataset, I get the following:

>>> da.interp(time=new_time_dim).compute()
...
NotImplementedError: Chunking along the dimension to be interpolated (2) is not yet supported.

Eliminating the chunking over time dimension solves this issue, but that would definitely be an infeasible option for practical use.

matt-long · 2019-02-14T23:21:33Z

@alperaltuntas, what if we "unfix time" and use the resample on the float time-axis, then "refix time" to compute the monthly climatology?

alperaltuntas · 2019-02-14T23:44:02Z

@alperaltuntas, what if we "unfix time" and use the resample on the float time-axis, then "refix time" to compute the monthly climatology?

I'll try this.

matt-long · 2019-02-14T23:56:09Z

on second thought, I think resample only works on time axes.

alperaltuntas · 2019-02-15T00:04:43Z

Can't we convert the time axis from cftime to Pandas' accepted time type, instead of unfixing the time?

matt-long · 2019-02-15T14:52:00Z

I think pandas is too restrictive for our data:

When decoding/encoding datetimes for non-standard calendars or for dates before year 1678 or after 
year 2262, xarray uses the cftime library. It was previously packaged with the netcdf4-python package 
under the name netcdftime but is now distributed separately. cftime is an optional dependency of 
xarray.

kmpaul · 2019-02-15T15:33:53Z

Have you installed cftime?

On Fri, Feb 15, 2019 at 7:52 AM Matthew Long ***@***.***> wrote: I think pandas is too restrictive for our data: When decoding/encoding datetimes for non-standard calendars or for dates before year 1678 or after year 2262, xarray uses the cftime library. It was previously packaged with the netcdf4-python package under the name netcdftime but is now distributed separately. cftime is an optional dependency of xarray. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#55 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AK4fgz1IbCCGWeUMEBSsQ_OxeZZ3l9cjks5vNsmQgaJpZM4ayp56> .

-- *Kevin Paul, PhD* Project Scientist, Head of I/O & Workflow Applications (IOWA) The National Center for Atmospheric Research Computational and Information Systems Laboratory 1850 Table Mesa Dr Boulder, CO 80305 Phone: (303) 497-2441 Office: ML460B

matt-long · 2019-02-15T15:44:06Z

Yes. The issue is that cftime doesn’t work with resample and pandas time is too restrictive.

kmpaul · 2019-02-15T15:56:39Z

Ah. Got it.

On Fri, Feb 15, 2019 at 8:44 AM Matthew Long ***@***.***> wrote: Yes. The issue is that cftime doesn’t work with resample and pandas time is too restrictive. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#55 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AK4fg8bPTUswfky8FP37vMjceYij05VGks5vNtXHgaJpZM4ayp56> .

-- *Kevin Paul, PhD* Project Scientist, Head of I/O & Workflow Applications (IOWA) The National Center for Atmospheric Research Computational and Information Systems Laboratory 1850 Table Mesa Dr Boulder, CO 80305 Phone: (303) 497-2441 Office: ML460B

alperaltuntas · 2019-04-03T20:03:42Z

Actually, this issue still applies to compute_mon_climatology. Not sure if it applies to other functions in climatology.py I am planning to update compute_mon_climatology based on the new function that computes means (compute_mon_mean).

Relatedly, I am wondering what's the best way of distinguishing functions that compute climatology vs functions that compute means. I added compute_mon_mean (which computes monthly means, not climatology) to climatology module , but not sure if placing it to climatology module will cause confusion. Also, another potential source of confusion is that the function that computes annual climatology is named compute_ann_mean. Should it be named compute_ann_climatology?

@matt-long ?

andersy005 · 2019-04-03T23:05:56Z

Relatedly, I am wondering what's the best way of distinguishing functions that compute climatology vs functions that compute means.

@matt-long, I presume @alperaltuntas's concern would be solved by the nomenclature suggestion you made in our conversation today.

andersy005 · 2019-04-03T23:15:34Z

Should we imitate NCL's nomenclature to a certain level : https://www.ncl.ucar.edu/Document/Functions/climo.shtml?

andersy005 · 2019-04-04T04:10:12Z

@alperaltuntas,

I added compute_mon_mean (which computes monthly means, not climatology) to climatology module , but not sure if placing it to climatology module will cause confusion.

In #109, I am removing the climatology.py module and most of utility functions in utils will be moved to an EsmlabAccessor class in a new module core.py.

Not sure that it completely solves the confusion issue, I've also moved most functions to the top-level of esmlab.e.g. you can now call esmlab.compute_ann_mean() instead of esmlab.climatology.compute_ann_mean()

matt-long added the bug label Feb 9, 2019

andersy005 added this to the sprint-feb18-mar03 milestone Feb 12, 2019

andersy005 added the help wanted label Feb 18, 2019

andersy005 pinned this issue Feb 19, 2019

andersy005 modified the milestones: sprint-feb18-mar03, sprint-mar04-mar17 Mar 1, 2019

alperaltuntas mentioned this issue Mar 13, 2019

Add a new function to compute monthly averages #94

Closed

5 tasks

andersy005 removed this from the sprint-mar04-mar17 milestone Mar 16, 2019

alperaltuntas mentioned this issue Mar 29, 2019

new compute_mon_means function based on resample() #110

Merged

andersy005 closed this as completed in #110 Apr 3, 2019

alperaltuntas reopened this Apr 3, 2019

This was referenced Apr 4, 2019

Code Cleanup/Refactoring #109

Merged

encoding of time and time_bounds differs in compute_ann_mean results for decode_time=True #111

Closed

andersy005 mentioned this issue Jun 9, 2019

esmlab.resample with freq='mon' yields unexpected results when called with monthly data #135

Open

andersy005 added help wanted Extra attention is needed and removed bug labels Jul 31, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

time operations where time_bounds span multiple averaging periods #55

time operations where time_bounds span multiple averaging periods #55

matt-long commented Feb 9, 2019

alperaltuntas commented Feb 14, 2019

matt-long commented Feb 14, 2019

alperaltuntas commented Feb 14, 2019

matt-long commented Feb 14, 2019

alperaltuntas commented Feb 15, 2019 •

edited

Loading

matt-long commented Feb 15, 2019

kmpaul commented Feb 15, 2019 via email

matt-long commented Feb 15, 2019

kmpaul commented Feb 15, 2019 via email

alperaltuntas commented Apr 3, 2019

andersy005 commented Apr 3, 2019

andersy005 commented Apr 3, 2019

andersy005 commented Apr 4, 2019 •

edited

Loading

time operations where time_bounds span multiple averaging periods #55

time operations where time_bounds span multiple averaging periods #55

Comments

matt-long commented Feb 9, 2019

alperaltuntas commented Feb 14, 2019

matt-long commented Feb 14, 2019

alperaltuntas commented Feb 14, 2019

matt-long commented Feb 14, 2019

alperaltuntas commented Feb 15, 2019 • edited Loading

matt-long commented Feb 15, 2019

kmpaul commented Feb 15, 2019 via email

matt-long commented Feb 15, 2019

kmpaul commented Feb 15, 2019 via email

alperaltuntas commented Apr 3, 2019

andersy005 commented Apr 3, 2019

andersy005 commented Apr 3, 2019

andersy005 commented Apr 4, 2019 • edited Loading

alperaltuntas commented Feb 15, 2019 •

edited

Loading

andersy005 commented Apr 4, 2019 •

edited

Loading