dynamic error handling within `eia_data()` from metadata layer #17

mghoff · 2023-11-11T00:45:05Z

No description provided.

leonawicz

This reminded me that we should have some unit tests that exercise all possible frequency values, so we'll probably have to test against at least a couple of different data sources for which different frequencies can be requested. But my biggest question is the one about the algorithm for requesting metadata within a data request.

leonawicz · 2023-11-13T17:57:40Z

R/data.R

+ if(!is.null(end)) .end_check(end, freq, md$Frequency, md_end, md_start)
+ if(!is.null(sort)) .sort_check(sort)
+ if(!is.null(length)) .lng_check(length)
+ if(!is.null(offset)) .ofs_check(offset)


General comment about lines 83-105: What do you think about migrating all the instances of this if(!is.null(x)) stuff into the functions that check or handle the rest of the respective operations? Seems like since we have all these other internal helper functions handling cases and returning something (or nothing), they could also handle the NULL case themselves as well. Then outer functions like .eia_data_url() and .eia_data_check() can be further simplified.

Yes! I will make this happen. Good catch.
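
For illustration, here is a minimal sketch of what pushing the NULL handling into the helpers could look like. The names `check_length()`, `check_offset()`, and `check_inputs()` are hypothetical and are not the package's actual helpers; the validation rules are assumptions chosen only to make the example runnable.

```r
# Hypothetical helpers, not the package's actual code: each one returns
# early on NULL, so callers no longer need `if(!is.null(x))` guards.
check_length <- function(x) {
  if (is.null(x)) return(invisible(NULL))
  if (!is.numeric(x) || length(x) != 1L || is.na(x) || x < 0) {
    stop("'length' must be a single non-negative number.", call. = FALSE)
  }
  invisible(x)
}

check_offset <- function(x) {
  if (is.null(x)) return(invisible(NULL))
  if (!is.numeric(x) || length(x) != 1L || is.na(x) || x < 0) {
    stop("'offset' must be a single non-negative number.", call. = FALSE)
  }
  invisible(x)
}

# The outer function calls every check unconditionally.
check_inputs <- function(length = NULL, offset = NULL) {
  check_length(length)
  check_offset(offset)
  invisible(TRUE)
}
```

With this shape, the outer function shrinks to a flat list of calls, and each helper owns the full set of cases for its argument.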

leonawicz · 2023-11-13T18:05:00Z

R/data.R

@@ -58,6 +58,8 @@ eia_data <- function(dir, data = NULL, facets = NULL,
 }

 .eia_data <- function(dir, data, facets, freq, start, end, sort, length, offset, tidy, key){
+ md <- eia_metadata(dir, TRUE, TRUE, key)


Does this mean that every API request becomes a minimum of two API requests?

Unfortunately, yes. I'm not sure how else truly dynamic validation of input values could be accomplished, unless we precompiled within the package itself (maybe within data-raw/...?) all the available options for every API data endpoint.

My intention was to only check as part of error handling itself, like catching an exception and handling it as needed.

Meaning, the data request is made. Then, if data is returned in some expected manner, do nothing but return the data. This would be the 99% common use case and only ever makes one request. Otherwise, handle the exception when it occurs (whether that is a try-catch around an actual error, or checking for some message or lack of data in the response).

So a second API request, for metadata, would only be made at the end of the parent data request function if needed based on the result.

So, completely dynamic: typical exception handling around the data call, rather than an automatic preemptive request.
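
A rough sketch of that flow, under stated assumptions: `request_data` and `request_metadata` are hypothetical stand-ins passed in as arguments (not the package's functions), and `md$frequency` is an assumed field name. The metadata request only happens inside the error handler.

```r
# Sketch only: `request_data` and `request_metadata` are hypothetical
# stand-ins for the data and metadata API calls.
fetch_with_lazy_check <- function(dir, freq, request_data, request_metadata) {
  tryCatch(
    request_data(dir, freq),  # common case: exactly one request
    error = function(e) {
      md <- request_metadata(dir)  # second request, only after a failure
      if (!freq %in% md$frequency) {
        stop("'freq' must be one of: ", paste(md$frequency, collapse = ", "),
             call. = FALSE)
      }
      stop(e)  # failure unrelated to `freq`: re-raise the original error
    }
  )
}
```

The trade-off is that the user only gets the more informative message after the API has already rejected the request, but a successful call never pays for the metadata lookup.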

Looking at it, I'm thinking I could just re-order the metadata call to come after the data call...

Let me first commit and push the is.null() checks to the helper functions, and then I'll play around with error handling on error/warning, rather than preemptively.

The other thing to realize is that it will only make two API calls on the first function call to that endpoint. After that initial call, that endpoint and its metadata will be cached, so only one API call is made from then on within the context of that API endpoint (dir entry).

Right. That's an excellent benefit of the memoization.
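
To make the caching effect concrete, here is a minimal base-R memoization sketch. It is not how the package implements its cache; `make_cached()` and the `fetch` argument are hypothetical names used only for illustration.

```r
# Minimal base-R memoization sketch; `fetch` is a hypothetical function
# that performs the real (expensive) request for a character key.
make_cached <- function(fetch) {
  cache <- new.env(parent = emptyenv())
  function(key) {
    if (!exists(key, envir = cache, inherits = FALSE)) {
      assign(key, fetch(key), envir = cache)  # first call: do the request
    }
    get(key, envir = cache, inherits = FALSE)  # later calls: reuse the result
  }
}
```

Wrapping a metadata lookup this way means the underlying request runs once per distinct key (for example, once per dir entry) for the lifetime of the cache.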

Another option is to make available a check_metadata = FALSE (default) parameter to eia_data(), ideally after tidy and before key, as perhaps the second least likely parameter to be passed an argument by the user.

Then, if they want to run a call with a more robust check that (potentially) makes another API request, they can set check_metadata = TRUE. Think of it like a sort of debug mode, though I wouldn't use that name. Then within the code, the metadata request and the subsequent checks could all be wrapped in an if(check_metadata).

It's just another option; if it's not worth the trouble to have the metadata request and checks be fully dynamic/only run as needed, then having the extra checking be optional but off by default is good.

Oh no - I like this. Good suggestion.
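
A sketch of the opt-in variant, assuming hypothetical `get_data` and `get_metadata` stand-ins and an assumed `md$frequency` field; only the control flow around `check_metadata` is the point here.

```r
# Sketch only: `get_data` and `get_metadata` are hypothetical stand-ins for
# the real request functions, and `md$frequency` is an assumed field name.
get_data_checked <- function(dir, freq = NULL, check_metadata = FALSE,
                             get_data, get_metadata) {
  if (check_metadata) {
    md <- get_metadata(dir)  # extra request happens only when opted in
    if (!is.null(freq) && !freq %in% md$frequency) {
      stop("'freq' must be one of: ", paste(md$frequency, collapse = ", "),
           call. = FALSE)
    }
  }
  get_data(dir, freq)
}
```

With the default `check_metadata = FALSE`, the call goes straight to the data request and leaves validation to the API.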

mghoff · 2023-11-16T20:17:32Z

All this unit testing work has made me really re-think the user experience and the in-package (non-API) error handling I've written. I've modified this fairly heavily; e.g. removing sort, length and offset from metadata checks and more.

One thing I'm still concerned with, and would appreciate your input on, is the start and end arguments. The API doesn't actually care whether the format of start matches the format of freq. For example, if freq == "annual" and start == "2020-06", then what would be returned without my error handling would be all of 2020. As such, I'm forcing the user to be more exacting with their input choices.
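
As a rough illustration of that stricter check, the sketch below matches a date string against a pattern per frequency. The function name `check_date_format()` is hypothetical, and the three formats listed (YYYY, YYYY-MM, YYYY-MM-DD) are assumptions for the example; they do not cover every frequency the API offers.

```r
# Assumed formats for illustration only; other frequencies are omitted.
date_patterns <- c(
  annual  = "^[0-9]{4}$",
  monthly = "^[0-9]{4}-[0-9]{2}$",
  daily   = "^[0-9]{4}-[0-9]{2}-[0-9]{2}$"
)

# Hypothetical helper: errors when `x` does not match the format for `freq`.
check_date_format <- function(x, freq) {
  if (is.null(x)) return(invisible(NULL))
  if (!freq %in% names(date_patterns)) {
    stop("No format known for freq = '", freq, "'.", call. = FALSE)
  }
  if (!grepl(date_patterns[[freq]], x)) {
    stop("'", x, "' does not match the format expected for freq = '",
         freq, "'.", call. = FALSE)
  }
  invisible(x)
}
```

Under these assumptions, `check_date_format("2020-06", "annual")` raises an error, while `check_date_format("2020", "annual")` passes.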

Further, I am still planning to find an API data endpoint offering hourly granularity so I can add testing around that - just haven't gotten to it yet.

leonawicz · 2023-11-16T20:50:45Z

That's a good point about the start and freq formats. There is nothing inherently wrong with imposing more formality, requiring matching format, in this area simply because the API itself is more loose in its requirement around this. But of course it's also fine to only reproduce the API's behavior. I could go either way.

You can always note in the details section of the documentation that if both arguments are given and one has a higher temporal resolution than the other, its resolution is reduced to match that of freq, and that this is the API's behavior.

mghoff added 7 commits November 1, 2023 20:49

ignore CRAN-SUBMISSION file

79cd331

add "weekly" option to frequency spec check

251cae8

new list object added to return: ExcelAddInVersion

1890e1b

new list object added to return: ExcelAddInVersion; added additional tests based on new dynamic error handling messaging

01873ba

dynamic error handling

b2fe5b5

update NEWS for upcoming version bump

98d2d4b

call.=FALSE

bd2bf8b

mghoff requested a review from leonawicz November 11, 2023 00:45

mghoff added 3 commits November 11, 2023 14:23

separate url creation from input validation

304eedf

use top-level eia_metadata() to take advantage of caching for faster input validation following a first error.

da96d64

simplify .eia_data_url()

ff5414c

leonawicz reviewed Nov 13, 2023

mghoff added 3 commits November 13, 2023 13:58

push !is.null() check to helper functions

ff430b3

add check_metadata conditional to assist with debugging input values

1807cfa

concise language

208eb23

leonawicz approved these changes Nov 13, 2023

mghoff added 2 commits November 13, 2023 16:06

add stop on 400 status code so that the API may be the first line of defense in handling errors.

394eec9

test API error handling

faf819b

leonawicz approved these changes Nov 13, 2023

mghoff added 3 commits November 16, 2023 13:29

metadata testing gets its own script

a4299c9

clean up handling of user inputs: rm sort, length, and offset from metadata check; `sort` made more robust where `order` may now handle a list of varying length respective of `cols` length; messaging made more consistent with API error messaging

178e5dd

separate testing into multiple parts based on type/class/methods of the inputs being tested

3566a9f

mghoff added 2 commits November 16, 2023 16:17

match messaging and call. = FALSE

f84bc02

add missing code coverage tests

5166181

leonawicz approved these changes Nov 16, 2023

leonawicz merged commit 2139d22 into master Nov 16, 2023
8 checks passed
