Request modification of existing behavior or design
What is the problem that your feature request solves
I'd like ArchiveBox to send the `If-Modified-Since` header when it has already fetched a site and the site sent a `Last-Modified` header, or to send `If-None-Match` with the stored `ETag` value from the earlier response. That way feeds like Rachel Kroll's feed can be re-fetched without having to wait a full day.
This would also make long-running fetches easier on both our host and the remote host.
Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes
Sorry, I think I should've read the entire thing before. ArchiveBox should send `If-Modified-Since` if it has already fetched the site, using the value the server sent in `Last-Modified`. Or it should send an `If-None-Match` with the value of `ETag`, if one was found.
I don't know if sending both is allowed, but I guess it'd be acceptable to prefer `If-Modified-Since` if both are available.
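To make the request concrete, here is a rough sketch of what I have in mind, using only the Python standard library. The function names `conditional_headers` and `fetch_if_changed` and the `prev_headers` dict (the response headers saved with the last snapshot) are made up for illustration and are not part of ArchiveBox. As far as I can tell from RFC 9110, sending both validators is allowed, and a server that sees `If-None-Match` ignores `If-Modified-Since`, so the sketch simply sends whichever ones are stored.

```python
import urllib.error
import urllib.request


def conditional_headers(prev_headers):
    """Build request validators from the headers stored with the last snapshot.

    `prev_headers` is a hypothetical dict of saved response headers.
    """
    # Header names are case-insensitive, so normalise before looking them up.
    prev = {name.lower(): value for name, value in prev_headers.items()}
    headers = {}
    if "etag" in prev:
        headers["If-None-Match"] = prev["etag"]
    if "last-modified" in prev:
        headers["If-Modified-Since"] = prev["last-modified"]
    return headers


def fetch_if_changed(url, prev_headers, timeout=30):
    """Return (changed, body, headers); changed is False on 304 Not Modified."""
    request = urllib.request.Request(url, headers=conditional_headers(prev_headers))
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            # 2xx: the content is new or changed, so return it with fresh headers.
            return True, response.read(), dict(response.headers)
    except urllib.error.HTTPError as err:
        if err.code == 304:
            # urllib reports 304 as an HTTPError; nothing changed, keep old headers.
            return False, None, dict(prev_headers)
        raise
```

With something like this, a scheduled re-archive could skip creating a new snapshot whenever `fetch_if_changed` reports `changed` as `False`, and store the returned headers otherwise.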
What hacks or alternative solutions have you tried to solve the problem?
I've considered putting an HTTP proxy that stores those validators between ArchiveBox and the remote sites, but that doesn't look pretty.
How badly do you want this new feature?
It's an urgent deal-breaker, I can't live without it
It's important to add it in the near-mid term future
It would be nice to have eventually
I don't really mind too much, but I'd appreciate it being there, as ArchiveBox could put strain on servers, and we might get blocked from archiving things if we archive too deeply.
I'm willing to contribute dev time / money to fix this issue
I like ArchiveBox so far / would recommend it to a friend
I've had a lot of difficulty getting ArchiveBox set up
I'm a fairly "new" systems admin, and I haven't set ArchiveBox up in a public environment; it is only running on my laptop. I could easily set it up, as I have already deployed some other Django-based apps to a system, but I don't have time to do things.
pirate changed the title from "Feature Request: Conditional requests" to "Feature Request: use cavhing headers to determine if content has changed since last snapshot" on Jan 29, 2024
pirate changed the title from "Feature Request: use cavhing headers to determine if content has changed since last snapshot" to "Feature Request: use cache control headers to determine if content has changed since last snapshot" on Jan 29, 2024