Feature Request: New extractor to download javascript/ts sourcemap files for any compiled/minified .js assets used in archived pages #1291

Open
jensolsson opened this issue Dec 17, 2023 · 1 comment
Labels
  • expected: unlikely unless contributed (This change is unlikely to be made unless someone contributes a PR for review.)
  • help wanted
  • size: medium
  • status: idea-phase (Work is tentatively approved and is being planned / laid out, but is not ready to be implemented yet)
  • touches: configuration
  • touches: data/schema/architecture
  • touches: dependencies/packaging (Issues or changes that add/remove/affect dependencies)
  • touches: docs
  • type: enhancement
  • why: functionality (Intended to improve ArchiveBox functionality or features)

Comments


jensolsson commented Dec 17, 2023

I love that ArchiveBox downloads the JavaScript files used by an archived site. However, to make minified JavaScript readable, it would be great to also save the matching .map files (JavaScript source maps). If I understand correctly, they have the same name as the script but end in .js.map instead of .js. Can this be added easily?

If someone could give me some pointers on where to start, I could probably do the work.
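
As a rough illustration of the lookup described above, here is a minimal sketch of how the source map URL for a downloaded script could be guessed. It is not ArchiveBox code and does not use any ArchiveBox API; the function name `guess_source_map_url` and the example URLs are hypothetical. It assumes the script text is already available, checks for a trailing `//# sourceMappingURL=` comment (the usual way a script declares its map), and only falls back to the `.js` to `.js.map` naming convention when no comment is present. Servers can also advertise a map through a `SourceMap` HTTP response header, which this sketch does not handle.

```python
import re
from typing import Optional
from urllib.parse import urljoin, urlsplit, urlunsplit

# Matches a "//# sourceMappingURL=..." comment on its own line
# (the legacy "//@ sourceMappingURL=..." form is accepted too).
SOURCE_MAP_COMMENT = re.compile(
    r"^\s*//[#@]\s*sourceMappingURL=(\S+)\s*$", re.MULTILINE
)


def guess_source_map_url(js_url: str, js_text: Optional[str] = None) -> Optional[str]:
    """Hypothetical helper: return a candidate source map URL for a script.

    Returns None when the map is embedded in the script as a data: URI,
    because there is no separate file to download in that case.
    """
    if js_text:
        matches = SOURCE_MAP_COMMENT.findall(js_text)
        if matches:
            declared = matches[-1]  # use the last declaration in the file
            if declared.startswith("data:"):
                return None
            # Relative map paths are resolved against the script's own URL.
            return urljoin(js_url, declared)

    # Fallback (an assumption, not a guarantee): app.js -> app.js.map,
    # keeping any query string and dropping the fragment.
    parts = urlsplit(js_url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path + ".map", parts.query, ""))
```

The candidate URL is only a guess: many production sites do not publish their source maps, so whatever fetches it would need to treat a 404 as a normal outcome rather than a failure.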

@jensolsson jensolsson changed the title Feature Request: ... Feature Request: Try to download javascript.map also if javascript.js is downloaded Dec 17, 2023
pirate (Member) commented Dec 17, 2023

This would likely require a new extractor, since none of our existing extractors try to download or are even aware of .map files.

You can check out our docs on that process here: https://github.com/ArchiveBox/ArchiveBox#contributing-a-new-extractor

Though to be honest, this one would be fairly low on my priority list, as it only affects developer experience and doesn't visually impact replay fidelity.

If you're interested in contributing an extractor, we'd love to have help adding one of these higher-impact ones, and I'm offering $250~$1000+ bounties for contributions at the moment:

  • an extractor to save discussion threads from forums, comment sections, Reddit, Twitter, etc. as Markdown/JSON
  • an extractor to fetch galleries of images, e.g. gallery-dl
  • an extractor to fetch linked 3D assets like meshes, shaders, STL files, and other CAD models
  • an extractor to fetch research papers referenced by DOI numbers from Sci-Hub/JSTOR/etc.

@pirate pirate changed the title Feature Request: Try to download javascript.map also if javascript.js is downloaded Feature Request: New extractor to download javascript/ts sourcemap files for any compiled/minified .js assets used in archived pages Dec 17, 2023
@pirate pirate added the labels status: idea-phase, size: medium, touches: configuration, why: functionality, touches: data/schema/architecture, help wanted, touches: dependencies/packaging, touches: docs, type: enhancement, and expected: unlikely unless contributed on Dec 18, 2023