forked from Aloisius/nutch
-
Notifications
You must be signed in to change notification settings - Fork 2
Issues: commoncrawl/nutch
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Upgrade webarchive-commons dependency to include fix of SURT maker / URL canonicalizer
bug
dependency
Dependency upgrades and similar
#24
opened Aug 28, 2023 by
sebastian-nagel
Evaluate zlib-cloudflare for 15% performance speedup of WarcRecordWriter
#22
opened Jul 14, 2023 by
tfmorris
WARC writer: unit tests for conversion of URLs to URIs
enhancement
#21
opened Jul 12, 2023 by
sebastian-nagel
Improvements in Hadoop's s3a output committers obsolete class S3FileOutputFormat
#16
opened Nov 22, 2019 by
sebastian-nagel
More detailed marking of truncated records due to "network disconnect"
enhancement
#13
opened Aug 30, 2019 by
sebastian-nagel
WarcRecordWriter to write and index WAT/WET files
enhancement
help wanted
#9
opened Jul 4, 2019 by
sebastian-nagel
5 tasks
ProTip!
What’s not been updated in a month: updated:<2024-11-17.