112 Commits (7bb5c394194ca941629d70261874d7804f415350)
 

Author SHA1 Message Date
arkiver 7bb5c39419 Version 20230607.04. Abort on video for now. 1 year ago
arkiver f63c8ab696 Version 20230607.03. Prevent getting URL ending with /". Ignore /message/compose URLs. 1 year ago
arkiver 393407520b Version 20230607.02. Very simple content checks to check if response is complete. Properly prevent writing to WARC in cases and do not abort all items when finding a problematic URL. 1 year ago
arkiver 37ba172c61 Version 20230607.01. Use GNU Wget 1.21.3-at.20230605.01 and arguments around DNS. 1 year ago
arkiver da85457aae Version 20230531.01. Use --secure-protocol PFS. 1 year ago
arkiver 48b24323c6 Version 20230530.01. Queue discovered outlinks to urls-stash-reddit. 1 year ago
arkiver a3b5bcecc1 Version 20230529.01. Correctly extract more comment pages from comment pages in the new design. Print debug infrmation for comment pages on old design. 1 year ago
arkiver 1a14af2095 Version 20230509.02. Support new Wget-AT. 1 year ago
arkiver b2654e9317 Version 20230509.01. Support for new design. 1 year ago
arkiver 7f4db17348 Version 20221021.01. Ignore /tailwind-build.css URL from comment in HTML. 2 years ago
arkiver 8a27002fd3 Version 20221005.01. Max tries for backfeed to 10. 2 years ago
arkiver 35e31af37f Queue redditstatic.com URLs as outlinks. 2 years ago
arkiver bab4b4dcd2 Version 20220729.05. Fix aborting item on bad status code on url: item. Keep old retry code otherwise. 2 years ago
arkiver 8c45a263aa Version 20220729.04. Queue extra found URLs on media URLs to backfeed. 2 years ago
arkiver e8fe03fbd0 Version 20220729.03. Add url: prefix to url item. 2 years ago
arkiver 2d8fa4034b Version 20220729.02. Support older Wget versions. 2 years ago
arkiver f81b2ce97e Version 20220729.01. Queue media URLs back to reddit project and download individually. 2 years ago
arkiver edacb2065a Fix README. 2 years ago
arkiver cc83009a94 Version 20220605.01. Support GNU Wget 1.21.3-at.20220503.02. Fix killing crawl when items cannot be queued. 2 years ago
arkiver 7c4cf4548e Version 20220415.02. 2 years ago
arkiver 754fd256cb
Merge pull request #13 from NGTmeaty/patch-1
Add support for latest change in _options
2 years ago
arkiver 0ce1c59ca4 Version 20220415.01. Do not queue /r/undefined/ URLs. 2 years ago
Jake L a858c33e29
Add support for latest change in _options 3 years ago
arkiver da28d3c902 Version 20220323.03. Fix items to maxtries variable name. Fix backfeed key name. 3 years ago
arkiver 8944cf1fc6 Version 20220323.02. Fix items to maxtries variable name. 3 years ago
arkiver 10eaa7c50c Version 20220323.01. Fix backfeed. Fix maxtries use. 3 years ago
arkiver 28f132a052 Version 20220312.01. Fix backfeed. 3 years ago
arkiver 4f50a0d699 Version 20220311.01. Use new backfeed endpoint for queuing. 3 years ago
arkiver 383c101aef Version 20220109.02. Cut off URL at space when found between brackets without href= in front. 3 years ago
arkiver df35317e0c Version 20220109.01. Add codepoint to utf8 support. Percent encode outlinks correctly. 3 years ago
arkiver 8a3f8cd1de Version 20211004.02. Fix incomplete facebook.com fix. 3 years ago
arkiver d0070db67a Version 20211004.01. Do not check facebook.com while down at the moment. 3 years ago
arkiver 0c5e8cd3bd Version 20211001.01. Use GNU Wget 1.20.3-at.20211001.01. 3 years ago
arkiver ed80cb5a9d Version 20210707.01. Do not get media for cross posts. 3 years ago
arkiver 4b976e2ea7 Version 20210521.01. Use TLS 1.2. 3 years ago
Katie Holly f4619bb17f use onbuild-based image 3 years ago
km09 e6b876e9e6
New day.. new wget-at 1.20.3-at.20210504.01 3 years ago
Thomas Glass 1f9e995b4e
20210410.01 - New day, new wget-at 4 years ago
arkiver 6e15841550 Version 20210407.01. Improve video archiving. Detect if video is still being processed by reddit. 4 years ago
arkiver 1b3690d994 Version 20210330.04. Only decode unicode characters in URLs on v.redd.it URLs. 4 years ago
arkiver ce7fff480d Version 20210330.03. Unescape unicode characters. Do not HLS for video. 4 years ago
arkiver ad04f45d4f Fix typo. 4 years ago
arkiver adc7f9c6fb Version 20210330.02. Skip images that are only in JSON and not on web page. 4 years ago
arkiver 07ed16c44b Version 20210330.01. Handle 403 on v.redd.it on deleted post. 4 years ago
arkiver 8849165130 Version 20210321.01. Do not get all video sizes. 4 years ago
arkiver d3b6659419 Version 20210312.01. Get URLs with utm_* and context params. 4 years ago
arkiver a5c798945c Version 20210306.01. Remove some AppleWebKir user-agents for getting 403s. 4 years ago
Katie Holly eaad7cd7e7
add 1.20.3-at.20210212.02 as supported wget-at version 4 years ago
Katie Holly 3b4a2ef5a7
20210225.01: update dict url 4 years ago
Thomas Glass e6c33f9433
Updated warrior support 4 years ago