155 Commits (master)
 

Author SHA1 Message Date
arkiver daab40aa6e Version 20240216.01. Use fixed minimum Wget version 1.21.3-at.20231213.03. Use TLSv1.2. Fix check on svc comment content check. 3 months ago
arkiver 48dc016faf Version 20231201.01. Change protocol. 5 months ago
arkiver 5f7cee8d3a Version 20231127.02. New --ciphers value. 5 months ago
arkiver 2b41d8ef42 Version 20231127.01. Use --ciphers SECURE256. 5 months ago
arkiver 7da27ab110 Version 20231118.01. Switch to gnutls. 6 months ago
arkiver 0dc36e31e0 Version 20231115.01. Change cipher list. 6 months ago
arkiver 8fc86a11ca Version 20231111.02. 6 months ago
arkiver 6fdf778e19 Version 20231111.01. Switch ciphers again. 6 months ago
arkiver e87de8969c Version 20231108.02. Move to another cipher. 6 months ago
arkiver 9c9b59dafd Version 20231108.01. Do not install utf8 with luarocks, this is now in base parent image. 6 months ago
arkiver 388e4325c5 Version 20231102.01. Do not keep partial files over rsync. 6 months ago
arkiver 1c2723f9f2 Version 20231026.01. Use --ciphers HIGH:+SHA384. 7 months ago
arkiver e350e69f89 Version 20231020.01. Use gnutls. Support new method of serving Reddit comments. 7 months ago
arkiver 0e7392acd3 Version 20231019.01. Use --secure-protocol=TLSv1_2. 7 months ago
arkiver 4bcc04734f Version 20231017.02. Use --secure-protocol=TLSv1_3. 7 months ago
arkiver b1bf682030 Version 20231017.01. Use --secure-protocol=auto. Use new minimum Wget version checker. 7 months ago
arkiver a0e35bb72d Version 20230910.05. Install Lua utf8 library through warrior-install.sh. 8 months ago
arkiver 3add4f891c Version 20230910.04. Install lua utf8 library. Fix converting unicode codepoint to utf8 character support. 8 months ago
arkiver 12abd58d4d Version 20230910.03. Increase hardcoded multi item size to 100, for soft limiting on tracker side. 8 months ago
arkiver 8a46824231 Version 20230910.02. Remove old Lua files. 8 months ago
arkiver a2ffd1f671 Version 20230910.01. Use cjson instead of JSON.lua. 8 months ago
arkiver e6b1602e31 Version 20230827.01. Use --secure-protocol=TLSv1_3. 9 months ago
arkiver d210e65967
Merge pull request #18 from imerr/master-1
Extra docker container params
9 months ago
Robin Rolf b7feddc147
Extra docker container params
watchtower: `--include-restarting` also update if the container is in a crash loop due to a bad build or the like
grab container: `--log-driver json-file --log-opt max-size=50m` to limit logs, docker defaults to json-file with no limit
9 months ago
arkiver 29a6952edb Version 20230727.03. In the Warrior, do not use GnuTLS compiled Wget-AT. 10 months ago
arkiver 6e73452ec5 Version 20230727.02. Only allow GNU Wget 1.21.3-at.20230623.01. Use Wget-AT option --reject-reserved-subnets. Remove old Wget files. Update README to latest. 10 months ago
arkiver 288c9b731c Version 20230727.01. Use openssl instead of gnutls. 10 months ago
arkiver bb6198cc1a Version 20230627.01. Queue outlinks directly to the urls project. 11 months ago
arkiver f1ef7d1697 Version 20230619.02. Accept 404 on mediaembed URL. 11 months ago
arkiver d2571cde06 Version 20230619.01. Primitive fix to user post verification problems. 11 months ago
arkiver 2b19cdcd43 Version 20230617.01. Use --secure-protocol=auto for Wget-AT. 11 months ago
arkiver 5a0dcd6dd9
Merge pull request #17 from masterX244/master
Ignore fix for certain 404-ing garbage
11 months ago
masterX244 488aaa2181
Update pipeline.py 11 months ago
masterX244 520e8b95d6
Ignore for some garbge URLs that 404
wget guesses too much and generates bad URLs, ignore needed
11 months ago
arkiver bea971f375 Version 20230614.03. Better check for level error page on svc URL. 11 months ago
arkiver be6e32cba5 Version 20230614.02. Extra validity checks. 11 months ago
arkiver e84e804fc5 Version 20230614.01. Fix check for valid data. 11 months ago
arkiver 4936505b0f Version 20230612.02. Add Reddit problem check for /comments/.../comment/ URL. 11 months ago
arkiver 57adbb381c Version 20230612.01. Kill grab when reddit seems to have problems. 11 months ago
arkiver 0ef6368945 Version 20230611.02. Multi item size 40. 11 months ago
arkiver a974b81618 Version 20230611.01. Extra very simple check on validity of old.reddit.com returned body. 11 months ago
arkiver 15a0a1a6f5 Version 20230607.06. Ignore discovered /r/FIFA URL if coming from a /r/EASportFC parent URL. 11 months ago
arkiver fe17191306 Version 20230607.05. Better checking for video. Abort item if no post is found (during blackout for example). 11 months ago
arkiver 7bb5c39419 Version 20230607.04. Abort on video for now. 11 months ago
arkiver f63c8ab696 Version 20230607.03. Prevent getting URL ending with /". Ignore /message/compose URLs. 11 months ago
arkiver 393407520b Version 20230607.02. Very simple content checks to check if response is complete. Properly prevent writing to WARC in cases and do not abort all items when finding a problematic URL. 11 months ago
arkiver 37ba172c61 Version 20230607.01. Use GNU Wget 1.21.3-at.20230605.01 and arguments around DNS. 11 months ago
arkiver da85457aae Version 20230531.01. Use --secure-protocol PFS. 11 months ago
arkiver 48b24323c6 Version 20230530.01. Queue discovered outlinks to urls-stash-reddit. 12 months ago
arkiver a3b5bcecc1 Version 20230529.01. Correctly extract more comment pages from comment pages in the new design. Print debug infrmation for comment pages on old design. 12 months ago