2
0
mirror of https://github.com/WikiTeam/wikiteam synced 2024-11-15 00:15:00 +00:00
Commit Graph

126 Commits

Author SHA1 Message Date
emijrp
f94a04562f fixed for enwiki which uses several chunks for big files;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@168 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-06-26 12:48:26 +00:00
emijrp
5254085eb4 wikipedia dumps downloader;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@167 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-06-25 22:24:41 +00:00
emijrp
502d3a3d22 more wikis;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@166 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-06-13 22:28:10 +00:00
emijrp
778a8ad7ae adding support to download images on old mediawikis; regexp4;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@165 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-06-13 20:19:13 +00:00
emijrp
5b7674edb7 params in main() to call it from external scripts (using import dumpgenerator; dumpgenerator.main(params=params))
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@161 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-06-10 14:29:30 +00:00
emijrp
e4d0de09d6 params in main() to call it from external scripts (using import dumpgenerator; dumpgenerator.main(params=params))
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@160 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-06-10 14:25:04 +00:00
emijrp
852d6658ad more wikis;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@159 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-06-05 14:40:58 +00:00
emijrp
da6c4d1468 print error output when splitters error occurs; 5000 -> 500 in image list parser
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@158 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-06-04 11:10:45 +00:00
emijrp
c5b6b8a866 md5.new->md5; new uploaded wikis;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@157 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-30 20:31:32 +00:00
emijrp
11420962f5 print
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@156 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-08 16:00:07 +00:00
emijrp
2acd8415c7 removing dupes;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@155 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-08 15:52:16 +00:00
emijrp
835158b79b more wikis; md5 or hashlib
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@154 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-07 22:04:00 +00:00
emijrp
59fe050558 more wikis;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@153 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-06 22:29:08 +00:00
emijrp
c7b511ee38 more wikis; new content spliter;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@152 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-05 21:46:48 +00:00
emijrp
9a20327939 undoHTMLEntities for titles when scrapped from Allpages; protocol http https;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@150 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-04 16:53:17 +00:00
emijrp
af64c6b6c5 http or https
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@149 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-04 09:26:20 +00:00
emijrp
5cd7be9987 removing old code in wikiadownloader;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@148 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-03 21:40:21 +00:00
emijrp
e9f379b888 fixing print;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@147 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-03 21:07:17 +00:00
emijrp
1b7fa4c994 help
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@146 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-01 18:26:41 +00:00
emijrp
b26a86cdd5 verbose false for imagedump;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@144 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-01 16:50:38 +00:00
emijrp
896393016d getting date from index.json (wikia);
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@143 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-01 16:48:25 +00:00
emijrp
523ae5c983 months;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@142 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-01 14:54:18 +00:00
emijrp
4e19191cc2 wikia downloader;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@141 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-01 09:52:21 +00:00
emijrp
17235cffd9 better comments;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@140 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-30 19:44:02 +00:00
emijrp
6e45398878 when full history fails, retrieve only the last version; various server errors handled
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@139 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-30 18:53:35 +00:00
emijrp
e4b233cc37 print verbose; seconds numbers to variables;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@138 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-30 17:05:59 +00:00
emijrp
7b85e243b0 :
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@137 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-30 14:38:11 +00:00
emijrp
3c49a30764 fixing issue #12 and issue #13;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@136 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-30 14:37:15 +00:00
emijrp
3388c7e83b print
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@135 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-30 12:43:16 +00:00
emijrp
101cd62e3d print
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@134 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-30 12:40:02 +00:00
emijrp
46303d780d fixing issue #11;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@133 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-30 12:37:54 +00:00
emijrp
1f91b4c63e fixing issue #11;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@132 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-30 12:21:50 +00:00
emijrp
220896ccac more wikis
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@131 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-29 18:34:04 +00:00
emijrp
8bd90b4cde more wikis
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@127 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-29 09:02:08 +00:00
emijrp
e13b4a6428 removing http/https for file prefixes;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@126 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-29 08:59:13 +00:00
emijrp
0f3f7d51f5 more wikis
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@124 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-27 23:16:46 +00:00
emijrp
d0cd9aa646 more wikis
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@123 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-27 23:04:17 +00:00
emijrp
79f961aef5 more wikis;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@122 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-27 22:23:15 +00:00
emijrp
2c0d8b97b0 git-svn-id: https://wikiteam.googlecode.com/svn/trunk@121 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95 2011-04-27 15:51:46 +00:00
emijrp
2bac6cf3f0 git-svn-id: https://wikiteam.googlecode.com/svn/trunk@120 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95 2011-04-26 23:48:51 +00:00
emijrp
d6c1a773a0 removing some old #fix comments; moving TODO to Issues section at Google Code;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@119 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-23 21:55:31 +00:00
emijrp
88cbc0e871 removing some old #fix comments;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@118 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-23 21:54:03 +00:00
emijrp
23c9a06a31 removing some old #fix comments;removing thread option not implemented and better not to be done;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@117 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-23 21:52:13 +00:00
emijrp
05d1fb97c4 sorting titles; fixing issue #9;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@116 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-23 21:45:57 +00:00
emijrp
06b6f79989 git-svn-id: https://wikiteam.googlecode.com/svn/trunk@113 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95 2011-04-23 16:44:37 +00:00
emijrp
bc36eb0f85 uploaded wikis;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@110 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-18 21:02:40 +00:00
emijrp
e1cf136a97 wikia list, > 200000 wikis
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@104 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-17 23:21:13 +00:00
emijrp
d4bc4cba9a readme for wikilists
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@103 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-17 23:18:49 +00:00
emijrp
24ea30f29f scribblewiki
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@101 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-17 22:55:51 +00:00
emijrp
eb6f7c2639 adding some more empty lists to do
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@100 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-17 22:36:54 +00:00