nemobis
dfc0a53827
Also the list of done wikis
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@976 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-06-04 11:43:14 +00:00
nemobis
838beb90b8
Update todo removing wikis done in last round
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@975 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-06-04 11:36:37 +00:00
scottdb56
9b8673768f
mediawikis_2013_byothers.txt has been filtered
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@974 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-04-24 03:03:16 +00:00
scottdb56
ac60918e7b
Uploaded BHW-alive_wikis.txt
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@972 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-04-04 04:07:42 +00:00
nemobis
0b2875756f
Updated alive list from other 2013 sources
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@971 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-03-23 21:44:17 +00:00
scottdb56
1b0949ce7b
other.list has been filtered and added as other-alive_wikis.list
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@970 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-03-06 05:37:37 +00:00
scottdb56
ab1e13207f
Filtered out duplicates from mediawikis_2013-alive.txt
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@968 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-03-05 05:57:32 +00:00
scottdb56
17bc29b66f
mediawikis_2013.txt filtered by checkalive.pl
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@967 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-03-03 05:05:45 +00:00
scottdb56
b9be07936a
A major update - checkalive.pl now checks for api.php and writes it to the list if found.
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@965 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-27 05:50:28 +00:00
nemobis
575c9dd3ea
Issue 85: more cross-platform shebang on all scripts... for real, meh
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@964 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-26 23:24:05 +00:00
nemobis
e61ed576ac
Issue 85: more cross-platform shebang on all scripts
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@963 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-26 23:23:40 +00:00
nemobis
ac4c93c12a
Issue 85: more cross-platform shebang on all scripts
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@962 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-26 23:22:53 +00:00
nemobis
79f912db86
Add wikis from Pavlo lists not verified as uploaded
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@961 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-25 12:27:29 +00:00
nemobis
d0f658a890
Remove old lists, add list of wikis already (re)downloaded from Pavlo and mediawikis_2013 lists
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@960 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-25 12:22:21 +00:00
nemobis
7eba594a2c
Use api.php URLs where we found them to be working, step 2
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@959 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-16 18:50:47 +00:00
nemobis
92f77ab085
Use api.php URLs where we found them to be working
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@958 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-16 17:14:41 +00:00
nemobis
395408e5a6
Also neoseeker and sourceforge
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@956 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-15 08:42:51 +00:00
nemobis
3fde888394
Remove some wikifarms wikis and a couple duplicates
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@955 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-15 08:08:55 +00:00
nemobis
04e55c6622
Remove a hundred index.php redundant with api.php URLs
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@954 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-15 07:54:05 +00:00
nemobis
18a7f42086
Update with last raw list and checkalive.py
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@953 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-15 07:51:49 +00:00
nemobis
cbd0905cba
Add 2k more URLs from another crawl
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@950 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-14 19:04:40 +00:00
nemobis
4a5c91d471
Typo in filename
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@947 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-13 08:51:54 +00:00
nemobis
eb60580e91
New URLs from Incola
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@946 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-11 12:29:00 +00:00
nemobis
53236811d9
Also reupload the dump when verified missing
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@945 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-10 23:05:44 +00:00
nemobis
bab70e31c0
Use temporary name for history archive too to avoid conflicts
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@944 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-09 18:58:00 +00:00
nemobis
a1c89623a4
Another intermediate update with results from one more run
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@943 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-03 08:52:59 +00:00
nemobis
403dc213ef
Issue 71: English-only match for an older case
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@942 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-02 23:06:10 +00:00
nemobis
a3fe96b782
Revamp uploader.py for wider usage: move issues to tracker; add options --help, --prune-directories, --prune-wikidump, --admin
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@940 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-02 18:00:07 +00:00
nemobis
b74d6f79ce
Reduce requests for existing items and remove whitespace: tested with wiki-smackdownneoseekercom_w
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@939 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-02 11:58:49 +00:00
nemobis
54f9798be0
Mark staging 7z files as .tmp to avoid uploading them by mistake
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@938 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-31 14:19:15 +00:00
nemobis
2bb3fd7a50
Add neoseeker.com, from mutante's wikistats farm list
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@937 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-31 12:18:41 +00:00
nemobis
a358ffbfe0
Issue 88: Escaped a bit too much, some HTML we really need
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@936 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-29 22:34:25 +00:00
nemobis
c95465ac14
Nice to see curl progress, but only for actual upload
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@935 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-29 21:26:04 +00:00
nemobis
0ef0a5b229
Actually update last-updated-date
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@934 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-29 15:55:16 +00:00
Hydriz
16ec333e7d
Adding rewrite code so others can build on top of it
...
This is a partial rewrite of the dumpgenerator.py, and
is largely incomplete. I am no longer working on this
rewrite, so I am releasing it for others to build upon
it and work towards releasing DumpGenerator 2.0.
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@933 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-29 13:36:19 +00:00
nemobis
43074360b7
Experience shows 30 seconds is a more realistic timeout
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@932 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-29 10:47:15 +00:00
nemobis
991b96fbe2
Mark dump uploaded only if confirmed by curl exit code
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@931 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-29 10:44:52 +00:00
nemobis
8cf60a3285
Re-updated pavlo list with 30 s timeout
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@930 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-28 06:53:54 +00:00
nemobis
e2956e60a0
Replace some index.php with api.php where available
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@929 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 22:38:18 +00:00
scottdb56
57da61aac6
a minor syntax-error fix in checkalive.pl
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@928 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 22:10:27 +00:00
nemobis
8da3f15b35
Some manual filtering
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@927 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 20:53:11 +00:00
nemobis
84a2f9d6dc
Add index.php discovery and other fixes, update checked lists consequently; bad input makes it spit ugly errors but it keeps going
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@926 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 20:47:14 +00:00
nemobis
eba5f2d54e
More gamepedia from their homepage
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@925 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 14:14:36 +00:00
scottdb56
51ee9e9847
added a user-agent and another search string
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@924 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 13:41:29 +00:00
nemobis
009682a037
Sync API check needle with checkalive.pl, </api> is unreliable
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@923 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 11:02:05 +00:00
nemobis
ac8ed21b0c
Add Terraria, 270 wikis might be missing but where is the list?
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@922 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 10:52:17 +00:00
nemobis
01b177bcf2
Update raw list with scraper run by odder
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@921 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 10:16:59 +00:00
scottdb56
0ac7e477f5
Re-formatting of readme-checkalive.txt
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@920 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-26 22:23:34 +00:00
scottdb56
fc9207291a
readme for checkalive.pl
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@919 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-26 22:02:16 +00:00
scottdb56
d9247aa0ba
Lots of changes - improved error handling, progress reporting and other minor changes
...
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@918 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-26 21:58:05 +00:00