2
0
mirror of https://github.com/WikiTeam/wikiteam synced 2024-11-15 00:15:00 +00:00
Commit Graph

56 Commits

Author SHA1 Message Date
nemobis
18a7f42086 Update with last raw list and checkalive.py
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@953 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-15 07:51:49 +00:00
nemobis
cbd0905cba Add 2k more URLs from another crawl
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@950 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-14 19:04:40 +00:00
nemobis
eb60580e91 New URLs from Incola
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@946 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-11 12:29:00 +00:00
nemobis
a1c89623a4 Another intermediate update with results from one more run
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@943 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-02-03 08:52:59 +00:00
nemobis
2bb3fd7a50 Add neoseeker.com, from mutante's wikistats farm list
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@937 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-31 12:18:41 +00:00
nemobis
43074360b7 Experience shows 30 seconds is a more realistic timeout
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@932 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-29 10:47:15 +00:00
nemobis
8cf60a3285 Re-updated pavlo list with 30 s timeout
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@930 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-28 06:53:54 +00:00
nemobis
e2956e60a0 Replace some index.php with api.php where available
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@929 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 22:38:18 +00:00
scottdb56
57da61aac6 a minor syntax-error fix in checkalive.pl
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@928 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 22:10:27 +00:00
nemobis
8da3f15b35 Some manual filtering
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@927 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 20:53:11 +00:00
nemobis
84a2f9d6dc Add index.php discovery and other fixes, update checked lists consequently; bad input makes it spit ugly errors but it keeps going
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@926 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 20:47:14 +00:00
nemobis
eba5f2d54e More gamepedia from their homepage
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@925 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 14:14:36 +00:00
scottdb56
51ee9e9847 added a user-agent and another search string
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@924 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 13:41:29 +00:00
nemobis
009682a037 Sync API check needle with checkalive.pl, </api> is unreliable
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@923 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 11:02:05 +00:00
nemobis
ac8ed21b0c Add Terraria, 270 wikis might be missing but where is the list?
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@922 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 10:52:17 +00:00
nemobis
01b177bcf2 Update raw list with scraper run by odder
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@921 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-27 10:16:59 +00:00
scottdb56
0ac7e477f5 Re-formatting of readme-checkalive.txt
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@920 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-26 22:23:34 +00:00
scottdb56
fc9207291a readme for checkalive.pl
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@919 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-26 22:02:16 +00:00
scottdb56
d9247aa0ba Lots of changes - improved error handling, progress reporting and other minor changes
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@918 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-26 21:58:05 +00:00
nemobis
2bdf8da30c Add orain and gamepedia lists, might have mistakes
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@915 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-25 22:44:39 +00:00
nemobis
7e412d1f17 update with export of http://www.shoutwiki.com/wiki/Category:Flat_list_of_all_wikis
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@911 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-25 11:02:34 +00:00
scottdb56
fb87cd9951 This is the first publicly available version of this Perl script.
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@901 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-23 00:29:43 +00:00
nemobis
91cf5b4d08 Commit skeleton for Scott's use
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@900 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-22 07:57:50 +00:00
nemobis
72d43634d5 Update Wikia list
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@899 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-19 22:09:35 +00:00
emijrp
9152b6861b list of wikis extracted from WikiIndex dump, excluding most wikifarms
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@898 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-18 21:03:45 +00:00
nemobis
f7c50f8ee5 Add retroshare
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@896 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-06 15:33:35 +00:00
nemobis
1fb5865166 Remove some obvious false positives
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@895 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-06 14:25:42 +00:00
nemobis
67eeed88e4 Add silly RSD discovery to checkalive.py and update wikis list
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@892 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-05 10:50:18 +00:00
nemobis
b8c90df787 Upload separately some already checked with the script
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@891 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-03 18:38:19 +00:00
nemobis
500e8ef350 Remove some more obvious duplicates including trailing slash
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@890 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-03 17:24:53 +00:00
nemobis
31ea06ff86 Remove also wiki/[A-Z].+$
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@889 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-03 17:17:42 +00:00
nemobis
664fa18ea3 Remove sourceforge wikis and URLs with parameters to index.php
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@888 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-03 17:13:46 +00:00
nemobis
3624c02852 Remove Wikimedia Foundation wikis, other 'wikimedia' URLs cleanup
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@887 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-03 17:04:29 +00:00
nemobis
ccfc95e9f4 Issue 59: Add first dirty list of possible MediaWiki sitesFirst passes of the script, now going on with all TLDs.Just sorted and cleaned of biggest noises like mailing lists, github and stackoverflow.
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@886 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2014-01-03 16:57:37 +00:00
nemobis
002a8d6702 Add first list of sourceforge wikis
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@885 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2013-12-12 13:36:33 +00:00
nemobis
b940293136 Add more wikkii wikis from mutante's wikistats
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@881 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2013-11-17 11:10:36 +00:00
nemobis
75746af1c9 Update to current list of about 340k wikis
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@823 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2013-05-04 10:05:29 +00:00
nemobis
810c94723e update to current list of 300k wikis, got from API, without http:// protocol
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@807 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2012-11-18 19:12:37 +00:00
emijrp
36ea489313 fixing file description bug
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@795 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2012-08-14 19:08:50 +00:00
emijrp
51d775c214 light improvements of checkalive.py script
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@794 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2012-08-14 18:11:37 +00:00
nemobis
06fb988438 Upload my logs.
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@787 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2012-08-12 16:21:23 +00:00
emijrp
d31828dfaa mediawikis pavlo
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@464 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2012-04-07 15:15:09 +00:00
emijrp
657577cc37 checkalive for wiki lists
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@439 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2012-04-07 13:58:19 +00:00
emijrp
5cc010c675 updating listofwikis directory [removing dupes];
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@316 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-12-09 15:29:12 +00:00
emijrp
ed0a3d08a2 filling referata list
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@314 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-12-05 16:39:59 +00:00
emijrp
d3b748e7f2 mediawiki list from Andrew Pavlo
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@313 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-12-05 16:11:10 +00:00
emijrp
b3e33380d6 wikkii.com list
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@257 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-08-05 11:19:31 +00:00
emijrp
de29904341 adding shoutwiki.com wikis
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@190 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-07-08 06:33:24 +00:00
emijrp
2acd8415c7 removing dupes;
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@155 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-05-08 15:52:16 +00:00
emijrp
e1cf136a97 wikia list, > 200000 wikis
git-svn-id: https://wikiteam.googlecode.com/svn/trunk@104 31edc4fc-5e31-b4c4-d58b-c8bc928bcb95
2011-04-17 23:21:13 +00:00