247sports.com
feat: 247sports.com extractor ( #64 )
2016-12-21 20:52:23 -08:00
abcnews.go.com
feat: abcnewsgo parser ( #90 )
2017-02-02 17:43:35 -05:00
biorxiv.org
chore: minifying biorxiv.com fixture ( #478 )
2019-08-20 09:46:15 -07:00
blisterreview.com
feat: add custom extractor for blisterreview.com ( #299 )
2019-03-01 16:48:26 -08:00
bookwalker.jp
feat: add bookwalker.jp custom parser ( #374 )
2019-04-15 11:06:10 +03:00
buzzap.jp
feat: add buzzap.jp custom parser ( #351 )
2019-04-09 11:35:40 +03:00
clinicaltrials.gov
Custom Extractor for clinicaltrials.gov ( #305 )
2019-05-27 09:25:51 +03:00
deadline.com
feat: add deadline.com custom parser ( #383 )
2019-04-24 15:29:02 +03:00
deadspin.com
feat: support lazy loading video on deadspin
2016-10-26 11:53:42 -07:00
epaper.zeit.de
Implemented custom extractor epaper.zeit.de ( #488 )
2019-08-28 07:15:14 -07:00
fandom.wikia.com
feat: added wikia extractor
2016-10-04 12:06:19 -04:00
fortune.com
feat: fortune parser ( #84 )
2017-01-23 16:47:06 -08:00
forward.com
feat: forward.com parser ( #144 )
2017-03-14 17:53:23 -04:00
fusion.net
feat: fusion parser
2017-02-02 10:54:49 -07:00
genius.com
feat: custom genius parser. ( #284 )
2019-04-09 12:49:24 +03:00
getnews.jp
feat: add getnews.jp custom parser ( #402 )
2019-05-03 13:10:55 +03:00
github.com
Extract content from GitHub repos. ( #306 )
2019-03-14 08:48:33 -07:00
gothamist.com
Feat: gothamist extractor ( #151 )
2017-03-09 13:13:46 -05:00
hellogiggles.com
feat: hellogiggles parser ( #107 )
2017-01-21 14:07:20 -05:00
ici.radio-canada.ca
feat: ici.radio-canada.ca extractor ( #156 )
2017-03-13 17:23:20 -04:00
japan.cnet.com
feat: add japan.cnet.com custom parser ( #382 )
2019-04-24 14:39:54 +03:00
japan.zdnet.com
feat: add japan.zdnet.com custom parser ( #410 )
2019-05-08 13:51:03 +03:00
jvndb.jvn.jp
feat: add jvndb.jvn.jp custom parser ( #345 )
2019-04-09 12:05:03 +03:00
mashable.com
feat: mashable parser ( #76 )
2017-01-23 15:00:18 -08:00
medium.com
fix: incorrect parsing on medium.com ( #477 )
2019-08-28 07:04:27 -07:00
money.cnn.com
feat: add money.cnn custom parser ( #26 )
2016-11-29 15:13:29 -08:00
newrepublic.com
feat: new republic custom extractor ( #25 )
2016-11-29 15:30:52 -08:00
news.mynavi.jp
feat: add news.mynavi.jp custom parser ( #287 )
2019-03-01 16:45:32 -08:00
news.nationalgeographic.com
feat: news.natgeo parser ( #88 )
2017-02-08 15:27:35 -07:00
nock
fix: return early if creating the resource failed. ( #285 )
2019-02-20 16:48:51 -08:00
nymag.com
feat: improve nymag.com extractor to grab deks from features
2016-09-14 13:12:40 -04:00
obamawhitehouse.archives.gov
feat: improve wh parser ( #168 )
2017-03-24 14:41:40 -07:00
observer.com
feat: observer parser ( #91 )
2017-01-21 12:47:26 -05:00
otrs.com
feat: add otrs.com custom parser ( #353 )
2019-04-09 11:17:58 +03:00
pagesix.com
feat: pagesix parser ( #97 )
2017-02-07 17:38:09 -05:00
people.com
feat: people extractor ( #70 )
2016-12-21 19:46:48 -08:00
phpspot.org
feat: add phpspot.org custom parser ( #369 )
2019-04-12 17:18:47 +03:00
pitchfork.com
feat: pitchfork extractor ( #439 )
2019-06-26 09:02:17 -07:00
qz.com
feat: qz parser ( #81 )
2017-01-23 16:08:07 -08:00
sandiegouniontribune.com
feat: ability to add custom extractors via api ( #484 )
2019-09-04 07:32:28 -07:00
scan.netsecurity.ne.jp
feat: add scan.netsecurity.ne.jp custom parser ( #347 )
2019-04-09 11:59:27 +03:00
sciencefly.com
feat: sciencefly extractor ( #116 )
2017-02-02 11:26:29 -05:00
sect.iij.ad.jp
feat: add sect.iij.ad.jp custom parser ( #404 )
2019-05-03 13:19:06 +03:00
takagi-hiromitsu.jp
feat: add takagi-hiromitsu.jp custom parser ( #364 )
2019-04-12 18:11:05 +03:00
techlog.iij.ad.jp
feat: add techlog.iij.ad.jp custom parser ( #405 )
2019-05-08 13:20:47 +03:00
thefederalistpapers.org
feat: thefederalistpapers parser ( #101 )
2017-02-07 14:30:52 -05:00
thoughtcatalog.com
feat: thought catalog parser ( #102 )
2017-01-21 13:52:00 -05:00
twitter.com
feat: generator for custom parsers and some documentation
2016-09-20 10:37:03 -04:00
uproxx.com
uproxx extractor ( #66 )
2016-12-21 21:05:10 -08:00
weekly.ascii.jp
feat: add weekly.ascii.jp custom parser ( #401 )
2019-05-08 13:10:42 +03:00
wired.jp
feat: add wired.jp custom parser ( #409 )
2019-05-08 13:32:04 +03:00
www.al.com
feat: al.com parser ( #110 )
2017-02-03 11:45:45 -07:00
www.americanow.com
feat: america now parser ( #114 )
2017-02-02 13:46:20 -07:00
www.androidcentral.com
feat: androidcentral parser ( #119 )
2017-02-07 18:20:04 -05:00
www.aol.com
feat: aol custom extractor ( #42 )
2016-12-01 17:05:15 -08:00
www.apartmenttherapy.com
feat: Add custom extrator for Apartment Therapy
2016-10-17 10:35:22 -05:00
www.asahi.com
feat: add www.asahi.com custom parser ( #350 )
2019-04-09 11:42:14 +03:00
www.bloomberg.com
feat: bloomberg extractor ( #59 )
2016-12-07 14:39:00 -05:00
www.broadwayworld.com
feat: Add custom parser for broadwayworld.com
2016-10-13 16:22:33 -05:00
www.bustle.com
feat: bustle extractor ( #60 )
2016-12-08 15:32:08 -05:00
www.buzzfeed.com
Fix: extension bugs ( #47 )
2016-12-02 16:02:00 -08:00
www.cbssports.com
feat: cbs sports parser ( #98 )
2017-02-07 10:45:48 -05:00
www.chicagotribune.com
feat: chicago tribune parser ( #75 )
2017-01-22 12:18:10 -05:00
www.cinemablend.com
feat: cinema blend parser ( #105 )
2017-02-06 09:02:11 -07:00
www.cnbc.com
fix: Adapt CNBC extractor to article redesign ( #336 )
2019-03-25 15:43:40 -07:00
www.cnet.com
feat: cnet parser ( #104 )
2017-02-07 11:55:04 -07:00
www.cnn.com
Feat cnn extractor ( #34 )
2016-11-30 14:55:04 -08:00
www.dmagazine.com
feat: dmagazine parser ( #80 )
2017-01-23 15:52:05 -08:00
www.elecom.co.jp
feat: add www.elecom.co.jp custom parser ( #348 )
2019-04-09 11:54:57 +03:00
www.eonline.com
feat: eonline parser ( #68 )
2016-12-21 21:24:14 -08:00
www.fastcompany.com
feat: add fastcompany custom parser ( #191 )
2019-01-30 09:30:24 +02:00
www.fool.com
feat: fool.com parser ( #158 )
2017-03-14 18:04:19 -04:00
www.fortinet.com
feat: add fortinet custom parser ( #188 )
2019-01-30 09:33:36 +02:00
www.gizmodo.jp
feat: add www.gizmodo.jp custom parser ( #400 )
2019-05-03 13:06:51 +03:00
www.howtogeek.com
dx: comment on custom parser pr fix ( #278 )
2019-02-28 11:11:03 -08:00
www.huffingtonpost.com
Feat: huffington post extractor ( #28 )
2016-11-29 15:50:48 -08:00
www.infoq.com
feat: add www.infoq.com custom parser ( #368 )
2019-04-12 17:30:46 +03:00
www.inquisitr.com
feat: inquisitor parser ( #72 )
2017-01-18 16:34:22 -08:00
www.ipa.go.jp
feat: add www.ipa.go.jp custom parser ( #408 )
2019-05-03 13:40:42 +03:00
www.itmedia.co.jp
feat: add www.itmedia.co.jp custom parser ( #366 )
2019-04-12 17:51:16 +03:00
www.jnsa.org
feat: add www.jnsa.org custom parser ( #346 )
2019-04-09 16:51:25 +03:00
www.latimes.com
feat: latimes parser ( #92 )
2017-02-08 11:29:03 -05:00
www.lemonde.fr
feat: add le monde extractor ( #415 )
2019-05-14 14:53:49 +03:00
www.lifehacker.jp
feat: add www.lifehacker.jp custom parser ( #403 )
2019-05-03 13:14:53 +03:00
www.linkedin.com
Feat: LinkedIn parser ( #123 )
2017-01-26 10:11:10 -08:00
www.littlethings.com
feat: added littlethings extractor
2016-10-04 15:02:23 -04:00
www.macrumors.com
feat: macrumors parser ( #120 )
2017-02-07 19:15:29 -05:00
www.mentalfloss.com
feat: mental floss parser ( #94 )
2017-02-03 11:40:01 -05:00
www.miamiherald.com
feat: miami herald parser ( #69 )
2016-12-21 21:35:34 -08:00
www.moongift.jp
feat: add www.moongift.jp custom parser ( #367 )
2019-04-12 17:40:55 +03:00
www.msn.com
feat: added incomplete msn extractor
2016-10-03 13:27:51 -04:00
www.msnbc.com
feat: msnbc parser ( #100 )
2017-02-06 18:08:49 -05:00
www.nationalgeographic.com
feat: natgeo parser ( #89 )
2017-02-08 15:01:55 -07:00
www.nbcnews.com
feat: nbc news parser ( #74 )
2017-01-18 17:28:21 -08:00
www.newyorker.com
fix: new yorker extractor ( #414 )
2019-05-15 11:00:50 +03:00
www.nj.com
feat: nj.com parser ( #73 )
2017-01-18 16:49:05 -08:00
www.npr.org
feat: npr parser ( #86 )
2017-01-23 17:23:02 -08:00
www.nydailynews.com
feat: ny daily news parser ( #87 )
2017-02-02 12:30:16 -05:00
www.nytimes.com
fix: nytimes custom parser title selector ( #181 )
2018-10-12 13:39:41 -07:00
www.opposingviews.com
feat: opposing views parser ( #103 )
2017-02-06 12:22:42 -05:00
www.oreilly.co.jp
feat: add www.oreilly.co.jp custom parser ( #407 )
2019-05-03 13:30:48 +03:00
www.ossnews.jp
feat: add www.ossnews.jp custom parser ( #352 )
2019-04-09 11:30:56 +03:00
www.phoronix.com
feat: custom parser for phoronix.com. ( #431 )
2019-06-26 09:55:13 -07:00
www.politico.com
feat: added politico extractor
2016-10-05 13:51:11 -04:00
www.popsugar.com
feat: popsugar parser ( #93 )
2017-01-21 13:11:00 -05:00
www.prospectmagazine.co.uk
feat: prospect magazine parser ( #147 )
2017-03-14 18:34:40 -04:00
www.publickey1.jp
feat: add www.publickey1.jp custom parser ( #365 )
2019-04-12 18:00:51 +03:00
www.qdaily.com
feat: qdaily parser ( #146 )
2017-03-14 17:37:53 -04:00
www.rawstory.com
feat: rawstory parser ( #109 )
2017-02-07 12:53:05 -07:00
www.rbbtoday.com
feat: add rbbtoday.com custom parser ( #411 )
2019-05-08 14:04:03 +03:00
www.recode.net
feat: recode parser ( #85 )
2017-01-23 17:02:33 -08:00
www.reddit.com
feat: Add custom parser for Reddit ( #307 )
2019-03-08 14:37:24 -08:00
www.refinery29.com
feat: refinery29 parser ( #71 )
2016-12-21 21:57:13 -08:00
www.reuters.com
feat: reuters parser ( #78 )
2017-01-23 15:16:37 -08:00
www.rollingstone.com
feat: rolling stone extractor ( #65 )
2016-12-21 20:30:34 -08:00
www.sanwa.co.jp
feat: add www.sanwa.co.jp custom parser ( #349 )
2019-04-09 11:50:48 +03:00
www.sbnation.com
feat: sbnation extractor ( #55 )
2016-12-07 14:25:57 -05:00
www.si.com
feat: si parser ( #118 )
2017-02-07 16:52:11 -05:00
www.slate.com
Feat: Slate extractor ( #153 )
2017-03-13 17:44:04 -04:00
www.theatlantic.com
fix: incorrect parsing on theatlantic.com ( #475 )
2019-08-20 09:58:24 -07:00
www.theguardian.com
feat: aol custom extractor ( #42 )
2016-12-01 17:05:15 -08:00
www.thepennyhoarder.com
feat: thepennyhoarder parser ( #112 )
2017-02-03 08:56:15 -07:00
www.thepoliticalinsider.com
feat: the political insider parser ( #99 )
2017-02-03 16:25:16 -05:00
www.theverge.com
feat: extractor for the verge ( #33 )
2016-11-30 14:08:56 -08:00
www.tmz.com
feat: added tmz custom parser ( #22 )
2016-11-28 15:10:28 -08:00
www.today.com
feat: today parser ( #106 )
2017-02-06 09:20:12 -07:00
www.usmagazine.com
feat: usmagazine extractor ( #63 )
2016-12-21 20:06:47 -08:00
www.vox.com
feat: vox custom parser ( #67 )
2016-12-15 17:48:15 -08:00
www.washingtonpost.com
fix: author and date published selectors ( #189 )
2019-01-25 11:28:43 -08:00
www.westernjournalism.com
feat: westernjournalism parser ( #113 )
2017-02-03 11:15:50 -07:00
www.wired.com
feat: added wired custom extractor
2016-09-30 14:32:28 -04:00
www.yahoo.com
feat: added incomplete yahoo extractor
2016-10-03 17:48:11 -04:00
www.yomiuri.co.jp
feat: add www.yomiuri.co.jp custom parser ( #381 )
2019-04-24 11:00:56 +03:00
www.youtube.com
feat: youtube custom extractor ( #53 )
2016-12-06 12:36:51 -05:00
ars.html
feat: nextPageUrl handles multi-page articles
2016-09-13 10:08:49 -04:00
latimes.html
fix: brought .html fixtures into project dir
2016-09-08 11:07:51 -04:00
nytimes.html
fix: brought .html fixtures into project dir
2016-09-08 11:07:51 -04:00
vulture.html
fix: bug in scoring and converting to paragraphs
2016-09-14 10:15:36 -04:00
wired.html
fix: brought .html fixtures into project dir
2016-09-08 11:07:51 -04:00