247sports.com | feat: 247sports.com extractor (#64) | 2016-12-21 20:52:23 -08:00
abcnews.go.com | feat: abcnewsgo parser (#90) | 2017-02-02 17:43:35 -05:00
blisterreview.com | feat: add custom extractor for blisterreview.com (#299) | 2019-03-01 16:48:26 -08:00
bookwalker.jp | feat: add bookwalker.jp custom parser (#374) | 2019-04-15 11:06:10 +03:00
buzzap.jp | feat: add buzzap.jp custom parser (#351) | 2019-04-09 11:35:40 +03:00
clinicaltrials.gov | Custom Extractor for clinicaltrials.gov (#305) | 2019-05-27 09:25:51 +03:00
deadline.com | feat: add deadline.com custom parser (#383) | 2019-04-24 15:29:02 +03:00
deadspin.com | feat: support lazy loading video on deadspin | 2016-10-26 11:53:42 -07:00
fandom.wikia.com | feat: added wikia extractor | 2016-10-04 12:06:19 -04:00
fortune.com | feat: fortune parser (#84) | 2017-01-23 16:47:06 -08:00
forward.com | feat: forward.com parser (#144) | 2017-03-14 17:53:23 -04:00
fusion.net | feat: fusion parser | 2017-02-02 10:54:49 -07:00
genius.com | feat: custom genius parser. (#284) | 2019-04-09 12:49:24 +03:00
getnews.jp | feat: add getnews.jp custom parser (#402) | 2019-05-03 13:10:55 +03:00
github.com | Extract content from GitHub repos. (#306) | 2019-03-14 08:48:33 -07:00
gothamist.com | Feat: gothamist extractor (#151) | 2017-03-09 13:13:46 -05:00
hellogiggles.com | feat: hellogiggles parser (#107) | 2017-01-21 14:07:20 -05:00
ici.radio-canada.ca | feat: ici.radio-canada.ca extractor (#156) | 2017-03-13 17:23:20 -04:00
japan.cnet.com | feat: add japan.cnet.com custom parser (#382) | 2019-04-24 14:39:54 +03:00
japan.zdnet.com | feat: add japan.zdnet.com custom parser (#410) | 2019-05-08 13:51:03 +03:00
jvndb.jvn.jp | feat: add jvndb.jvn.jp custom parser (#345) | 2019-04-09 12:05:03 +03:00
mashable.com | feat: mashable parser (#76) | 2017-01-23 15:00:18 -08:00
medium.com | fix: medium bug (#129) | 2017-01-31 15:28:25 -08:00
money.cnn.com | feat: add money.cnn custom parser (#26) | 2016-11-29 15:13:29 -08:00
newrepublic.com | feat: new republic custom extractor (#25) | 2016-11-29 15:30:52 -08:00
news.mynavi.jp | feat: add news.mynavi.jp custom parser (#287) | 2019-03-01 16:45:32 -08:00
news.nationalgeographic.com | feat: news.natgeo parser (#88) | 2017-02-08 15:27:35 -07:00
nock | fix: return early if creating the resource failed. (#285) | 2019-02-20 16:48:51 -08:00
nymag.com | feat: improve nymag.com extractor to grab deks from features | 2016-09-14 13:12:40 -04:00
obamawhitehouse.archives.gov | feat: improve wh parser (#168) | 2017-03-24 14:41:40 -07:00
observer.com | feat: observer parser (#91) | 2017-01-21 12:47:26 -05:00
otrs.com | feat: add otrs.com custom parser (#353) | 2019-04-09 11:17:58 +03:00
pagesix.com | feat: pagesix parser (#97) | 2017-02-07 17:38:09 -05:00
people.com | feat: people extractor (#70) | 2016-12-21 19:46:48 -08:00
phpspot.org | feat: add phpspot.org custom parser (#369) | 2019-04-12 17:18:47 +03:00
pitchfork.com | feat: pitchfork extractor (#439) | 2019-06-26 09:02:17 -07:00
qz.com | feat: qz parser (#81) | 2017-01-23 16:08:07 -08:00
scan.netsecurity.ne.jp | feat: add scan.netsecurity.ne.jp custom parser (#347) | 2019-04-09 11:59:27 +03:00
sciencefly.com | feat: sciencefly extractor (#116) | 2017-02-02 11:26:29 -05:00
sect.iij.ad.jp | feat: add sect.iij.ad.jp custom parser (#404) | 2019-05-03 13:19:06 +03:00
takagi-hiromitsu.jp | feat: add takagi-hiromitsu.jp custom parser (#364) | 2019-04-12 18:11:05 +03:00
techlog.iij.ad.jp | feat: add techlog.iij.ad.jp custom parser (#405) | 2019-05-08 13:20:47 +03:00
thefederalistpapers.org | feat: thefederalistpapers parser (#101) | 2017-02-07 14:30:52 -05:00
thoughtcatalog.com | feat: thought catalog parser (#102) | 2017-01-21 13:52:00 -05:00
twitter.com | feat: generator for custom parsers and some documentation | 2016-09-20 10:37:03 -04:00
uproxx.com | uproxx extractor (#66) | 2016-12-21 21:05:10 -08:00
weekly.ascii.jp | feat: add weekly.ascii.jp custom parser (#401) | 2019-05-08 13:10:42 +03:00
wired.jp | feat: add wired.jp custom parser (#409) | 2019-05-08 13:32:04 +03:00
www.al.com | feat: al.com parser (#110) | 2017-02-03 11:45:45 -07:00
www.americanow.com | feat: america now parser (#114) | 2017-02-02 13:46:20 -07:00
www.androidcentral.com | feat: androidcentral parser (#119) | 2017-02-07 18:20:04 -05:00
www.aol.com | feat: aol custom extractor (#42) | 2016-12-01 17:05:15 -08:00
www.apartmenttherapy.com | feat: Add custom extrator for Apartment Therapy | 2016-10-17 10:35:22 -05:00
www.asahi.com | feat: add www.asahi.com custom parser (#350) | 2019-04-09 11:42:14 +03:00
www.bloomberg.com | feat: bloomberg extractor (#59) | 2016-12-07 14:39:00 -05:00
www.broadwayworld.com | feat: Add custom parser for broadwayworld.com | 2016-10-13 16:22:33 -05:00
www.bustle.com | feat: bustle extractor (#60) | 2016-12-08 15:32:08 -05:00
www.buzzfeed.com | Fix: extension bugs (#47) | 2016-12-02 16:02:00 -08:00
www.cbssports.com | feat: cbs sports parser (#98) | 2017-02-07 10:45:48 -05:00
www.chicagotribune.com | feat: chicago tribune parser (#75) | 2017-01-22 12:18:10 -05:00
www.cinemablend.com | feat: cinema blend parser (#105) | 2017-02-06 09:02:11 -07:00
www.cnbc.com | fix: Adapt CNBC extractor to article redesign (#336) | 2019-03-25 15:43:40 -07:00
www.cnet.com | feat: cnet parser (#104) | 2017-02-07 11:55:04 -07:00
www.cnn.com | Feat cnn extractor (#34) | 2016-11-30 14:55:04 -08:00
www.dmagazine.com | feat: dmagazine parser (#80) | 2017-01-23 15:52:05 -08:00
www.elecom.co.jp | feat: add www.elecom.co.jp custom parser (#348) | 2019-04-09 11:54:57 +03:00
www.eonline.com | feat: eonline parser (#68) | 2016-12-21 21:24:14 -08:00
www.fastcompany.com | feat: add fastcompany custom parser (#191) | 2019-01-30 09:30:24 +02:00
www.fool.com | feat: fool.com parser (#158) | 2017-03-14 18:04:19 -04:00
www.fortinet.com | feat: add fortinet custom parser (#188) | 2019-01-30 09:33:36 +02:00
www.gizmodo.jp | feat: add www.gizmodo.jp custom parser (#400) | 2019-05-03 13:06:51 +03:00
www.howtogeek.com | dx: comment on custom parser pr fix (#278) | 2019-02-28 11:11:03 -08:00
www.huffingtonpost.com | Feat: huffington post extractor (#28) | 2016-11-29 15:50:48 -08:00
www.infoq.com | feat: add www.infoq.com custom parser (#368) | 2019-04-12 17:30:46 +03:00
www.inquisitr.com | feat: inquisitor parser (#72) | 2017-01-18 16:34:22 -08:00
www.ipa.go.jp | feat: add www.ipa.go.jp custom parser (#408) | 2019-05-03 13:40:42 +03:00
www.itmedia.co.jp | feat: add www.itmedia.co.jp custom parser (#366) | 2019-04-12 17:51:16 +03:00
www.jnsa.org | feat: add www.jnsa.org custom parser (#346) | 2019-04-09 16:51:25 +03:00
www.latimes.com | feat: latimes parser (#92) | 2017-02-08 11:29:03 -05:00
www.lemonde.fr | feat: add le monde extractor (#415) | 2019-05-14 14:53:49 +03:00
www.lifehacker.jp | feat: add www.lifehacker.jp custom parser (#403) | 2019-05-03 13:14:53 +03:00
www.linkedin.com | Feat: LinkedIn parser (#123) | 2017-01-26 10:11:10 -08:00
www.littlethings.com | feat: added littlethings extractor | 2016-10-04 15:02:23 -04:00
www.macrumors.com | feat: macrumors parser (#120) | 2017-02-07 19:15:29 -05:00
www.mentalfloss.com | feat: mental floss parser (#94) | 2017-02-03 11:40:01 -05:00
www.miamiherald.com | feat: miami herald parser (#69) | 2016-12-21 21:35:34 -08:00
www.moongift.jp | feat: add www.moongift.jp custom parser (#367) | 2019-04-12 17:40:55 +03:00
www.msn.com | feat: added incomplete msn extractor | 2016-10-03 13:27:51 -04:00
www.msnbc.com | feat: msnbc parser (#100) | 2017-02-06 18:08:49 -05:00
www.nationalgeographic.com | feat: natgeo parser (#89) | 2017-02-08 15:01:55 -07:00
www.nbcnews.com | feat: nbc news parser (#74) | 2017-01-18 17:28:21 -08:00
www.newyorker.com | fix: new yorker extractor (#414) | 2019-05-15 11:00:50 +03:00
www.nj.com | feat: nj.com parser (#73) | 2017-01-18 16:49:05 -08:00
www.npr.org | feat: npr parser (#86) | 2017-01-23 17:23:02 -08:00
www.nydailynews.com | feat: ny daily news parser (#87) | 2017-02-02 12:30:16 -05:00
www.nytimes.com | fix: nytimes custom parser title selector (#181) | 2018-10-12 13:39:41 -07:00
www.opposingviews.com | feat: opposing views parser (#103) | 2017-02-06 12:22:42 -05:00
www.oreilly.co.jp | feat: add www.oreilly.co.jp custom parser (#407) | 2019-05-03 13:30:48 +03:00
www.ossnews.jp | feat: add www.ossnews.jp custom parser (#352) | 2019-04-09 11:30:56 +03:00
www.phoronix.com | feat: custom parser for phoronix.com. (#431) | 2019-06-26 09:55:13 -07:00
www.politico.com | feat: added politico extractor | 2016-10-05 13:51:11 -04:00
www.popsugar.com | feat: popsugar parser (#93) | 2017-01-21 13:11:00 -05:00
www.prospectmagazine.co.uk | feat: prospect magazine parser (#147) | 2017-03-14 18:34:40 -04:00
www.publickey1.jp | feat: add www.publickey1.jp custom parser (#365) | 2019-04-12 18:00:51 +03:00
www.qdaily.com | feat: qdaily parser (#146) | 2017-03-14 17:37:53 -04:00
www.rawstory.com | feat: rawstory parser (#109) | 2017-02-07 12:53:05 -07:00
www.rbbtoday.com | feat: add rbbtoday.com custom parser (#411) | 2019-05-08 14:04:03 +03:00
www.recode.net | feat: recode parser (#85) | 2017-01-23 17:02:33 -08:00
www.reddit.com | feat: Add custom parser for Reddit (#307) | 2019-03-08 14:37:24 -08:00
www.refinery29.com | feat: refinery29 parser (#71) | 2016-12-21 21:57:13 -08:00
www.reuters.com | feat: reuters parser (#78) | 2017-01-23 15:16:37 -08:00
www.rollingstone.com | feat: rolling stone extractor (#65) | 2016-12-21 20:30:34 -08:00
www.sanwa.co.jp | feat: add www.sanwa.co.jp custom parser (#349) | 2019-04-09 11:50:48 +03:00
www.sbnation.com | feat: sbnation extractor (#55) | 2016-12-07 14:25:57 -05:00
www.si.com | feat: si parser (#118) | 2017-02-07 16:52:11 -05:00
www.slate.com | Feat: Slate extractor (#153) | 2017-03-13 17:44:04 -04:00
www.theatlantic.com | feat: generator for custom parsers and some documentation | 2016-09-20 10:37:03 -04:00
www.theguardian.com | feat: aol custom extractor (#42) | 2016-12-01 17:05:15 -08:00
www.thepennyhoarder.com | feat: thepennyhoarder parser (#112) | 2017-02-03 08:56:15 -07:00
www.thepoliticalinsider.com | feat: the political insider parser (#99) | 2017-02-03 16:25:16 -05:00
www.theverge.com | feat: extractor for the verge (#33) | 2016-11-30 14:08:56 -08:00
www.tmz.com | feat: added tmz custom parser (#22) | 2016-11-28 15:10:28 -08:00
www.today.com | feat: today parser (#106) | 2017-02-06 09:20:12 -07:00
www.usmagazine.com | feat: usmagazine extractor (#63) | 2016-12-21 20:06:47 -08:00
www.vox.com | feat: vox custom parser (#67) | 2016-12-15 17:48:15 -08:00
www.washingtonpost.com | fix: author and date published selectors (#189) | 2019-01-25 11:28:43 -08:00
www.westernjournalism.com | feat: westernjournalism parser (#113) | 2017-02-03 11:15:50 -07:00
www.wired.com | feat: added wired custom extractor | 2016-09-30 14:32:28 -04:00
www.yahoo.com | feat: added incomplete yahoo extractor | 2016-10-03 17:48:11 -04:00
www.yomiuri.co.jp | feat: add www.yomiuri.co.jp custom parser (#381) | 2019-04-24 11:00:56 +03:00
www.youtube.com | feat: youtube custom extractor (#53) | 2016-12-06 12:36:51 -05:00
ars.html | feat: nextPageUrl handles multi-page articles | 2016-09-13 10:08:49 -04:00
latimes.html | fix: brought .html fixtures into project dir | 2016-09-08 11:07:51 -04:00
nytimes.html | fix: brought .html fixtures into project dir | 2016-09-08 11:07:51 -04:00
vulture.html | fix: bug in scoring and converting to paragraphs | 2016-09-14 10:15:36 -04:00
wired.html | fix: brought .html fixtures into project dir | 2016-09-08 11:07:51 -04:00