mercury-parser/fixtures
Janet 2279c2d486 feat: natgeo parser (#89)
* feat: natgeo parser

Same as the news.nationalgeographic.com parser - for some reason the
author name doesn’t appear to be getting pulled into the local copy of
the file.

* fix: content assertion

* fix: generalize author byline

* disable: author assertion

* rm: author assertion

* fix: image lead, handles image-group

* fix: guard against missing img url

* fix: generalize dek and title selectors
2017-02-08 15:01:55 -07:00
247sports.com feat: 247sports.com extractor (#64) 2016-12-21 20:52:23 -08:00
abcnews.go.com feat: abcnewsgo parser (#90) 2017-02-02 17:43:35 -05:00
deadspin.com feat: support lazy loading video on deadspin 2016-10-26 11:53:42 -07:00
fandom.wikia.com feat: added wikia extractor 2016-10-04 12:06:19 -04:00
fortune.com feat: fortune parser (#84) 2017-01-23 16:47:06 -08:00
fusion.net feat: fusion parser 2017-02-02 10:54:49 -07:00
hellogiggles.com feat: hellogiggles parser (#107) 2017-01-21 14:07:20 -05:00
mashable.com feat: mashable parser (#76) 2017-01-23 15:00:18 -08:00
medium.com fix: medium bug (#129) 2017-01-31 15:28:25 -08:00
money.cnn.com feat: add money.cnn custom parser (#26) 2016-11-29 15:13:29 -08:00
newrepublic.com feat: new republic custom extractor (#25) 2016-11-29 15:30:52 -08:00
nock feat: encoding response body based on content-type charset (#21) 2016-11-22 10:44:27 -08:00
nymag.com feat: improve nymag.com extractor to grab deks from features 2016-09-14 13:12:40 -04:00
obamawhitehouse.archives.gov feat: custom parser for wh blog (#130) 2017-01-31 15:50:39 -08:00
observer.com feat: observer parser (#91) 2017-01-21 12:47:26 -05:00
pagesix.com feat: pagesix parser (#97) 2017-02-07 17:38:09 -05:00
people.com feat: people extractor (#70) 2016-12-21 19:46:48 -08:00
qz.com feat: qz parser (#81) 2017-01-23 16:08:07 -08:00
sciencefly.com feat: sciencefly extractor (#116) 2017-02-02 11:26:29 -05:00
thefederalistpapers.org feat: thefederalistpapers parser (#101) 2017-02-07 14:30:52 -05:00
thoughtcatalog.com feat: thought catalog parser (#102) 2017-01-21 13:52:00 -05:00
twitter.com feat: generator for custom parsers and some documentation 2016-09-20 10:37:03 -04:00
uproxx.com uproxx extractor (#66) 2016-12-21 21:05:10 -08:00
www.al.com feat: al.com parser (#110) 2017-02-03 11:45:45 -07:00
www.americanow.com feat: america now parser (#114) 2017-02-02 13:46:20 -07:00
www.androidcentral.com feat: androidcentral parser (#119) 2017-02-07 18:20:04 -05:00
www.aol.com feat: aol custom extractor (#42) 2016-12-01 17:05:15 -08:00
www.apartmenttherapy.com feat: Add custom extractor for Apartment Therapy 2016-10-17 10:35:22 -05:00
www.bloomberg.com feat: bloomberg extractor (#59) 2016-12-07 14:39:00 -05:00
www.broadwayworld.com feat: Add custom parser for broadwayworld.com 2016-10-13 16:22:33 -05:00
www.bustle.com feat: bustle extractor (#60) 2016-12-08 15:32:08 -05:00
www.buzzfeed.com Fix: extension bugs (#47) 2016-12-02 16:02:00 -08:00
www.cbssports.com feat: cbs sports parser (#98) 2017-02-07 10:45:48 -05:00
www.chicagotribune.com feat: chicago tribune parser (#75) 2017-01-22 12:18:10 -05:00
www.cinemablend.com feat: cinema blend parser (#105) 2017-02-06 09:02:11 -07:00
www.cnbc.com feat: cnbc parser (#96) 2017-01-21 13:25:23 -05:00
www.cnet.com feat: cnet parser (#104) 2017-02-07 11:55:04 -07:00
www.cnn.com Feat cnn extractor (#34) 2016-11-30 14:55:04 -08:00
www.dmagazine.com feat: dmagazine parser (#80) 2017-01-23 15:52:05 -08:00
www.eonline.com feat: eonline parser (#68) 2016-12-21 21:24:14 -08:00
www.howtogeek.com feat: howtogeek extractor (#108) 2017-02-06 15:23:15 -07:00
www.huffingtonpost.com Feat: huffington post extractor (#28) 2016-11-29 15:50:48 -08:00
www.inquisitr.com feat: inquisitor parser (#72) 2017-01-18 16:34:22 -08:00
www.latimes.com feat: latimes parser (#92) 2017-02-08 11:29:03 -05:00
www.linkedin.com Feat: LinkedIn parser (#123) 2017-01-26 10:11:10 -08:00
www.littlethings.com feat: added littlethings extractor 2016-10-04 15:02:23 -04:00
www.macrumors.com feat: macrumors parser (#120) 2017-02-07 19:15:29 -05:00
www.mentalfloss.com feat: mental floss parser (#94) 2017-02-03 11:40:01 -05:00
www.miamiherald.com feat: miami herald parser (#69) 2016-12-21 21:35:34 -08:00
www.msn.com feat: added incomplete msn extractor 2016-10-03 13:27:51 -04:00
www.msnbc.com feat: msnbc parser (#100) 2017-02-06 18:08:49 -05:00
www.nationalgeographic.com feat: natgeo parser (#89) 2017-02-08 15:01:55 -07:00
www.nbcnews.com feat: nbc news parser (#74) 2017-01-18 17:28:21 -08:00
www.newyorker.com feat: improvements for nyer magazine articles (#45) 2016-12-02 15:30:09 -08:00
www.nj.com feat: nj.com parser (#73) 2017-01-18 16:49:05 -08:00
www.npr.org feat: npr parser (#86) 2017-01-23 17:23:02 -08:00
www.nydailynews.com feat: ny daily news parser (#87) 2017-02-02 12:30:16 -05:00
www.nytimes.com feat: generator for custom parsers and some documentation 2016-09-20 10:37:03 -04:00
www.opposingviews.com feat: opposing views parser (#103) 2017-02-06 12:22:42 -05:00
www.politico.com feat: added politico extractor 2016-10-05 13:51:11 -04:00
www.popsugar.com feat: popsugar parser (#93) 2017-01-21 13:11:00 -05:00
www.rawstory.com feat: rawstory parser (#109) 2017-02-07 12:53:05 -07:00
www.recode.net feat: recode parser (#85) 2017-01-23 17:02:33 -08:00
www.refinery29.com feat: refinery29 parser (#71) 2016-12-21 21:57:13 -08:00
www.reuters.com feat: reuters parser (#78) 2017-01-23 15:16:37 -08:00
www.rollingstone.com feat: rolling stone extractor (#65) 2016-12-21 20:30:34 -08:00
www.sbnation.com feat: sbnation extractor (#55) 2016-12-07 14:25:57 -05:00
www.si.com feat: si parser (#118) 2017-02-07 16:52:11 -05:00
www.theatlantic.com feat: generator for custom parsers and some documentation 2016-09-20 10:37:03 -04:00
www.theguardian.com feat: aol custom extractor (#42) 2016-12-01 17:05:15 -08:00
www.thepennyhoarder.com feat: thepennyhoarder parser (#112) 2017-02-03 08:56:15 -07:00
www.thepoliticalinsider.com feat: the political insider parser (#99) 2017-02-03 16:25:16 -05:00
www.theverge.com feat: extractor for the verge (#33) 2016-11-30 14:08:56 -08:00
www.tmz.com feat: added tmz custom parser (#22) 2016-11-28 15:10:28 -08:00
www.today.com feat: today parser (#106) 2017-02-06 09:20:12 -07:00
www.usmagazine.com feat: usmagazine extractor (#63) 2016-12-21 20:06:47 -08:00
www.vox.com feat: vox custom parser (#67) 2016-12-15 17:48:15 -08:00
www.washingtonpost.com Fix extension bugs (#23) 2016-11-28 16:58:21 -08:00
www.westernjournalism.com feat: westernjournalism parser (#113) 2017-02-03 11:15:50 -07:00
www.wired.com feat: added wired custom extractor 2016-09-30 14:32:28 -04:00
www.yahoo.com feat: added incomplete yahoo extractor 2016-10-03 17:48:11 -04:00
www.youtube.com feat: youtube custom extractor (#53) 2016-12-06 12:36:51 -05:00
ars.html feat: nextPageUrl handles multi-page articles 2016-09-13 10:08:49 -04:00
latimes.html fix: brought .html fixtures into project dir 2016-09-08 11:07:51 -04:00
nytimes.html fix: brought .html fixtures into project dir 2016-09-08 11:07:51 -04:00
vulture.html fix: bug in scoring and converting to paragraphs 2016-09-14 10:15:36 -04:00
wired.html fix: brought .html fixtures into project dir 2016-09-08 11:07:51 -04:00