mercury-parser

mirror of https://github.com/postlight/mercury-parser synced 2024-10-31 03:20:40 +00:00

Author	SHA1	Message	Date
Adam Pash	afd313900b	change ci image	2019-01-16 14:01:06 -08:00
Adam Pash	e65db47597	ignore deleted fixtures	2019-01-16 13:47:33 -08:00
Adam Pash	78af226e86	fix results processing	2019-01-16 10:46:48 -08:00
Adam Pash	d53116fdae	make dir for artifacts	2019-01-16 10:31:01 -08:00
Adam Pash	208a4421b4	change node for CI docker image	2019-01-16 10:26:01 -08:00
Adam Pash	27e31b8cc6	getting test results in the comment	2019-01-16 10:21:55 -08:00
Adam Pash	b526a4eca3	try to output test artifact	2019-01-15 16:41:20 -08:00
Postlight Bot	9e69501ad2	add delay between pushing branch and creating pr troubleshooting CI not triggering	2019-01-15 16:26:44 -08:00
Postlight Bot	c9dcb514d8	fix find-and-replace for linux sed	2019-01-15 16:21:18 -08:00
Adam Pash	0a62ea0965	sort by oldest fixture	2019-01-15 16:06:57 -08:00
Adam Pash	78adb2c2a0	fix: auto-pr (#199 )	2019-01-15 15:58:17 -08:00
Adam Pash	c643666c88	dx: automate fixture updates (#197 )	2019-01-15 15:41:18 -08:00
Adam Pash	bc23b8b7ea	dx: one-line comment links (#195 )	2019-01-14 15:01:46 -08:00
Adam Pash	c0676423be	dx: add image to preview and link to original article (#194 )	2019-01-14 11:34:34 -08:00
Adam Pash	ff144952b9	dx: test/finish bot preview	2019-01-14 11:18:32 -08:00
Adam Pash	d35f7bd5bf	dx: comment on PRs when fixtures have been added/changed (#192 ) The goal here is to provide some sort of relatively easy preview for the PR reviewer to see if the fixture looks good, if the parsing is working, and to make suggestions easily.	2019-01-11 13:58:28 -08:00
Adam Pash	96640e3564	fix: failing fetchResource test (#187 ) I think was a fixture problem	2018-12-20 10:06:16 -08:00
Adam Pash	4478338046	docs: document release process (#186 )	2018-12-20 09:30:47 -08:00
Adam Pash	a7fd0e8dda	dx: add nvmrc file (#185 ) The node version should not be higher than the node version we're using on AWS with the Mercury Parser API	2018-12-20 09:13:27 -08:00
Adam Pash	d850177b68	docs: Update README.md (#184 )	2018-12-20 09:04:01 -08:00
Adam Pash	fd6c9d4fa3	release: 1.0.13 (#183 )	2018-10-12 15:01:42 -07:00
Adam Pash	0c15e9aad3	chore: update circle config.yml to 2.0 (#182 )	2018-10-12 14:17:11 -07:00
Adam Pash	5663660f76	fix: nytimes custom parser title selector (#181 ) * fix: nytimes custom parser title selector * upgrade node version * circle ci tweak	2018-10-12 13:39:41 -07:00
Adam Pash	7fcd9b62eb	release: 1.0.12 (#173 )	2017-04-10 16:10:52 -07:00
Jeremy Mack	5fcea1c5c3	fix: PARSING_NODE undefined (#172 ) * fix: PARSING_NODE undefined * chore: remove unused cleanup function/call	2017-04-10 15:55:21 -07:00
Adam Pash	a51cc81c27	release: 1.0.11 (#171 )	2017-04-10 14:57:32 -07:00
Jeremy Mack	e92e798880	fix: viewport tags leaking to parent page (#170 ) * fix: scrub meta viewport tags They leak to the parent page when using the web version of Mercury Parser. * chore: build * fix: keep DOM in memory to avoid conflicts	2017-04-10 14:35:23 -07:00
Adam Pash	86d6bd1dc1	release: 1.0.10 (#169 )	2017-03-24 15:24:06 -07:00
Adam Pash	b8aa87c777	feat: improve wh parser (#168 )	2017-03-24 14:41:40 -07:00
Adam Pash	e56e8e24cd	release: 1.0.9 (#167 )	2017-03-23 13:39:46 -07:00
Adam Pash	61f0f4e1af	fix: kept elements being removed (#166 ) Elements marked to keep were removeable under specific circumstances. This PR fixes these edge cases.	2017-03-23 13:16:21 -07:00
Adam Pash	5741910fdc	docs: update changelog (#165 )	2017-03-22 14:47:20 -07:00
Adam Pash	321c087be6	release: 1.0.8 (#164 )	2017-03-22 14:08:22 -07:00
Adam Pash	453419de72	feat: improve wh.gov parser (#163 ) * feat: support youtube-nocookie domain * feat: updated wh.gov parser to support speeches	2017-03-22 13:16:54 -07:00
Adam Pash	e267d57d78	release: 1.0.7 (#160 )	2017-03-15 09:16:04 -07:00
Janet	f13bb721f6	feat: prospect magazine parser (#147 ) * feat: prospect magazine parser Couldn’t find a way to parse the date but I think it’s good otherwise. * fix: pulls date * fix: add timezone * fix: generalize	2017-03-14 18:34:40 -04:00
Kevin Ngao	1b28713cf5	feat: fool.com parser (#158 ) * feat: add fool.com custom parser	2017-03-14 18:04:19 -04:00
Janet	c18959779d	feat: forward.com parser (#144 ) * feat: forward.com parser LGTM although image didn’t show up in preview * feat: also pull imge into content * fix: generalize selectors * fix: generalize selector	2017-03-14 17:53:23 -04:00
Janet	50e548bac2	feat: qdaily parser (#146 ) * feat: qdaily parser Firstly — I accidentally tried to generate the parser on the master branch, and I’m not sure where it is, maybe floating in the nether world. On to the parser — this one was a bit tricky because things were in Chinese! The content appears to be parsing (as seen in preview) but it’s not passing the test. I noticed the second “ ‘ “ mark isn’t appearing on the parser side. Additionally, some of the lazy loading images aren’t appearing in the preview (I cleaned the wrong lazy load images that appeared), so someone will probably have to work on that (I don’t know how to do transforms yet). * fix tests * fix: selector generalization	2017-03-14 17:37:53 -04:00
Silas Burton	51a4d1d12f	feat: newrepublic parser shows image on page (#159 )	2017-03-14 14:07:45 -04:00
Silas Burton	11382ce651	Feat: Slate extractor (#153 ) * feat: slate extractor * fix: generalize selectors * fix: add Slate timezone	2017-03-13 17:44:04 -04:00
Silas Burton	5acaa6ab56	feat: ici.radio-canada.ca extractor (#156 ) * feat: ici.radio-canada.ca extractor * fix: add timezone	2017-03-13 17:23:20 -04:00
Silas Burton	4509b341e6	feat: better cleanup of atlantic articles (#157 )	2017-03-13 17:14:58 -04:00
Kevin Ngao	f2e3f055c2	Fixes an issue with encoding (#154 ) * fix: fixes an issue with encoding on the fetch level	2017-03-10 17:40:31 -05:00
Silas Burton	9b371e51ac	Feat: gothamist extractor (#151 ) * feat: gothamist extractor * feat: add other gothamist network sites * fix: try getting date another way * fix: add gothamist timezone * fix: generalize selectors * fix: h1 is inside entry-header, needs to be specific because of another h1 on the page * fix: general and specific selector	2017-03-09 13:13:46 -05:00
Kevin Ngao	afbef9bc39	Fix Encoding on Body (#143 ) * fix: check encoding on body	2017-03-06 11:36:56 -05:00
Adam Pash	9d4c883d51	release: 1.0.6 (#142 )	2017-02-09 08:58:49 -08:00
Janet	93d2baf5cf	feat: news.natgeo parser (#88 ) * feat: natgeo parser For some reason, the local copy of the article didn’t grab the author name in it, so I couldn’t figure out how to parse it. The generic parser took a name of an author of a paper mentioned in the article, and thought that was the author name, which was funny. I cleaned a large block quote that didn’t make sense as it was shown in the preview, although I noticed that the Mercury chrome extension didn’t even display it. * fix: add date_published transform * fix: date_published assertion * disable: author assertion, generlize author selector * rm: author assertion * fix: image lead * fix: guard agaist missing img url * fix: generalize dek and title selectors	2017-02-08 15:27:35 -07:00
Janet	2279c2d486	feat: natgeo parser (#89 ) * feat: natgeo parser Same as the news.nationalgeographic.com parser - for some reason the author name doesn’t appear to be getting pulled into the local copy of the file. * fix: content assertion * fix: generalize author byline * disable: author assertion * rm: author assertion * fix: image lead, handles image-group * fix: guard agaist missing img url * fix: generalize dek and title selectors	2017-02-08 15:01:55 -07:00
Adam Pash	08b5bb7ff1	feat: allow parser to define custom date formats (#141 ) * feat: allow parser to define custom date formats * feat: updating macrumors to test/verify format working correctly	2017-02-08 11:00:17 -07:00

1 2 3 4 5 ...

338 Commits