Mike Ashley
90ccbe09ea
dx: upgrading rollup-related packages
2019-09-05 11:20:15 -07:00
Michael Ashley
e12c916499
feat: ability to add custom extractors via api ( #484 )
...
* feat: ability to add custom extractors via api
* docs: updating readme
* fix: example.com was being used in another test
* fix: timezone was messing up date_published test
* fix: using a unique site for testing
* fix: updated custom extractor api
* docs: updating readme
* fix: removing unused fixture
* fix: updating test description
* feat: ability to add custom extractors via cli
2019-09-04 07:32:28 -07:00
Sven Wiegand
f95947fe88
Implemented custom extractor epaper.zeit.de ( #488 )
2019-08-28 07:15:14 -07:00
Michael Ashley
2422e4717d
fix: incorrect parsing on medium.com ( #477 )
...
* fix: medium extractor now pulls content
* fix: remove youtube caption if no preview available
* fix: remove youtube node if no image
* fix: removing dek from medium.com extractor
2019-08-28 07:04:27 -07:00
greenkeeper[bot]
2bed238b68
chore(package): update inquirer to version 7.0.0 ( #479 )
2019-08-23 07:01:59 -07:00
greenkeeper[bot]
869e44a69f
chore(package): update karma-chrome-launcher to version 3.0.0 ( #458 )
2019-08-21 14:35:23 -07:00
greenkeeper[bot]
e4a7a288e5
chore(package): update eslint-config-prettier to version 6.1.0 ( #476 )
2019-08-21 11:50:11 -07:00
Malo Bourgon
2173c4cf83
deps: Update wuzzy to fix vulnerability ( #462 )
2019-08-21 11:12:17 -07:00
Jakob Fix
a918a9d6fa
doc: correct link that points to wrong line ( #469 )
2019-08-21 10:10:10 -07:00
Michael Ashley
0686ee7956
fix: incorrect parsing on theatlantic.com ( #475 )
...
* fix: incorrect parsing on theatlantic.com
* chore: updating theatlantic.com tests & fixtures
* chore: removing script data from minified fixture
2019-08-20 09:58:24 -07:00
Michael Ashley
5e33263d25
chore: minifying biorxiv.com fixture ( #478 )
2019-08-20 09:46:15 -07:00
david0leong
911b0f87c8
Add custom extractor for biorxiv.org ( #467 )
...
* Add custom extractor for biorxiv.org
* Fix content selector
* Improve content selector
2019-08-19 13:46:03 -07:00
Jakob Fix
76d59f2d58
doc: correct internal page links ( #470 )
...
Specifically, to the cleaning content and using transform sections.
2019-08-16 14:41:46 -07:00
dependabot[bot]
398cba4d66
chore(deps): bump lodash.merge from 4.6.1 to 4.6.2 ( #456 )
...
Bumps [lodash.merge](https://github.com/lodash/lodash) from 4.6.1 to 4.6.2.
- [Release notes](https://github.com/lodash/lodash/releases)
- [Commits](https://github.com/lodash/lodash/commits)
Signed-off-by: dependabot[bot] <support@github.com>
2019-08-16 14:26:03 -07:00
dependabot[bot]
90e208ea13
chore(deps): bump cached-path-relative from 1.0.0 to 1.0.2 ( #472 )
...
Bumps [cached-path-relative](https://github.com/ashaffer/cached-path-relative) from 1.0.0 to 1.0.2.
- [Release notes](https://github.com/ashaffer/cached-path-relative/releases)
- [Commits](https://github.com/ashaffer/cached-path-relative/commits)
Signed-off-by: dependabot[bot] <support@github.com>
2019-08-16 14:22:57 -07:00
dependabot[bot]
5bb7c58e95
chore(deps): bump merge from 1.2.0 to 1.2.1 ( #473 )
...
Bumps [merge](https://github.com/yeikos/js.merge) from 1.2.0 to 1.2.1.
- [Release notes](https://github.com/yeikos/js.merge/releases)
- [Commits](https://github.com/yeikos/js.merge/compare/v1.2.0...v1.2.1)
Signed-off-by: dependabot[bot] <support@github.com>
2019-08-16 14:19:29 -07:00
greenkeeper[bot]
ce572f3a28
chore(package): update brfs-babel to version 2.0.0 ( #461 )
2019-08-16 12:25:12 -07:00
greenkeeper[bot]
6f65702a6c
Update moment-timezone to the latest version 🚀 ( #455 )
...
* fix(package): update moment-timezone to version 0.5.26
* chore(package): update lockfile yarn.lock
2019-08-16 12:20:47 -07:00
greenkeeper[bot]
c764cebc0c
chore(package): update remark-cli to version 7.0.0 ( #460 )
2019-08-16 12:16:46 -07:00
greenkeeper[bot]
853e041d84
deps: update husky to the latest version 🚀 ( #450 )
...
* chore(package): update husky to version 3.0.0
* chore(package): update lockfile yarn.lock
2019-07-03 09:44:02 -07:00
greenkeeper[bot]
f42f81218b
deps: update iconv-lite to the latest version 🚀 ( #447 )
...
* fix(package): update iconv-lite to version 0.5.0
* chore(package): update lockfile yarn.lock
2019-07-03 09:34:54 -07:00
Kirill Danshin
592f175270
tests: remove a duplicate test ( #448 )
2019-07-03 09:30:10 -07:00
Adam Pash
713de25751
release: 2.1.1 ( #446 )
2019-06-26 13:36:55 -07:00
greenkeeper[bot]
c11b85f405
deps: update eslint-config-prettier to version 5.0.0 ( #441 )
2019-06-26 10:32:27 -07:00
Jaen
3b0d5fed69
chore: prevent adding phantomjs-prebuilt as a dependency in CI. ( #412 )
...
Fixes #384.
2019-06-26 10:22:23 -07:00
Toufic Mouallem
939d181951
fix: support query strings in lazy-loaded srcsets ( #387 )
2019-06-26 10:13:58 -07:00
Ben Ubois
0942c37876
feat: custom parser for phoronix.com. ( #431 )
2019-06-26 09:55:13 -07:00
Michael P. Geraci
571a913745
feat: pitchfork extractor ( #439 )
...
* generate the custom extractor and get the first test to pass
* add the basic extractors (title, author, date, etc)
* select the score as well as the review text, and break the content test
* prepend the score to the content
* get the date from the datetime attribute
* mangle this test a little, but just a little (it does work properly)
* move from prepending the score to the review text to adding it as a custom field in the extractor
2019-06-26 09:02:17 -07:00
greenkeeper[bot]
c8a66b0d77
deps: Update moment-timezone to the latest version 🚀 ( #388 )
...
* fix(package): update moment-timezone to version 0.5.24
* chore(package): update lockfile yarn.lock
2019-06-17 14:13:18 -07:00
dependabot[bot]
255da63e26
deps: bump handlebars from 4.0.6 to 4.1.2 ( #434 )
...
Bumps [handlebars](https://github.com/wycats/handlebars.js) from 4.0.6 to 4.1.2.
- [Release notes](https://github.com/wycats/handlebars.js/releases)
- [Changelog](https://github.com/wycats/handlebars.js/blob/master/release-notes.md)
- [Commits](https://github.com/wycats/handlebars.js/compare/v4.0.6...v4.1.2)
Signed-off-by: dependabot[bot] <support@github.com>
2019-06-17 14:09:47 -07:00
dependabot[bot]
c7abfc25c6
chore(deps): bump sshpk from 1.10.1 to 1.16.1 ( #435 )
...
Bumps [sshpk](https://github.com/joyent/node-sshpk) from 1.10.1 to 1.16.1.
- [Release notes](https://github.com/joyent/node-sshpk/releases)
- [Commits](https://github.com/joyent/node-sshpk/commits/v1.16.1)
Signed-off-by: dependabot[bot] <support@github.com>
2019-06-12 14:47:57 -07:00
david0leong
694ea820aa
Custom Extractor for clinicaltrials.gov ( #305 )
...
* Add prototype of custom extractor for clinicaltrials.gov
* Add .DS_Store to gitignore
* Make tests for title, author and date_published selectors pass
* Make content selector test pass
* Fix date_published test
* Rebuild
* Remove .DS_Store from gitignore
* Improve extractor and test/fixture of clinicaltrials.gov
2019-05-27 09:25:51 +03:00
Toufic Mouallem
a7cd9027e2
chore: update husky to version 2.3.0 ( #422 )
...
* chore(package): update husky to version 2.3.0
Closes #395
* chore(package): update lockfile yarn.lock
2019-05-20 16:45:22 +03:00
Gina Trapani
9f6f07508c
docs: Add links to README
2019-05-19 13:30:16 -04:00
Toufic Mouallem
3414ebaa62
chore: update jquery to version 3.4.1 ( #420 )
2019-05-16 11:34:47 +03:00
Wajeeh Zantout
7c8de71c52
fix: new yorker extractor ( #414 )
...
* fix: new yorker extractor
* fix: date_published selector
* fix: remove footer from content
* feat: add additional selector for title
* feat: support article with multiple authors
2019-05-15 11:00:50 +03:00
Wajeeh Zantout
e66ad8b81c
feat: add le monde extractor ( #415 )
2019-05-14 14:53:49 +03:00
kik0220
f81dc63617
feat: add rbbtoday.com custom parser ( #411 )
...
* feat: add rbbtoday.com custom parser
* fix: content test
* fix: dek and content
2019-05-08 14:04:03 +03:00
kik0220
5e1113b3a9
feat: add japan.zdnet.com custom parser ( #410 )
...
* feat: add japan.zdnet.com custom parser
* fix: author and date_published selector
2019-05-08 13:51:03 +03:00
kik0220
77e3bc00e2
feat: add wired.jp custom parser ( #409 )
...
* feat: add wired.jp custom parser
* fix: author test
* fix: date_published selector
* test: fix dek and content
* test: fix content (without clean dek)
2019-05-08 13:32:04 +03:00
kik0220
0b36c96de0
feat: add techlog.iij.ad.jp custom parser ( #405 )
...
* feat: add techlog.iij.ad.jp custom parser
* fix: date_published and content selector
2019-05-08 13:20:47 +03:00
kik0220
406bf1b1a9
feat: add weekly.ascii.jp custom parser ( #401 )
...
* feat: add weekly.ascii.jp custom parser
* fix: title and date_published selector
2019-05-08 13:10:42 +03:00
kik0220
216bfade00
feat: add www.ipa.go.jp custom parser ( #408 )
2019-05-03 13:40:42 +03:00
kik0220
3ae8f3bde3
feat: add www.oreilly.co.jp custom parser ( #407 )
2019-05-03 13:30:48 +03:00
kik0220
7396e81b72
feat: add sect.iij.ad.jp custom parser ( #404 )
2019-05-03 13:19:06 +03:00
kik0220
3f1d9030ee
feat: add www.lifehacker.jp custom parser ( #403 )
2019-05-03 13:14:53 +03:00
kik0220
b077000c4a
feat: add getnews.jp custom parser ( #402 )
2019-05-03 13:10:55 +03:00
kik0220
b5425c3e8a
feat: add www.gizmodo.jp custom parser ( #400 )
2019-05-03 13:06:51 +03:00
kik0220
a38c727a0a
feat: add deadline.com custom parser ( #383 )
...
* feat: add deadline.com custom parser
* fix: timezone
* fix: date_published selectors
* fix: title and author selector
* test: transform .embed-twitter
* fix: regenerate the fixture and fix content selector
2019-04-24 15:29:02 +03:00
kik0220
74a3c49a3c
feat: add japan.cnet.com custom parser ( #382 )
...
* feat: add japan.cnet.com custom parser
* fix: remove transform
2019-04-24 14:39:54 +03:00