492 Commits

Author SHA1 Message Date
Mike Ashley
90ccbe09ea dx: upgrading rollup-related packages 2019-09-05 11:20:15 -07:00
Michael Ashley
e12c916499 feat: ability to add custom extractors via api (#484)
* feat: ability to add custom extractors via api

* docs: updating readme

* fix: example.com was being used in another test

* fix: timezone was messing up date_published test

* fix: using a unique site for testing

* fix: updated custom extractor api

* docs: updating readme

* fix: removing unused fixture

* fix: updating test description

* feat: ability to add custom extractors via cli
2019-09-04 07:32:28 -07:00
Sven Wiegand
f95947fe88 Implemented custom extractor epaper.zeit.de (#488) 2019-08-28 07:15:14 -07:00
Michael Ashley
2422e4717d fix: incorrect parsing on medium.com (#477)
* fix: medium extractor now pulls content

* fix: remove youtube caption if no preview available

* fix: remove youtube node if no image

* fix: removing dek from medium.com extractor
2019-08-28 07:04:27 -07:00
greenkeeper[bot]
2bed238b68 chore(package): update inquirer to version 7.0.0 (#479) 2019-08-23 07:01:59 -07:00
greenkeeper[bot]
869e44a69f chore(package): update karma-chrome-launcher to version 3.0.0 (#458) 2019-08-21 14:35:23 -07:00
greenkeeper[bot]
e4a7a288e5 chore(package): update eslint-config-prettier to version 6.1.0 (#476) 2019-08-21 11:50:11 -07:00
Malo Bourgon
2173c4cf83 deps: Update wuzzy to fix vulnerability (#462) 2019-08-21 11:12:17 -07:00
Jakob Fix
a918a9d6fa doc: correct link that points to wrong line (#469) 2019-08-21 10:10:10 -07:00
Michael Ashley
0686ee7956 fix: incorrect parsing on theatlantic.com (#475)
* fix: incorrect parsing on theatlantic.com

* chore: updating theatlantic.com tests & fixtures

* chore: removing script data from minified fixture
2019-08-20 09:58:24 -07:00
Michael Ashley
5e33263d25 chore: minifying biorxiv.com fixture (#478) 2019-08-20 09:46:15 -07:00
david0leong
911b0f87c8 Add custom extractor for biorxiv.org (#467)
* Add custom extractor for biorxiv.org

* Fix content selector

* Improve content selector
2019-08-19 13:46:03 -07:00
Jakob Fix
76d59f2d58 doc: correct internal page links (#470)
Specifically, to the cleaning content and using transform sections.
2019-08-16 14:41:46 -07:00
dependabot[bot]
398cba4d66 chore(deps): bump lodash.merge from 4.6.1 to 4.6.2 (#456)
Bumps [lodash.merge](https://github.com/lodash/lodash) from 4.6.1 to 4.6.2.
- [Release notes](https://github.com/lodash/lodash/releases)
- [Commits](https://github.com/lodash/lodash/commits)

Signed-off-by: dependabot[bot] <support@github.com>
2019-08-16 14:26:03 -07:00
dependabot[bot]
90e208ea13 chore(deps): bump cached-path-relative from 1.0.0 to 1.0.2 (#472)
Bumps [cached-path-relative](https://github.com/ashaffer/cached-path-relative) from 1.0.0 to 1.0.2.
- [Release notes](https://github.com/ashaffer/cached-path-relative/releases)
- [Commits](https://github.com/ashaffer/cached-path-relative/commits)

Signed-off-by: dependabot[bot] <support@github.com>
2019-08-16 14:22:57 -07:00
dependabot[bot]
5bb7c58e95 chore(deps): bump merge from 1.2.0 to 1.2.1 (#473)
Bumps [merge](https://github.com/yeikos/js.merge) from 1.2.0 to 1.2.1.
- [Release notes](https://github.com/yeikos/js.merge/releases)
- [Commits](https://github.com/yeikos/js.merge/compare/v1.2.0...v1.2.1)

Signed-off-by: dependabot[bot] <support@github.com>
2019-08-16 14:19:29 -07:00
greenkeeper[bot]
ce572f3a28 chore(package): update brfs-babel to version 2.0.0 (#461) 2019-08-16 12:25:12 -07:00
greenkeeper[bot]
6f65702a6c Update moment-timezone to the latest version 🚀 (#455)
* fix(package): update moment-timezone to version 0.5.26

* chore(package): update lockfile yarn.lock
2019-08-16 12:20:47 -07:00
greenkeeper[bot]
c764cebc0c chore(package): update remark-cli to version 7.0.0 (#460) 2019-08-16 12:16:46 -07:00
greenkeeper[bot]
853e041d84 deps: update husky to the latest version 🚀 (#450)
* chore(package): update husky to version 3.0.0

* chore(package): update lockfile yarn.lock
2019-07-03 09:44:02 -07:00
greenkeeper[bot]
f42f81218b deps: update iconv-lite to the latest version 🚀 (#447)
* fix(package): update iconv-lite to version 0.5.0

* chore(package): update lockfile yarn.lock
2019-07-03 09:34:54 -07:00
Kirill Danshin
592f175270 tests: remove a duplicate test (#448) 2019-07-03 09:30:10 -07:00
Adam Pash
713de25751 release: 2.1.1 (#446) 2019-06-26 13:36:55 -07:00
greenkeeper[bot]
c11b85f405 deps: update eslint-config-prettier to version 5.0.0 (#441) 2019-06-26 10:32:27 -07:00
Jaen
3b0d5fed69 chore: prevent adding phantomjs-prebuilt as a dependency in CI. (#412)
Fixes #384.
2019-06-26 10:22:23 -07:00
Toufic Mouallem
939d181951 fix: support query strings in lazy-loaded srcsets (#387) 2019-06-26 10:13:58 -07:00
Ben Ubois
0942c37876 feat: custom parser for phoronix.com. (#431) 2019-06-26 09:55:13 -07:00
Michael P. Geraci
571a913745 feat: pitchfork extractor (#439)
* generate the custom extractor and get the first test to pass

* add the basic extractors (title, author, date, etc)

* select the score as well as the review text, and break the content test

* prepend the score to the content

* get the date from the datetime attribute

* mangle this test a little, but just a little (it does work properly)

* move from prepending the score to the review text to adding it as a custom field in the extractor
2019-06-26 09:02:17 -07:00
greenkeeper[bot]
c8a66b0d77 deps: Update moment-timezone to the latest version 🚀 (#388)
* fix(package): update moment-timezone to version 0.5.24

* chore(package): update lockfile yarn.lock
2019-06-17 14:13:18 -07:00
dependabot[bot]
255da63e26 deps: bump handlebars from 4.0.6 to 4.1.2 (#434)
Bumps [handlebars](https://github.com/wycats/handlebars.js) from 4.0.6 to 4.1.2.
- [Release notes](https://github.com/wycats/handlebars.js/releases)
- [Changelog](https://github.com/wycats/handlebars.js/blob/master/release-notes.md)
- [Commits](https://github.com/wycats/handlebars.js/compare/v4.0.6...v4.1.2)

Signed-off-by: dependabot[bot] <support@github.com>
2019-06-17 14:09:47 -07:00
dependabot[bot]
c7abfc25c6 chore(deps): bump sshpk from 1.10.1 to 1.16.1 (#435)
Bumps [sshpk](https://github.com/joyent/node-sshpk) from 1.10.1 to 1.16.1.
- [Release notes](https://github.com/joyent/node-sshpk/releases)
- [Commits](https://github.com/joyent/node-sshpk/commits/v1.16.1)

Signed-off-by: dependabot[bot] <support@github.com>
2019-06-12 14:47:57 -07:00
david0leong
694ea820aa Custom Extractor for clinicaltrials.gov (#305)
* Add prototype of custom extractor for clinicaltrials.gov

* Add .DS_Store to gitignore

* Make tests for title, author and date_published selectors pass

* Make content selector test pass

* Fix date_published test

* Rebuild

* Remove .DS-Store from gitignore

* Improve extractor and test/fixture of clinicaltrials.gov
2019-05-27 09:25:51 +03:00
Toufic Mouallem
a7cd9027e2 chore: update husky to version 2.3.0 (#422)
* chore(package): update husky to version 2.3.0

Closes #395

* chore(package): update lockfile yarn.lock
2019-05-20 16:45:22 +03:00
Gina Trapani
9f6f07508c docs: Add links to README 2019-05-19 13:30:16 -04:00
Toufic Mouallem
3414ebaa62 chore: update jquery to version 3.4.1 (#420) 2019-05-16 11:34:47 +03:00
Wajeeh Zantout
7c8de71c52 fix: new yorker extractor (#414)
* fix: new yorker extractor

* fix: date_published selector

* fix: remove footer from content

* feat: add additional selector for title

* feat: support article with multiple authors
2019-05-15 11:00:50 +03:00
Wajeeh Zantout
e66ad8b81c feat: add le monde extractor (#415) 2019-05-14 14:53:49 +03:00
kik0220
f81dc63617 feat: add rbbtoday.com custom parser (#411)
* feat: add rbbtoday.com custom parser

* fix: content test

* fix: dek and content
2019-05-08 14:04:03 +03:00
kik0220
5e1113b3a9 feat: add japan.zdnet.com custom parser (#410)
* feat: add japan.zdnet.com custom parser

* fix: author and date_published selector
2019-05-08 13:51:03 +03:00
kik0220
77e3bc00e2 feat: add wired.jp custom parser (#409)
* feat: add wired.jp custom parser

* fix: author test

* fix: date_published selector

* test: fix dek and content

* test: fix content (without clean dek)
2019-05-08 13:32:04 +03:00
kik0220
0b36c96de0 feat: add techlog.iij.ad.jp custom parser (#405)
* feat: add techlog.iij.ad.jp custom parser

* fix: date_published and content selector
2019-05-08 13:20:47 +03:00
kik0220
406bf1b1a9 feat: add weekly.ascii.jp custom parser (#401)
* feat: add weekly.ascii.jp custom parser

* fix: title and date_published selector
2019-05-08 13:10:42 +03:00
kik0220
216bfade00 feat: add www.ipa.go.jp custom parser (#408) 2019-05-03 13:40:42 +03:00
kik0220
3ae8f3bde3 feat: add www.oreilly.co.jp custom parser (#407) 2019-05-03 13:30:48 +03:00
kik0220
7396e81b72 feat: add sect.iij.ad.jp custom parser (#404) 2019-05-03 13:19:06 +03:00
kik0220
3f1d9030ee feat: add www.lifehacker.jp custom parser (#403) 2019-05-03 13:14:53 +03:00
kik0220
b077000c4a feat: add getnews.jp custom parser (#402) 2019-05-03 13:10:55 +03:00
kik0220
b5425c3e8a feat: add www.gizmodo.jp custom parser (#400) 2019-05-03 13:06:51 +03:00
kik0220
a38c727a0a feat: add deadline.com custom parser (#383)
* feat: add deadline.com custom parser

* fix: timezone

* fix: date_published selectors

* fix: title and author selector

* test: transform .embed-twitter

* fix: regenerate the fixture and fix content selector
2019-04-24 15:29:02 +03:00
kik0220
74a3c49a3c feat: add japan.cnet.com custom parser (#382)
* feat: add japan.cnet.com custom parser

* fix: remove transform
2019-04-24 14:39:54 +03:00