kik0220
a9e010b718
feat: add www.sanwa.co.jp custom parser ( #349 )
2019-04-09 11:50:48 +03:00
kik0220
1639eae324
feat: add www.asahi.com custom parser ( #350 )
2019-04-09 11:42:14 +03:00
kik0220
21f7de70c1
feat: add buzzap.jp custom parser ( #351 )
2019-04-09 11:35:40 +03:00
kik0220
f3a7e393a3
feat: add www.ossnews.jp custom parser ( #352 )
2019-04-09 11:30:56 +03:00
kik0220
c309bdb373
feat: add otrs.com custom parser ( #353 )
2019-04-09 11:17:58 +03:00
Alexsander Akers
71c4d05037
Include "src/shims" for webpack builds for web ( #302 )
2019-04-05 15:47:31 -07:00
Frankie Simms
a3fe02678c
chore: small CoC typofix ( #358 )
2019-04-05 15:46:27 -07:00
John Holdun
437f50a5c8
fix: Initialize Content-Type as empty string if not present ( #359 )
2019-04-05 15:45:58 -07:00
Frankie Simms
da9a836eab
chore: remove unneeded import ( #357 )
2019-04-03 10:30:15 -07:00
Frankie Simms
bafa764000
chore: set up ciftr for failed test reports ( #343 )
2019-04-01 14:25:01 -07:00
Toufic Mouallem
262dda94b3
fix: explicity reject non-200 status codes ( #342 )
2019-03-29 15:50:55 -07:00
Drew Bell
b6c82f2b16
doc: fix extend typo in README ( #340 )
2019-03-26 15:23:50 -07:00
Toufic Mouallem
144a797564
feat: Support passing custom headers in requests ( #337 )
2019-03-26 13:48:41 +02:00
Toufic Mouallem
3ed778b53e
fix: Adapt CNBC extractor to article redesign ( #336 )
2019-03-25 15:43:40 -07:00
Toufic Mouallem
da9606a4cb
docs: Add parsing custom HTML to README.md ( #326 )
2019-03-25 15:40:51 -07:00
Drew Bell
b3e2a0ffd1
feat: extract custom types with extend option ( #313 )
...
* feat: extract custom types with extend option
Adds an `extend` option that lets you add custom types to be extracted
and returned alongside the defaults, either in a call to `parse()` or in
a custom extractor.
```
Mercury.parse(
url,
extend: {
last_edited: { selectors: ['#last-edited'], defaultCleaner: false }
}
)
```
* chore: use Reflect.ownKeys
* feat: add CLI options
* doc: add extend param to cli help
* refactor: extract selectExtendedTypes
* feat: only overwrite null extended results
* feat: add allowMultiple extraction option
* feat: accept extendList CLI args
* feat: allow attribute selectors in extends on CLI
* test: update extend tests
* fix: don't invoke cleaner for custom types
* feat: always return array if allowMultiple
* test: add test for array of single result
* refactor: extract extractHtml
* refactor: destructure allowMultiple
* fix: wrap multiple matches in $ for cheerio shim
* fix: find extended types before any other munging
* feat: absolutize all links
* fix: clean content more directly
* doc: Update CLI docs in README
* chore: update dist
* doc: Document extend in custom extractor README
2019-03-25 15:36:20 -07:00
Toufic Mouallem
136d6df798
feat: Return specific errors on failed parse attempts
2019-03-20 11:23:54 +02:00
Toufic Mouallem
a250f403f5
fix: Preserve whitespace in certain HTML elements ( #333 )
2019-03-19 09:43:29 -07:00
Adam Pash
2a3ade706d
fix: run parser preview
2019-03-15 10:15:50 -07:00
Ben Ubois
a7e4c67d1d
Extract content from GitHub repos. ( #306 )
...
* Extract content from GitHub repos.
* Add published and dek.
* Timezone fix.
2019-03-14 08:48:33 -07:00
Matthew Watkins
6e66887048
docs: add content formats to README.md ( #318 )
2019-03-12 08:37:38 -07:00
Toufic Mouallem
0940971069
fix: better handling for responsive images ( #312 )
2019-03-08 15:47:17 -08:00
Drew Bell
785a22245f
feat: switch from forked request to postman-request ( #319 )
2019-03-08 14:46:45 -08:00
Toufic Mouallem
7844129fda
feat: Add custom parser for Reddit ( #307 )
2019-03-08 14:37:24 -08:00
Drew Bell
13581cd899
feat: upgrade watchify to remove vulnerable hoek dep ( #320 )
2019-03-08 14:34:33 -08:00
Drew Bell
91fb0dfb46
fix: update parse signature in tests ( #315 )
2019-03-07 11:30:00 -08:00
Adam Pash
ffb25f34d7
docs: add usage gif ( #308 )
2019-03-05 11:37:56 -08:00
Toufic Mouallem
9714cb70c5
feat: Use Deadspin parser for all Kinja websites ( #304 )
2019-03-04 14:47:09 -08:00
Jordan Hotmann
83d1c2401b
feat: add custom extractor for blisterreview.com ( #299 )
2019-03-01 16:48:26 -08:00
kik0220
d9a1e7b22b
feat: add news.mynavi.jp custom parser ( #287 )
2019-03-01 16:45:32 -08:00
Olli Sulopuisto
44a7ec791d
docs: typofix ( #300 )
2019-02-28 22:38:15 -08:00
Adam Pash
0a15a37f04
fix: ci artifact paths ( #301 )
2019-02-28 11:27:06 -08:00
Adam Pash
9698d9a0c4
dx: comment on custom parser pr fix ( #278 )
...
* dx: comment on custom parser pr fix
* fix path
* write json
* chore: rename comment script
2019-02-28 11:11:03 -08:00
Ben Ubois
ed14203e97
fix: return early if creating the resource failed. ( #285 )
2019-02-20 16:48:51 -08:00
greenkeeper[bot]
52dfdda553
Update mocha to the latest version 🚀 ( #282 )
...
* chore(package): update mocha to version 6.0.0
* chore(package): update lockfile yarn.lock
2019-02-19 13:31:40 -08:00
Adam Pash
b044cfa958
release: 2.0.0 ( #275 )
2019-02-13 15:46:45 -08:00
Adam Pash
2afd8c9fa8
fix: jquery doesn't like the case insensitive selector ( #274 )
2019-02-13 15:41:47 -08:00
Adam Pash
9bf88b0ba3
chore: refactor format output adjustments ( #272 )
...
I had previously done this in an overly complicated manner. This PR cleans
it up a bit.
2019-02-13 13:30:49 -08:00
David Brownman
867623ab33
chore: add files to package.json ( #269 )
2019-02-12 16:59:02 -08:00
Adam Pash
ab56ce0de3
fix: custom parser generator ( #271 )
...
- swap fs import
- fix rollup config
2019-02-12 16:14:47 -08:00
Ben Ubois
0e27448866
feat: Various Character Encoding Improvements ( #270 )
...
* Support HTML5 charset tag
In HTML5 `<meta charset="">` is shorthand for `<meta http-equiv="content-type" content="">`
https://developer.mozilla.org/en-US/docs/Web/HTML/Element/meta
* Handle more character encoding declaration methods.
2019-02-12 15:15:19 -08:00
Madison Kanna
b3fa18b6d9
docs: delete extra semicolon ( #266 )
2019-02-11 15:44:00 -08:00
Adam Pash
e033835c72
fix: parse signature in cli ( #259 )
2019-02-07 17:03:42 -08:00
Adam Pash
32748ad4c5
dx: add .prettierignore ( #258 )
2019-02-07 16:59:43 -08:00
Adam Pash
2d0f10a888
dx: add .prettierignore ( #257 )
2019-02-07 16:50:45 -08:00
Adam Pash
9b0664bc91
feat: add content format output options ( #256 )
2019-02-07 16:48:13 -08:00
Adam Pash
a57f29eec3
release: 1.1.1 ( #254 )
...
see [changelog](./CHANGELOG.md) for changes.
2019-02-07 10:38:39 -08:00
George Haddad
b15948f3f4
chore: remove all-contributors-cli deps and script since no longer used ( #253 )
2019-02-07 08:19:12 -08:00
Adam Pash
02476f4336
docs: add instructions for cli to README ( #251 )
2019-02-06 09:46:13 -08:00
Adam Pash
b77a236dbe
feat: handle cli errors/timeout ( #250 )
2019-02-06 09:34:22 -08:00