mirror of
https://github.com/postlight/mercury-parser
synced 2024-11-17 03:25:31 +00:00
chore: small doc fixes
This commit is contained in:
parent
d786a7ae0c
commit
e3ee5e93bf
@ -55,11 +55,10 @@ The text you want isn't the text inside a matching element, but rather, inside t
|
||||
export const ExampleExtractor = {
|
||||
...
|
||||
|
||||
// This example returns the datetime attribute if it exists; if not, it falls back to the text of time.article-timestamp
|
||||
// This example returns the datetime attribute if it exists
|
||||
date_published: {
|
||||
selectors: [
|
||||
['time.article-timestamp[datetime]', 'datetime'],
|
||||
'time.article-timestamp',
|
||||
],
|
||||
},
|
||||
|
||||
@ -148,13 +147,19 @@ Now that you know the basics of how custom extractors work, let's walk through t
|
||||
First, you'll need to clone the Mercury Parser repository and install dependencies.
|
||||
|
||||
```bash
|
||||
git clone https://github.com/postlight/mercury-parser.git
|
||||
git clone git@github.com:postlight/readability-parser.git
|
||||
|
||||
cd mercury-parser
|
||||
cd readibilty-parser
|
||||
|
||||
npm install
|
||||
```
|
||||
|
||||
If you don't have already have watchman installed, you'll also need to install that:
|
||||
|
||||
```bash
|
||||
brew install watchman
|
||||
```
|
||||
|
||||
You should also create a new git branch for your custom extractor:
|
||||
|
||||
```bash
|
||||
|
Loading…
Reference in New Issue
Block a user