Commit Graph

380 Commits (fix-remove-moment-js)

Author SHA1 Message Date
Jordan Hotmann 83d1c2401b feat: add custom extractor for blisterreview.com (#299) 5 years ago
kik0220 d9a1e7b22b feat: add news.mynavi.jp custom parser (#287) 5 years ago
Olli Sulopuisto 44a7ec791d docs: typofix (#300) 5 years ago
Ben Ubois ed14203e97 fix: return early if creating the resource failed. (#285) 5 years ago
Adam Pash 2afd8c9fa8
fix: jquery doesn't like the case insensitive selector (#274) 5 years ago
Adam Pash 9bf88b0ba3
chore: refactor format output adjustments (#272)
I had previously done this in an overly complicated manner. This PR cleans
it up a bit.
5 years ago
Ben Ubois 0e27448866 feat: Various Character Encoding Improvements (#270)
* Support HTML5 charset tag

In HTML5 `<meta charset="">` is shorthand for `<meta http-equiv="content-type" content="">`
https://developer.mozilla.org/en-US/docs/Web/HTML/Element/meta

* Handle more character encoding declaration methods.
5 years ago
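The two declaration forms mentioned in #270 can be illustrated with a minimal sketch. The helper name `sniffCharset` and both regular expressions are assumptions made for illustration and are not taken from the project's code. A real implementation would more likely read attributes from a parsed DOM than scan raw markup.

```javascript
// Hypothetical helper: NOT the project's implementation.
// Returns the lowercase charset declared in an HTML string, or null.
function sniffCharset(html) {
  // HTML5 shorthand: <meta charset="utf-8">
  const shorthand = /<meta\s+charset\s*=\s*["']?([\w-]+)/i.exec(html);
  if (shorthand) return shorthand[1].toLowerCase();

  // Long form: <meta http-equiv="content-type" content="text/html; charset=utf-8">
  // Assumes http-equiv appears before content, which real pages do not guarantee.
  const longForm = /<meta[^>]+http-equiv\s*=\s*["']?content-type["']?[^>]*content\s*=\s*["'][^"'>]*charset=([\w-]+)/i.exec(html);
  if (longForm) return longForm[1].toLowerCase();

  return null;
}
```

Calling `sniffCharset('<meta charset="UTF-8">')` yields `'utf-8'`, and markup without either declaration yields `null`.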
Adam Pash 9b0664bc91
feat: add content format output options (#256) 5 years ago
Adam Pash c6f42c1278
docs: cleanup and update docs (#238) 5 years ago
George Haddad 5c0325f5a7
feat: hook up ci to publish to npm (#226)
* chore: add missing fields to package.json

* feat: add postlight org scope to package name

* feat: automate npm publish

* test: npm publish without filters

* fix: add docker image

* test: change directory

* test: add working directory

* fix: defaults syntax

* test: add workspace

* fix: attach workspace

* fix: use standard mercury email

* fix: use ISO time format and preserve original timezone offset

* fix: do not match time zone offset

* chore: move babel runtime-corejs2 to prod deps

* chore: uncomment config to deploy on git tag

* feat: publish to npm public

* adding browser-request

It doesn't seem to impact the build, but technically it should be there
so for good measure, why not...

* chore: roll version back to original state
5 years ago
Adam Pash 663cc45bf4
fresh run of prettier; remove NOTES.md (#233) 5 years ago
Wajeeh Zantout 1ccd14e1e9 feat: add fortinet custom parser (#188)
* feat: add fortinet custom parser

* fix: eslint error

* fix: transform noscript images

* feat: add fortinet custom parser

* fix: eslint error

* fix: transform noscript images

* fix: transform method

* test: transform method

* fix: fs import
5 years ago
Wajeeh Zantout 9b36003b62 feat: add fastcompany custom parser (#191)
* feat: add fastcompany custom parser

* fix: eslint error

* fix: test for date_published

* feat: add fastcompany custom parser

* fix: eslint error

* fix: test for date_published

* fix: fs import
5 years ago
Toufic Mouallem bb6ad2682b fix: Transform relative URLs in srcset attributes to absolute URLs (#190) 5 years ago
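As a rough illustration of the srcset fix in #190, the sketch below resolves each candidate URL against the page URL with the standard `URL` constructor. The function name `absolutizeSrcset` is hypothetical, and splitting on commas is a simplification that would mishandle URLs that themselves contain commas.

```javascript
// Hypothetical helper: a sketch of the idea, not the project's code.
// Rewrites every URL in a srcset value so it is absolute.
function absolutizeSrcset(srcset, baseUrl) {
  return srcset
    .split(',')                  // simplification: one candidate per comma
    .map(candidate => candidate.trim())
    .filter(Boolean)             // drop empty candidates
    .map(candidate => {
      // A candidate is a URL optionally followed by a descriptor (1x, 480w).
      const [url, ...descriptor] = candidate.split(/\s+/);
      return [new URL(url, baseUrl).href, ...descriptor].join(' ');
    })
    .join(', ');
}

const fixed = absolutizeSrcset(
  '/img/a.jpg 1x, b.jpg 2x',
  'https://example.com/posts/1'
);
```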
Jad Termsani 15a5229998 fix: womansay.net image urls (#196) 5 years ago
Adam Pash 0e22947e2c
fix: non-forked packages breaking web build (#225) 5 years ago
Ralph Jbeily f3f6e21fd8 fix: author and date published selectors (#189) 5 years ago
Jad Termsani 28cf41304c
fix: timezone comparison (#222)
* fix: use format() instead of toISOString()

* fix: timezone comparison
5 years ago
Ralph Jbeily ca44ce3dd1
docs: add install build and test guide (#215)
* docs: add install build and test guide

* docs: remove install build and test guides

* docs: add installation guide
5 years ago
Ralph Jbeily 2e1e4d90c9
feat: add remarklint for md docs (#213)
* feat: add remarklint for md docs

* fix: remarkrc file and run linter on commit hook
5 years ago
Adam Pash 76d333f0be
deps: upgrade (#218) 5 years ago
George Haddad 56badb51f5
dx: remove unnec comments in source (#205)
* dx: remove commented code and obvious comments that can be looked up

* dx: remove commented out eslint options

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* dx: remove test block as all its code was commented out

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* dx: remove regex example comments

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out import

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* dx: remove commented out code

* chore: remove empty files

* chore: re-prettier code that may have missed it

* added back nec comments
5 years ago
Adam Pash e4b057f9ea
chore: update node and some deps (#209)
* chore: update .nvmrc

* added prettier and pre-commit hooks

* update docker image to new node

* add karma-cli to get web tests working

* explicitly install karma... seems to fix problem

* remove pre-built phantomjs

* swap install order
5 years ago
Adam Pash 96640e3564
fix: failing fetchResource test (#187)
I think it was a fixture problem
6 years ago
Adam Pash d850177b68
docs: Update README.md (#184) 6 years ago
Adam Pash 5663660f76
fix: nytimes custom parser title selector (#181)
* fix: nytimes custom parser title selector

* upgrade node version

* circle ci tweak
6 years ago
Jeremy Mack 5fcea1c5c3 fix: PARSING_NODE undefined (#172)
* fix: PARSING_NODE undefined

* chore: remove unused cleanup function/call
7 years ago
Jeremy Mack e92e798880 fix: viewport tags leaking to parent page (#170)
* fix: scrub meta viewport tags

They leak to the parent page when using the web version of Mercury
Parser.

* chore: build

* fix: keep DOM in memory to avoid conflicts
7 years ago
Adam Pash b8aa87c777 feat: improve wh parser (#168) 7 years ago
Adam Pash 61f0f4e1af fix: kept elements being removed (#166)
Elements marked to keep were removable under specific circumstances.
This PR fixes these edge cases.
7 years ago
Adam Pash 453419de72 feat: improve wh.gov parser (#163)
* feat: support youtube-nocookie domain

* feat: updated wh.gov parser to support speeches
7 years ago
Janet f13bb721f6 feat: prospect magazine parser (#147)
* feat: prospect magazine parser

Couldn’t find a way to parse the date but I think it’s good otherwise.

* fix: pulls date

* fix: add timezone

* fix: generalize
7 years ago
Kevin Ngao 1b28713cf5 feat: fool.com parser (#158)
* feat: add fool.com custom parser
7 years ago
Janet c18959779d feat: forward.com parser (#144)
* feat: forward.com parser

LGTM although image didn’t show up in preview

* feat: also pull image into content

* fix: generalize selectors

* fix: generalize selector
7 years ago
Janet 50e548bac2 feat: qdaily parser (#146)
* feat: qdaily parser

Firstly — I accidentally tried to generate the parser on the master
branch, and I’m not sure where it is, maybe floating in the nether
world.

On to the parser — this one was a bit tricky because things were in
Chinese! The content appears to be parsing (as seen in preview) but
it’s not passing the test. I noticed the second “ ‘ “ mark isn’t
appearing on the parser side.

Additionally, some of the lazy loading images aren’t appearing in the
preview (I cleaned the wrong lazy load images that appeared), so
someone will probably have to work on that (I don’t know how to do
transforms yet).

* fix tests

* fix: selector generalization
7 years ago
Silas Burton 51a4d1d12f feat: newrepublic parser shows image on page (#159) 7 years ago
Silas Burton 11382ce651 Feat: Slate extractor (#153)
* feat: slate extractor

* fix: generalize selectors

* fix: add Slate timezone
7 years ago
Silas Burton 5acaa6ab56 feat: ici.radio-canada.ca extractor (#156)
* feat: ici.radio-canada.ca extractor

* fix: add timezone
7 years ago
Silas Burton 4509b341e6 feat: better cleanup of atlantic articles (#157) 7 years ago
Kevin Ngao f2e3f055c2 Fixes an issue with encoding (#154)
* fix: fixes an issue with encoding on the fetch level
7 years ago
Silas Burton 9b371e51ac Feat: gothamist extractor (#151)
* feat: gothamist extractor

* feat: add other gothamist network sites

* fix: try getting date another way

* fix: add gothamist timezone

* fix: generalize selectors

* fix: h1 is inside entry-header, needs to be specific because of another h1 on the page

* fix: general and specific selector
7 years ago
Kevin Ngao afbef9bc39 Fix Encoding on Body (#143)
* fix: check encoding on body
7 years ago
Janet 93d2baf5cf feat: news.natgeo parser (#88)
* feat: natgeo parser

For some reason, the local copy of the article didn’t grab the author
name in it, so I couldn’t figure out how to parse it. The generic
parser took a name of an author of a paper mentioned in the article,
and thought that was the author name, which was funny.

I cleaned a large block quote that didn’t make sense as it was shown in
the preview, although I noticed that the Mercury chrome extension
didn’t even display it.

* fix: add date_published transform

* fix: date_published assertion

* disable: author assertion, generalize author selector

* rm: author assertion

* fix: image lead

* fix: guard against missing img url

* fix: generalize dek and title selectors
7 years ago
Janet 2279c2d486 feat: natgeo parser (#89)
* feat: natgeo parser

Same as the news.nationalgeographic.com parser - for some reason the
author name doesn’t appear to be getting pulled into the local copy of
the file.

* fix: content assertion

* fix: generalize author byline

* disable: author assertion

* rm: author assertion

* fix: image lead, handles image-group

* fix: guard against missing img url

* fix: generalize dek and title selectors
7 years ago
Adam Pash 08b5bb7ff1 feat: allow parser to define custom date formats (#141)
* feat: allow parser to define custom date formats

* feat: updating macrumors to test/verify format working correctly
7 years ago
Janet 11f466ccb3 feat: latimes parser (#92)
* feat: latimes parser
7 years ago
Kevin Ngao 26a8e4f75a feat: macrumors parser (#120)
* feat: add macrumors
7 years ago
Kevin Ngao b4fec6af98 feat: androidcentral parser (#119)
* feat: androidcentral parser
7 years ago
Janet beb0b89a4f feat: pagesix parser (#97)
* feat: pagesix parser
7 years ago
Janet f2160eb5b6 feat: si parser (#118)
* feat: si parser
7 years ago
Janet 2af0f6179a feat: rawstory parser (#109)
* feat: rawstory parser

Finished, with a little help from Frankie (thanks Frankie!)

* fix: date_published timezone
7 years ago
Janet 765032452d feat: thefederalistpapers parser (#101)
* feat: thefederalistpapers parser
7 years ago
Janet fb5eb2e104 feat: cnet parser (#104)
* feat: cnet parser

Date test fail - please take a look!

Also, image didn’t load in preview.

* fix: timezone

* fix: image lead
7 years ago
Janet 3c5fa28f10 feat: cbs sports parser (#98)
* feat: cbs sports parser
7 years ago
Janet 3cf2d0d3ef feat: msnbc parser (#100)
* feat: msnbc parser
7 years ago
Janet f9ab9eb885 feat: howtogeek extractor (#108)
* feat: howtogeek extractor

This one is a bit tricky - the author and date info appear in a comment
section at the bottom. Was able to parse the author but not the date
info. Halp

* howtogeek update

Thanks to @fdsimms I was able to parse the date, but not sure what to
test it against, so I left it blank.

* fix: date_published assertion, it was comparing against empty string

* fix: timezone

* amend: generalize author selector
7 years ago
Janet 258acdfd02 feat: opposing views parser (#103)
* feat: opposing views parser
7 years ago
Janet b63dd33579 feat: today parser (#106)
* feat: today parser

This looks fine — there are a couple of lines of “Related” but they are
within the body (and don’t have their own classes) so I couldn’t clean
them out.

* fix: fix content assertion
7 years ago
Janet c94eee7f92 feat: cinema blend parser (#105)
* feat: cinema blend parser

all systems go

* fix: timezone
7 years ago
Janet 64e3c205e8 feat: the political insider parser (#99)
* feat: the political insider parser with timezone
7 years ago
Janet 7b52d3d1fc feat: al.com parser (#110)
* feat: al.com parser

I think this is good but could you pls double check time zone on the
date? Thanks

* fix: date_published timezone
7 years ago
Janet 15df58496f feat: westernjournalism parser (#113)
* feat: westernjournalism parser

Adjacent sibling selector FTW!

Image not displaying in preview.

* feat: fix assertion, body does not include _Advertisement_ subtext
7 years ago
Janet ae12a1d701 feat: mental floss parser (#94)
* feat: mental floss parser
7 years ago
Janet bf29291395 feat: thepennyhoarder parser (#112)
* feat: thepennyhoarder parser

Looks good, although no image in preview!

* fix: adds selector for article lead image
7 years ago
Janet fadd198d04 feat: abcnewsgo parser (#90)
* feat: abcnewsgo parser
7 years ago
Adam Pash 25d9642ff9 feat: support cleaning and transforms for all fields (#138) 7 years ago
Janet 1054d854dd feat: america now parser (#114)
* feat: america now parser

Looks good but lead image did not display in preview.

* feat: adds selector for lead image
7 years ago
dviramontes 93c8ba0e56 feat: adds selector for lead image 7 years ago
dviramontes f71fe7685d feat: adds video embed transform 7 years ago
dviramontes a77515d861 fix: author selector, less brittle 7 years ago
Janet 4c48acba59 feat: fusion parser
Looks okay — image did not load in preview.
7 years ago
David A. Viramontes c679e493de Merge branch 'master' into feat-the-verge-polygon-supported-domain 7 years ago
Janet d292d8ef3a feat: ny daily news parser (#87)
* feat: ny daily news parser
7 years ago
dviramontes a53587acef feat: adds www.polygon.com to list of www.theverge.com supportedDomains 7 years ago
Janet 385b9d76a3 feat: sciencefly extractor (#116)
* feat: sciencefly extractor, use loading image rather than 404'ing meta
7 years ago
Adam Pash 6bd6278a07 feat: custom parser for wh blog (#130) 7 years ago
Adam Pash aa682d71e8 fix: medium bug (#129)
* fix: improved medium parser for images and multi-section content

* fix: duplicate video
7 years ago
Adam Pash 31eb4f9222 Feat: LinkedIn parser (#123)
* feat: rebuild custom parser

* feat: linkedin custom parser
7 years ago
Adam Pash 8662474d8a feat: changed user agent to latest chrome (#121)
* feat: changed user agent to latest chrome

* removed dead link
7 years ago
Janet 7709d69379 feat: npr parser (#86)
* feat: npr parser

Lead image appears in preview, but the test fails for some reason.

AssertionError: null == 'https://media.npr.org/assets/img/2016/12/15/gettyimages-540681598_wide-8b160732b96c083dc115134c3c019f3ac73586ba.jpg?s=1400'

Looks okay otherwise.

* feat: transformed figures/figcaptions, improved date_published and
addressed NPR's bad image metadata
7 years ago
Janet 8a82f2c0ab feat: recode parser (#85)
* feat: recode parser

Thumbs up, as far as I can tell.

Note: No image appeared in the preview.

* feat: pulling in lead image
7 years ago
Janet ad29acd7b7 feat: fortune parser (#84)
* feat: fortune parser

For some reason, the dek doesn’t appear in the local version of the
article I selected. I tried parsing the meta tag containing
og:description but it’s not working, and the description is slightly
longer than the dek in the original article.

I’m not sure why, but for the lead image, the meta tag for og:image is
not parsing the image url.

:(

* feat: fortune redesigned, so re-did extractor

* fix: added timezone
7 years ago
Janet c133ddf614 feat: qz parser (#81)
* feat: qz parser

I couldn’t figure out how to parse the date, but otherwise should be
fine. I added a clean for the div.article-aside element based on what I
saw in how the chrome extension worked.

* feat: updated content to grab top image

test: date is null :/
7 years ago
Janet 84312b6ef1 feat: dmagazine parser (#80)
* feat: dmagazine parser

I’m sorry to have failed you. :-( These are the issues I encountered:

1) author - does not have a unique selector to distinguish it from the
date, couldn’t parse it
2) date - no meta data in the head
3) no meta og:image in the head (my go to), so I couldn’t get the image
test to pass, but it appears to be parsing. The caption below it is the
same size as the body copy in the preview. I couldn’t figure out how to
“transform” it to caption size.

* feat: update date, image, and author selectors and corresponding tests

* feat: generalized content selector
7 years ago
Janet e035f36361 feat: reuters parser (#78)
* feat: reuters parser

Date parses correctly but fails test because of format discrepancy.

Author tags are nested within the content, which is why the author
names are appearing twice. I wasn’t sure how to address this.

Additionally, the location appears twice, so I cleaned the location
tags from the content.

* test: fix date format

* transform .article-subtitle to h4; cleaning author but leaving location
7 years ago
Janet dec49ab073 feat: mashable parser (#76)
* feat: mashable parser

As usual the date is giving me issues because of formatting
discrepancies:
AssertionError: '2016-12-13T22:33:06.000Z' == '2016-12-14T03:33:06.000Z'

Not sure how we wanna deal with Twitter card embeds that don’t show up?

Also, image credits did not show up in preview.

* test: fixed date format

* transforming .image-credit to figcaption
7 years ago
Janet cddc1afb69 feat: chicago tribune parser (#75)
* feat: chicago tribune parser

Date is parsing but failing the test because:
AssertionError: '2016-12-13T21:45:00.000Z' == '2016-12-13T13:45:00-0800'

I tried to insert a line of code for Time Zone but I’m a n00b so I
don’t think I did it right.

No image showing up in the preview.

* fix: remove timezone from date_published extractor

* test: update unit tests to assert the correct value for date_published
7 years ago
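The date assertion failures quoted in #76 and #75 come from comparing timestamp strings rather than the instants they denote. The sketch below compares by instant instead. The helper `toEpochMs` is hypothetical and is not part of the project's tests.

```javascript
// Hypothetical helper for illustration; not part of the project's test suite.
// Converts an ISO 8601 timestamp to epoch milliseconds.
function toEpochMs(iso) {
  // Date.parse is only guaranteed to accept offsets written as +HH:mm,
  // so rewrite a trailing "-0800" style offset as "-08:00" first.
  const normalized = iso.replace(/([+-]\d{2})(\d{2})$/, '$1:$2');
  return Date.parse(normalized);
}

// Same instant in two notations (the chicago tribune values).
const sameInstant =
  toEpochMs('2016-12-13T21:45:00.000Z') === toEpochMs('2016-12-13T13:45:00-0800');

// Different instants (the mashable values), measured in hours.
const hoursApart =
  (toEpochMs('2016-12-14T03:33:06.000Z') - toEpochMs('2016-12-13T22:33:06.000Z')) / 3600000;
```

Here `sameInstant` is `true`, so the chicago tribune strings differ only in notation, while `hoursApart` is `5`, which points to a missing timezone on the mashable date.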
Janet aff651c2d8 feat: hellogiggles parser (#107)
Looks good to me!
7 years ago
Janet 11ad7b9a92 feat: thought catalog parser (#102)
Looks good!
7 years ago
Janet aa43a6091c feat: cnbc parser (#96)
Should be good to go!
7 years ago
Janet cd245f7980 feat: popsugar parser (#93)
I think this one is good to go!
7 years ago
Janet a8ab7135e1 feat: observer parser (#91)
no problems
7 years ago
Janet 3bee7224cb feat: nbc news parser (#74) 7 years ago
Janet 88242dd233 feat: nj.com parser (#73) 7 years ago
Janet 1ac5670a54 feat: inquisitor parser (#72) 7 years ago
Janet 9e5b91ed8b feat: refinery29 parser (#71) 8 years ago
Janet b78c58c43a feat: miami herald parser (#69) 8 years ago
Janet aedf83edc6 feat: eonline parser (#68) 8 years ago
Janet a20da5eb31 uproxx extractor (#66) 8 years ago
Janet 87c42b6358 feat: 247sports.com extractor (#64) 8 years ago
Janet 22e6c884fb feat: rolling stone extractor (#65) 8 years ago
Janet 6337231697 feat: usmagazine extractor (#63) 8 years ago
Janet c06b19efe7 feat: people extractor (#70)
No major problems!
8 years ago
Janet 3cf2bb78c4 feat: vox custom parser (#67) 8 years ago
Janet 861c5f0dcb feat: bustle extractor (#60) 8 years ago
Adam Pash 06397a4360 feat: browser-friendly selector for medium (#61) 8 years ago
Adam Pash 3297ab079d feat: bloomberg extractor (#59)
Bloomberg has several templates. I'm supporting three different templates here, but I'm not sure that this is complete by any means.

It's also worth noting that SVGs don't make it through the parser terribly well for many reasons. One, for example, is that a lot of SVGs require custom CSS in order for them to make sense. I'm not sure this is something we can expect to address in the parser.
8 years ago
Janet e55e9da534 feat: sbnation extractor (#55) 8 years ago
Adam Pash 8070e4790b test: streamlined guardian tests w/new single-extraction (#58) 8 years ago
Adam Pash bdb751fb53 feat: more cleaning for wired (#56) 8 years ago
Janet e7e41bd242 feat: the guardian custom extractor (#41) 8 years ago
Adam Pash 81aa89f2c1 feat: youtube custom extractor (#53) 8 years ago
Adam Pash 2fb47640f2 Feat: detect platforms (#52)
Detectors for matching extractors for publishing platforms. Currently supporting Medium and Blogger.
8 years ago
Adam Pash 64c0fad2fd fix: preserve whitespace (#51)
No longer normalizing whitespace in html
8 years ago
Adam Pash 15656cb3e1 Refactor: running tests more efficiently (#49)
Only running one parser per page we're testing rather than a parser per field we're testing.
8 years ago
Adam Pash f9902cfa05 Fix: extension bugs (#47)
* feat: lead image on atlantic stories now included

* feat: supporting buzzfeed "longform" template

* feat: cleaning .parter-box from the atlantic
8 years ago
Adam Pash 16860f1d85 feat: improved nyt parser (#46)
NYT was one of the first, and its test was stale and it didn't have all
of its fields well defined.
8 years ago
Adam Pash d0453efbf8 feat: improvements for nyer magazine articles (#45)
adds dek and date_published for magazine template
8 years ago
Adam Pash 00f8965c1f fix: cleaning up deks (#44)
We've solidified what we consider a dek. This PR removes the dek selectors that do not fit that mold.
8 years ago
Janet b415d1d37c feat: aol custom extractor (#42)
* feat: aol custom parser

* removed work from other commits. merged with latest master
8 years ago
Matt 4cc3b68b5e feat: remove footer links (#40)
the links at the bottom of the stories feel a little spammy because of how we treat links vs. the way they are displayed on the Times, would like to clean them
8 years ago
Adam Pash ff1963bdca feat: new cleaner for wapo (#38) 8 years ago
Adam Pash 0e6ccdf622 fix: browser cleanup (#35)
Cleaning up after the parser when it's done in the browser, before
returning result.
8 years ago
Silas Burton c3d98a0d76 Feat cnn extractor (#34)
* wip: cnn custom extractor

* wip: cnn works except first paragraph

* final touches on cnn parser

* cleanup
8 years ago
Silas Burton a0570f8e94 feat: extractor for the verge (#33)
* feat: extractor for the verge's standard article template

* feat: basic support for the verge feature template

* feat: allow multiple links to be previewed

* feat: content selector arrays

Content selector arrays allow custom parsers to select multiple elements
to match and include in the result.

* feat: updated verge parser to use multimatch selectors

* lint fix

* cleanup test builds
8 years ago
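The content selector arrays described in #33 could work roughly as sketched below. `selectContent` and the `query` callback are hypothetical stand-ins for the parser's internals, and the rule that every selector in an array must match is an assumption of this sketch, not confirmed project behavior.

```javascript
// Hypothetical sketch; `query` stands in for a DOM lookup and returns an
// array of matched HTML fragments for a CSS selector.
function selectContent(query, selectors) {
  for (const selector of selectors) {
    if (Array.isArray(selector)) {
      // Multi-match entry: collect matches for every selector in the array.
      const groups = selector.map(s => query(s));
      // Assumption: use the entry only when each selector matched something.
      if (groups.every(g => g.length > 0)) {
        return [].concat(...groups).join('');
      }
    } else {
      const matches = query(selector);
      if (matches.length > 0) return matches.join('');
    }
  }
  return null; // nothing matched
}

// Usage with a stubbed lookup table instead of a real document.
const page = {
  '.hero': ['<img src="lead.jpg">'],
  '.body': ['<p>First</p>', '<p>Second</p>'],
};
const query = s => page[s] || [];
const content = selectContent(query, [['.hero', '.body'], '.fallback']);
```

With this stub, `content` combines the lead image and both paragraphs into one result, which is the behavior the commit message attributes to multi-match selectors.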
Adam Pash 233ca11a33 fix: added timezone to new republic date (#32) 8 years ago
Adam Pash cfe7f34be4 fix: normalizing spaces for authors/dek/title (#31)
* fix: normalizing spaces for authors/dek/title
8 years ago
Adam Pash 9a23b24a89 feat: adjustment for huffpo. skipping overly aggressive default cleaners (#30) 8 years ago
Silas Burton be2e4b5c80 Feat: huffington post extractor (#28)
* wip: huffpo custom extractor

* wip: some huffpo cleanup
8 years ago
Adam Pash 94198c0a65 feat: new republic custom extractor (#25)
* wip: new republic custom extractor

* feat: new republic article extractor

* feat: new republic minutes article extractor
8 years ago
Janet c4d72fb735 feat: add money.cnn custom parser (#26)
* feat: add money.cnn custom parser

* added timezone to cnn custom parser
8 years ago
Adam Pash 6343946dd8 Feat: custom timezones (#29)
* using moment-timezone to allow custom timezones

* added tz to tmz, even though still so-so
8 years ago
Adam Pash a8face796a Fix extension bugs (#23)
* feat: cleaning supplemental elements in nytimes (visible in web only)

closes https://github.com/postlight/mercury-reader-chrome-extension/issues/102

* wip

* fix: more generous date published bits

* feat: added washington post extractor (including figure transforms)

closes https://github.com/postlight/mercury-reader-chrome-extension/issues/100

* feat: cleaning zoom lightbox from gizmodo/kinja

* lint fix
8 years ago
Adam Pash 3a2f32b0eb feat: added tmz custom parser (#22) 8 years ago
Adam Pash 783a9cfb2f fix: changed overly liberal regex for removing transparent images 8 years ago
Adam Pash 7411922c55 feat: encoding response body based on content-type charset (#21)
Also some small code organization
8 years ago
Adam Pash c30fb2e4c0 chore: updated readme 8 years ago
Adam Pash 60a6861e18 Feat: browser support (#19)
Big undertaking to support Mercury in the browser. Builds are working and all tests are passing both for web and node builds. Most code is closely shared.
8 years ago
Adam Pash eaea57461a fix: servers returning bad headers was breaking request. temporarily (#20)
using fork with a fix for this until request merges the necessary pull request
8 years ago
Adam Pash 629eada1f7 feat: recording/playing back network requests with nock (#18)
* feat: recording/playing back network requests with nock

* lint fix
8 years ago
Adam Pash e325d860fd Feat: improving ci (#16)
This commit also swaps in yarn for npm and tweaks circle ci a bit.

* appveyor.yml first go

* changing node

* ps

* narrow it down

* trying this

* fix airbnb module

* trying with yarn

* logging

* hybrid?

* trying yarn w/circle

* bump workers?

* build off?

* updating script

* tweaking script for appveyor

* bumping maxworkers

* cleaning up

* build step?

* yarn it

* added appveyor badge
8 years ago
Adam Pash 048d654417 feat: parser auto-generates name; lint is more specific 8 years ago
Adam Pash 65c641a879 feat: enforcing line break rules in linter 8 years ago
Adam Pash 4d1d950807 updated generator templates for new style of import/export. also some
adjustments for usability
8 years ago
Adam Pash 7fa90f59b7 making all.js export a generic function to decrease possibility of error 8 years ago
Adam Pash de5b120b79 feat: allowing extractors to support multiple domains 8 years ago
Adam Pash d038a36544 feat: custom medium extractor 8 years ago
Adam Pash 007ddec8ac feat: allowing iframes from src domain 8 years ago
Adam Pash b65b0c98b0 feat: supporting all GMG sites using DeadspinExtractor 8 years ago
Adam Pash 17317823de fix: bug that stopped proper attr cleaning in certain cases 8 years ago