Nicolas Perriault
|
670322a4da
|
Fixes #198 - Avoid stripping linked list items.
|
9 years ago |
Gijs
|
a801846a45
|
Merge pull request #204 from mozilla/tweak-great-grandparent-scoring
Updated great grandparent node scoring.
|
9 years ago |
Nicolas Perriault
|
ae0833522c
|
Improved embedded video elements detection.
|
9 years ago |
Nicolas Perriault
|
46304bb5fe
|
Updated great grandparent node scoring.
|
9 years ago |
Nicolas Perriault
|
88ef3893b5
|
Fixes #180 - Score intermediary headings.
|
9 years ago |
Nicolas Perriault
|
dc1b2c9fa0
|
Refs #195 - Exclude nodes likely to be related content.
|
9 years ago |
Nicolas Perriault
|
cc18cb5787
|
Ref #195 - Add support for dailymotion videos.
|
9 years ago |
Nicolas Perriault
|
9dbc009376
|
Fixes #113 - Recursive node ancestor scoring.
|
9 years ago |
Nicolas Perriault
|
44879722b6
|
Fixes #183 - Preserve list items.
|
9 years ago |
Gijs
|
79aa2fca87
|
Merge pull request #189 from mozilla/dont-remove-headings
Fixes #150 - Keep article intermediary headings.
|
9 years ago |
Margaret Leibovic
|
af6da2a87d
|
Merge pull request #190 from mozilla/improved-author-meta-extraction
Improved author metadata detection.
|
9 years ago |
Nicolas Perriault
|
7aee44adb2
|
Improved author metadata detection.
|
9 years ago |
Gijs Kruitbosch
|
5f184053cd
|
Make isProbablyReaderable include <pre>, and deal with long <br>-separated paragraphs and/or shorter-than-5-paragraph text and such.
|
9 years ago |
Nicolas Perriault
|
2451a07a7d
|
Fixes #150 - Keep article intermediary headings.
|
9 years ago |
Margaret Leibovic
|
319a50b4f0
|
Fixes #184 - Don't strip class names from article content
|
9 years ago |
Gijs
|
49e40768aa
|
Merge pull request #185 from mozilla/score-section-tags-by-default
Fixes #139 #143: Added more weight to section tags.
|
9 years ago |
Nicolas Perriault
|
f6ffa6acde
|
Fixes #139 #143: Added more weight to section tags.
|
9 years ago |
Nicolas Perriault
|
58cd789cd3
|
Improved title extraction 'algorithm'.
|
9 years ago |
Gijs
|
b37ff08bc7
|
Merge pull request #169 from mozilla/clean-footer-tags
Fixes #163 - Avoid including footer tag contents.
|
9 years ago |
Nicolas Perriault
|
12c6a11f67
|
Fixes #163 - Avoid including footer tag contents.
|
9 years ago |
Nicolas Perriault
|
6eeabf90c1
|
Fixes #164 - Add support for title alt semantic metadata.
|
9 years ago |
Gijs Kruitbosch
|
0ff82de0f4
|
Implement createTextNode, do more relaxed escaping there, update testcase.
|
9 years ago |
Margaret Leibovic
|
37a8cd4171
|
Bug 1147584 - Don't remove unlikely <a> tags, and replace <a> tags with their text content if they won't be useful links
|
9 years ago |
Gijs
|
a6014f5854
|
Merge pull request #132 from gijsk/heise-ad-prioritization
Don't look at banners and skyscrapers, remove <noscript> elements
|
9 years ago |
Gijs Kruitbosch
|
a6346a0ad4
|
Don't look at banners and skyscrapers, remove <noscript> elements
|
9 years ago |
Nicolas Perriault
|
4424b0bad7
|
Refs #128 - Add support for options to Readability constructor. r=@gijsk
|
9 years ago |
Nicolas Perriault
|
4d41f5e4ed
|
Refs #117 - Drop social/share buttons.
|
9 years ago |
Gijs Kruitbosch
|
7c60dba3b6
|
Fix Readability.js to work with jsdom's DOM implementation (in particular: no firstElementChild implementation...)
|
9 years ago |
Margaret Leibovic
|
eb3a8e8dc4
|
Bug 1150695 - Move isProbablyReaderable function to Readability.js
|
9 years ago |
Nicolas Perriault
|
f8d37e4276
|
Don't remove elements containing figures or having them as a parent.
|
9 years ago |
Nicolas Perriault
|
b6730703a1
|
Fixes #81 - Keep article images.
|
9 years ago |
Gijs
|
194a5376c8
|
Merge pull request #63 from mozilla/preserve-embedded-tweets
Preserve inline tweets as they're part of article contents.
|
9 years ago |
Gijs Kruitbosch
|
b4332328f3
|
Fix an issue where we don't track scores for the parents appropriately.
|
9 years ago |
Gijs
|
14b33b69db
|
Merge pull request #65 from mozilla/support-embed-videos
Fixes #56 - Updated support for embedded Youtube & Vimeo videos.
|
9 years ago |
Nicolas Perriault
|
ad52d8ee30
|
Fixes #53 - Fixed dot-slash relative URI resolution.
|
9 years ago |
Nicolas Perriault
|
2d5f59f3eb
|
Fixes #56 - Updated support for embedded Youtube & Vimeo videos.
|
9 years ago |
Nicolas Perriault
|
d83763c8a1
|
Preserve inline tweets as they're part of article contents.
|
9 years ago |
Nicolas Perriault
|
cf3dce6cf2
|
Refs #58 - Stripped embed tags.
|
9 years ago |
Nicolas Perriault
|
eee224560b
|
Addressed review comments from @Gijsk.
|
9 years ago |
Nicolas Perriault
|
4f9615cb9a
|
Use forEach when it makes sense.
|
9 years ago |
Gijs Kruitbosch
|
955951659d
|
Bug 1143725 - fix the Herald Sun website
|
9 years ago |
Gijs Kruitbosch
|
eb81444946
|
Improve logic to rely on children instead of childNodes
|
9 years ago |
Margaret Leibovic
|
3c2d93cd09
|
Improve byline algorithm
|
9 years ago |
Gijs Kruitbosch
|
d94f3158d3
|
Fix readability.js to do a DOM traversal rather than relying on a wonky DOMCollection, fix trims, fix a potential null access, etc.
|
9 years ago |
Margaret Leibovic
|
fc53e1a315
|
Set 'name' variable to null in _getExcerpt to avoid old values in future for loop iterations
|
9 years ago |
Margaret Leibovic
|
2c7c504a36
|
Merge pull request #32 from gijsk/regex-issues-with-class-and-id-stuff
Fix regex issues. r=margaret
|
9 years ago |
Gijs
|
aec1ce774d
|
Merge pull request #31 from gijsk/testing-generates
Allow generating tests from the web, make testing more closely match Firefox
|
9 years ago |
Gijs Kruitbosch
|
1c42f29aa5
|
Create a script to generate testcases, actually use our version of JSDOMParser
|
9 years ago |
Gijs
|
17062c1ccf
|
Fix video regular expression to support https
|
9 years ago |
Gijs
|
d9f1e884dd
|
Fix regex issues
|
9 years ago |