Ivan Persidsky
|
fd11f92adb
|
Use a dedicated method and backward iteration for removing nodes (#300)
This improves compat with "real" DOMs that provide a live NodeList as the return value of getElementsByTagName.
|
2016-07-18 12:56:51 +01:00 |
|
Gijs Kruitbosch
|
140d4c4aca
|
Only compute textContent once.
|
2016-05-04 15:09:21 +01:00 |
|
usergit
|
327bfcb93f
|
exposed textContent to be returned
this returns the text content only, this is useful as it allows the content to be easily accessible
|
2016-05-03 23:59:39 -07:00 |
|
Gijs
|
69b81f5d70
|
Fix #287: convert getElementsByTagName result to an array (#288)
|
2016-04-20 08:59:42 +01:00 |
|
Gijs Kruitbosch
|
46b08a5ea5
|
Address issue #277 by marking 'modal' unlikely+negative
|
2016-03-17 10:53:57 +00:00 |
|
Peter deHaan
|
b380917b4b
|
Convert nested function declaration to function expression
|
2016-03-15 10:13:33 -07:00 |
|
Gijs Kruitbosch
|
e830ac9dd8
|
Fix eslint issues identified in m-c
|
2016-03-08 14:49:51 +00:00 |
|
Gijs Kruitbosch
|
dffa760c04
|
Fix issue #267 by ignoring hash URIs when making URIs absolute
|
2016-03-07 10:32:09 +00:00 |
|
Gijs Kruitbosch
|
a9597efc17
|
Fix bug 1230050 by checking for the 'hid' class specifically, r?MattN
|
2016-01-12 15:24:27 +00:00 |
|
Gijs
|
a801846a45
|
Merge pull request #204 from mozilla/tweak-great-grandparent-scoring
Updated great grandparent node scoring.
|
2015-05-05 23:02:09 +01:00 |
|
Nicolas Perriault
|
ae0833522c
|
Improved embedded video elements detection.
|
2015-05-05 22:11:11 +02:00 |
|
Nicolas Perriault
|
46304bb5fe
|
Updated great grandparent node scoring.
|
2015-05-05 18:12:17 +02:00 |
|
Nicolas Perriault
|
88ef3893b5
|
Fixes #180 - Score intermediary headings.
|
2015-05-04 08:59:05 +02:00 |
|
Nicolas Perriault
|
dc1b2c9fa0
|
Refs #195 - Exclude nodes likely to be related content.
|
2015-05-04 08:51:45 +02:00 |
|
Nicolas Perriault
|
cc18cb5787
|
Ref #195 - Add support for dailymotion videos.
|
2015-04-30 15:02:52 +02:00 |
|
Nicolas Perriault
|
9dbc009376
|
Fixes #113 - Recursive node ancestor scoring.
|
2015-04-29 22:51:45 +02:00 |
|
Nicolas Perriault
|
44879722b6
|
Fixes #183 - Preserve list items.
|
2015-04-28 16:32:04 +02:00 |
|
Gijs
|
79aa2fca87
|
Merge pull request #189 from mozilla/dont-remove-headings
Fixes #150 - Keep article intermediary headings.
|
2015-04-27 23:36:39 +01:00 |
|
Margaret Leibovic
|
af6da2a87d
|
Merge pull request #190 from mozilla/improved-author-meta-extraction
Improved author metadata detection.
|
2015-04-27 09:11:30 -07:00 |
|
Nicolas Perriault
|
7aee44adb2
|
Improved author metadata detection.
|
2015-04-27 17:03:23 +02:00 |
|
Gijs Kruitbosch
|
5f184053cd
|
Make isProbablyReaderable include <pre>, and deal with long <br>-separated paragraphs and/or shorter-than-5-paragraph text and such.
|
2015-04-27 15:49:03 +01:00 |
|
Nicolas Perriault
|
2451a07a7d
|
Fixes #150 - Keep article intermediary headings.
|
2015-04-27 15:15:52 +02:00 |
|
Margaret Leibovic
|
319a50b4f0
|
Fixes #184 - Don't strip class names from article content
|
2015-04-24 14:49:30 -07:00 |
|
Gijs
|
49e40768aa
|
Merge pull request #185 from mozilla/score-section-tags-by-default
Fixes #139 #143: Added more weight to section tags.
|
2015-04-24 20:12:11 +01:00 |
|
Nicolas Perriault
|
f6ffa6acde
|
Fixes #139 #143: Added more weight to section tags.
|
2015-04-24 19:55:51 +02:00 |
|
Nicolas Perriault
|
58cd789cd3
|
Improved title extraction 'algorithm'.
|
2015-04-24 16:16:10 +01:00 |
|
Gijs
|
b37ff08bc7
|
Merge pull request #169 from mozilla/clean-footer-tags
Fixes #163 - Avoid including footer tag contents.
|
2015-04-17 16:53:51 +01:00 |
|
Nicolas Perriault
|
12c6a11f67
|
Fixes #163 - Avoid including footer tag contents.
|
2015-04-17 17:33:04 +02:00 |
|
Nicolas Perriault
|
6eeabf90c1
|
Fixes #164 - Add support for title alt semantic metadata.
|
2015-04-17 15:38:25 +02:00 |
|
Gijs Kruitbosch
|
0ff82de0f4
|
Implement createTextNode, do more relaxed escaping there, update testcase.
|
2015-04-13 14:32:49 +01:00 |
|
Margaret Leibovic
|
37a8cd4171
|
Bug 1147584 - Don't remove unlikely <a> tags, and replace <a> tags with their text content if they won't be useful links
|
2015-04-09 17:19:59 -07:00 |
|
Gijs
|
a6014f5854
|
Merge pull request #132 from gijsk/heise-ad-prioritization
Don't look at banners and skyscrapers, remove <noscript> elements
|
2015-04-09 20:12:01 +01:00 |
|
Gijs Kruitbosch
|
a6346a0ad4
|
Don't look at banners and skyscrapers, remove <noscript> elements
|
2015-04-09 20:02:46 +01:00 |
|
Nicolas Perriault
|
4424b0bad7
|
Refs #128 - Add support for options to Readability constructor. r=@gijsk
|
2015-04-09 11:56:58 +02:00 |
|
Nicolas Perriault
|
4d41f5e4ed
|
Refs #117 - Drop social/share buttons.
|
2015-04-07 23:00:52 +02:00 |
|
Gijs Kruitbosch
|
7c60dba3b6
|
Fix Readability.js to work with jsdom's DOM implementation (in particular: no firstElementChild implementation...)
|
2015-04-03 22:32:13 +01:00 |
|
Margaret Leibovic
|
eb3a8e8dc4
|
Bug 1150695 - Move isProbablyReaderable function to Readability.js
|
2015-04-02 17:39:43 -04:00 |
|
Nicolas Perriault
|
f8d37e4276
|
Don't remove elements containing figures or having them as a parent.
|
2015-03-31 19:12:45 +02:00 |
|
Nicolas Perriault
|
b6730703a1
|
Fixes #81 - Keep article images.
|
2015-03-31 18:46:04 +02:00 |
|
Gijs
|
194a5376c8
|
Merge pull request #63 from mozilla/preserve-embedded-tweets
Preserve inline tweets as they're part of article contents.
|
2015-03-25 13:57:34 +00:00 |
|
Gijs Kruitbosch
|
b4332328f3
|
Fix an issue where we don't track scores for the parents appropriately.
|
2015-03-23 14:57:04 +00:00 |
|
Gijs
|
14b33b69db
|
Merge pull request #65 from mozilla/support-embed-videos
Fixes #56 - Updated support for embedded Youtube & Vimeo videos.
|
2015-03-23 14:48:00 +00:00 |
|
Nicolas Perriault
|
ad52d8ee30
|
Fixes #53 - Fixed dot-slash relative URI resolution.
|
2015-03-23 12:54:46 +01:00 |
|
Nicolas Perriault
|
2d5f59f3eb
|
Fixes #56 - Updated support for embedded Youtube & Vimeo videos.
|
2015-03-23 10:07:04 +01:00 |
|
Nicolas Perriault
|
d83763c8a1
|
Preserve inline tweets as they're part of article contents.
|
2015-03-22 18:19:34 +01:00 |
|
Nicolas Perriault
|
cf3dce6cf2
|
Refs #58 - Stripped embed tags.
|
2015-03-22 17:52:23 +01:00 |
|
Nicolas Perriault
|
eee224560b
|
Addressed review comments from @Gijsk.
|
2015-03-21 18:05:58 +01:00 |
|
Nicolas Perriault
|
4f9615cb9a
|
Use forEach when it makes sense.
|
2015-03-21 15:10:00 +01:00 |
|
Gijs Kruitbosch
|
955951659d
|
Bug 1143725 - fix the Herald Sun website
|
2015-03-20 17:49:42 -07:00 |
|
Gijs Kruitbosch
|
eb81444946
|
Improve logic to rely on children instead of childNodes
|
2015-03-20 17:10:45 -07:00 |
|