readability/test/test-pages/heise
Daniel Aleksandersen 5a69d4a8eb Improve metadata extraction (#478)
* Improve metadata extraction

* Recognize meta[property] as a space-separated list
* Recognize Dulin Core (dc|dcterm): metadata.
* Prefer Dublin Core, Open Graph, Twitter, and HTML in that order.
* _getArticleTitle() is now only used as fallback if document
 doesn't provide good metadata.
2018-08-25 00:28:00 +01:00
..
expected-metadata.json Improve metadata extraction (#478) 2018-08-25 00:28:00 +01:00
expected.html Update test expectations. 2017-11-21 10:04:59 +00:00
source.html Don't look at banners and skyscrapers, remove <noscript> elements 2015-04-09 20:02:46 +01:00