Commit Graph

2 Commits (80d818aaa695984cdafe8c981681dc14b7c904e5)

Author SHA1 Message Date
Maria Luiza Soares 8c41d92560 Assert on siteName in all test cases 6 years ago
Daniel Aleksandersen 5a69d4a8eb Improve metadata extraction (#478)
* Improve metadata extraction

* Recognize meta[property] as a space-separated list
* Recognize Dublin Core (dc: or dcterm: prefixed) metadata.
* Prefer Dublin Core, Open Graph, Twitter, and HTML in that order.
* _getArticleTitle() is now used only as a fallback if the document
 doesn't provide good metadata.
6 years ago
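
The lookup order described in the second commit message can be sketched as follows. This is a minimal illustration in Python, not the project's actual code: the names PREFIX_PRIORITY, split_property, and pick_metadata are hypothetical, and the input is assumed to be a list of (property-or-name, content) pairs already pulled from the page's meta tags. It treats each property value as a space-separated list, prefers Dublin Core, then Open Graph, then Twitter, then plain HTML names, and returns a caller-supplied fallback (standing in for something like _getArticleTitle()) only when no metadata matches.

```python
# Assumed priority order, taken from the commit message:
# Dublin Core (dc, dcterm), Open Graph (og), Twitter, then unprefixed HTML names.
PREFIX_PRIORITY = ["dc", "dcterm", "og", "twitter", ""]


def split_property(value):
    """Split a meta[property] value into its space-separated names."""
    return value.split()


def pick_metadata(meta_tags, field, fallback=None):
    """Return the best value for `field` from (property, content) pairs.

    `meta_tags` is a hypothetical pre-parsed list such as
    [("og:title twitter:title", "Some title"), ("dc:title", "Other")].
    """
    found = {}
    for attr, content in meta_tags:
        text = content.strip()
        if not text:
            continue  # ignore empty content attributes
        for name in split_property(attr):
            prefix, sep, key = name.lower().partition(":")
            if not sep:
                # No colon: an unprefixed HTML name such as "title".
                prefix, key = "", name.lower()
            if key == field:
                # Keep the first value seen for each prefix.
                found.setdefault(prefix, text)

    for prefix in PREFIX_PRIORITY:
        if prefix in found:
            return found[prefix]
    # Nothing usable in the metadata: use the fallback.
    return fallback


if __name__ == "__main__":
    tags = [
        ("og:title twitter:title", "OG Title"),
        ("dc:title", "DC Title"),
        ("title", "HTML Title"),
    ]
    print(pick_metadata(tags, "title", fallback="Fallback Title"))
```

Collecting one value per prefix first and resolving the priority afterwards keeps the result independent of the order in which the tags appear in the document.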