python-readability/readability
Nathan Breit 75e2e0cb3a Defaulting to utf-8 when chardet returns None
On articles like this one chardet returns None:
http://news.zing.vn/nhip-song-tre/thay-giao-gay-sot-tung-bo-luat-tinh-yeu/a291427.html
This causes exceptions later on when encoding.lower() is called
2014-12-18 18:48:22 -08:00
..
__init__.py Expose Document in readability package 2011-06-30 12:17:08 +08:00
cleaners.py Updated scoring algorithm to match readability.js v1.7.1 2011-06-01 12:16:32 +07:00
debug.py Fixed indentation, encoding issue and README bug. Thanks to Greg Jastrab. Bump version to 0.2.3 2011-07-27 02:05:16 +07:00
encoding.py Defaulting to utf-8 when chardet returns None 2014-12-18 18:48:22 -08:00
htmls.py Quickfix for #41 2013-10-10 13:47:58 +07:00
readability.py Fix for #52: <input type="hidden"> are not counted any more for "form removal" heuristic. 2014-09-22 15:31:31 +07:00