Commit Graph

35 Commits (master)

Author SHA1 Message Date
Mišo Belica 0751fe0c97 Fixed failing tests
Problem was that when input document
was empty lxml raised ParseError instead
of XMLSyntaxError in new versions.
6 years ago
Mišo Belica aa83825334 Tests migrated into pytest style 6 years ago
Jelmer Vernooij 6f912830c0 Use chardet rather than charade.
The changes from charade have been merged into upstream chardet,
and chardet is available in Debian/Ubuntu whereas charade is not.
10 years ago
Mišo Belica e2f3391dc3 Better decoding page into unicode
- Fixes #22
- Fixes #23

Prepare for release
10 years ago
Mišo Belica 5cb028ec93 Tests are executable with pytest framework
Pytest ignores files with name "test.py" for me :(
10 years ago
Mišo Belica 6918eca90b Debug logging is less verbose 10 years ago
Mišo Belica 687d2ecfdf Merge branch 'master' of https://github.com/bookieio/breadability into upstream-sync
Conflicts:
	CHANGELOG.rst
	README.rst
	breadability/document.py
	breadability/scoring.py
	breadability/scripts/client.py
	setup.py
	tests/test_articles/test_sweetshark/article.html
	tests/test_articles/test_sweetshark/test.py
10 years ago
Richard Harding 6906f3b2fa Update logging to drop WARN to INFO 10 years ago
Richard Harding 347f3ea0b5 Lint 11 years ago
Richard Harding 17270db5f0 Add test for title 11 years ago
Richard Harding 1fc153d850 Rename it back. Respect others 11 years ago
Mišo Belica 3b5b2b1522 Renamed to readability 11 years ago
Mišo Belica cf781bc595 Updated implementation of cached property
Cached value of properties are stored
in instance's '__dict__'.
11 years ago
Mišo Belica 4e3227521e Fewer code - fewer bugs (I hope) 11 years ago
Mišo Belica 1a5970b238 Better names and positions for variables 11 years ago
Mišo Belica 930b6ced12 Fixed transformation of leaf <div> into <p> 11 years ago
Mišo Belica 314c999730 Drop useless tags by HTML cleaner 11 years ago
Mišo Belica 272fe480a3 Updated setup.py 11 years ago
Mišo Belica 18b5c9b447 Refactored file 'scoring.py' 11 years ago
Mišo Belica dcb7c18fd5 Refactored file 'document.py'
Removed non-intuitive parts and dead code
not covered by tests. Better names for objects.
Better coverage by tests.
11 years ago
Mišo Belica 03ff0be266 Moved client script into 'breadability.scripts' 11 years ago
Mišo Belica ec88a4efe6 Use docopt as an argument parser 11 years ago
Mišo Belica 8470ef2b45 Purification of file readable.py 11 years ago
Mišo Belica 2e2e906da7 Purification of document.py 11 years ago
Mišo Belica 9f0fc2d433 Purification 11 years ago
Mišo Belica baaefeda3c Refactored computing of link density 11 years ago
Mišo Belica 3f71e1b7d4 Refactored checking of node's attribute 11 years ago
Mišo Belica 636a38d705 Refactored generating of hash ID 11 years ago
Mišo Belica cc00976533 Replace implementation of 'cached_property'
Parameter 'ttl' isn't needed.
11 years ago
Mišo Belica c69cd4b2ba Purification 11 years ago
Mišo Belica 101950478e Simplify logging 11 years ago
Mišo Belica 3322681166 Use 'charade' for detecting encoding 11 years ago
Mišo Belica 544220e9a3 Replaced u"" literal wit function 'to_unnicode'
Literal u"" is not supported by Python v3.2.
11 years ago
Mišo Belica 94f6b0a84e Tests passes for both Python v2.7, v3.3 11 years ago
Mišo Belica c4dbe24a65 New repository structure 11 years ago