Mišo Belica
cf781bc595
Updated implementation of cached property
...
Cached value of properties are stored
in instance's '__dict__'.
12 years ago
Mišo Belica
4e3227521e
Fewer code - fewer bugs (I hope)
12 years ago
Mišo Belica
1a5970b238
Better names and positions for variables
12 years ago
Mišo Belica
930b6ced12
Fixed transformation of leaf <div> into <p>
12 years ago
Mišo Belica
314c999730
Drop useless tags by HTML cleaner
12 years ago
Mišo Belica
272fe480a3
Updated setup.py
12 years ago
Mišo Belica
9eacbd579c
Updated LICENSE, AUTHORS, README
12 years ago
Mišo Belica
18b5c9b447
Refactored file 'scoring.py'
12 years ago
Mišo Belica
dcb7c18fd5
Refactored file 'document.py'
...
Removed non-intuitive parts and dead code
not covered by tests. Better names for objects.
Better coverage by tests.
12 years ago
Mišo Belica
03ff0be266
Moved client script into 'breadability.scripts'
12 years ago
Mišo Belica
c92f61fa53
Fixed docopt version
12 years ago
Mišo Belica
ec88a4efe6
Use docopt as an argument parser
12 years ago
Mišo Belica
8470ef2b45
Purification of file readable.py
12 years ago
Mišo Belica
b3b987440d
Added test runner via nosetests
12 years ago
Mišo Belica
2e2e906da7
Purification of document.py
12 years ago
Mišo Belica
9f0fc2d433
Purification
12 years ago
Mišo Belica
baaefeda3c
Refactored computing of link density
12 years ago
Mišo Belica
3f71e1b7d4
Refactored checking of node's attribute
12 years ago
Mišo Belica
636a38d705
Refactored generating of hash ID
12 years ago
Mišo Belica
9a613317c0
Make package from tests
12 years ago
Mišo Belica
cc00976533
Replace implementation of 'cached_property'
...
Parameter 'ttl' isn't needed.
12 years ago
Mišo Belica
e3b6ee2fd6
Suppress warning "ResourceWarning: unclosed file"
12 years ago
Mišo Belica
c69cd4b2ba
Purification
12 years ago
Mišo Belica
101950478e
Simplify logging
12 years ago
Mišo Belica
81be8ccbfb
Updated readme
12 years ago
Mišo Belica
9f83ea973a
Fixed setup.py
12 years ago
Mišo Belica
726fe59ecd
Show build status from master branch [ci skip]
12 years ago
Mišo Belica
c7299b9852
Updated makefile [ci skip]
12 years ago
Mišo Belica
671d940ded
Removed branches from Travis configuration
...
[ci skip]
12 years ago
Mišo Belica
ea90ee5a5e
Updated changelog [ci skip]
12 years ago
Mišo Belica
c89010221e
Changed/renamed/added AUTHORS, CHANGELOG, LICENSE
...
[ci skip]
12 years ago
Mišo Belica
d31d804167
Exclude coverage file from repo
12 years ago
Mišo Belica
231d251536
Added commands test into README
12 years ago
Mišo Belica
3322681166
Use 'charade' for detecting encoding
12 years ago
Mišo Belica
544220e9a3
Replaced u"" literal wit function 'to_unnicode'
...
Literal u"" is not supported by Python v3.2.
12 years ago
Mišo Belica
915876b675
Added Travis status image to README
12 years ago
Mišo Belica
8c79d4c04b
Set white-list branches for @travisbot
12 years ago
Mišo Belica
94f6b0a84e
Tests passes for both Python v2.7, v3.3
12 years ago
Mišo Belica
912bb50b76
Skip failing test that I don't know how to fix
12 years ago
Mišo Belica
c4dbe24a65
New repository structure
12 years ago
Richard Harding
75b3151de9
Update the unittest import to grab unittest2 for 2.6
12 years ago
Richard Harding
84f6a079f9
Try to adjust the travis command to test py2.6
12 years ago
Richard Harding
b18589ced8
Use the right package doh
12 years ago
Richard Harding
316c550709
Add python 2.6 to the travis ci
12 years ago
Richard Harding
fee5c37b39
Add argparse as a install req for py <2.7
12 years ago
Richard Harding
3dea2f349b
Update ignore file
12 years ago
Nathan Nifong
920094c81a
Add a penalty for double quote chars in paragraphs.
...
- They are far more common in random commented code and proprietary metadata
that keeps slipping by the filter as actual content.
- Downgraded the score value of commas for the same reason.
- Prep for 0.1.10 release with these changes.
Add credits and tweak the " and , scoring
Update version and update the scoring code
12 years ago
Richard Harding
60da675da5
Reprocess without candidate in case of errors using one
...
- Fixes #10
12 years ago
Richard Harding
3984e04668
Add better handling around xml parsing issues
...
- Fixes #9 with empty/non parsable docs
- Fixes #8 and removes kwargs for the decode statements.
- Fixes #7 by checking if the node has a parent before dropping.
12 years ago
Richard Harding
fe9364295f
prep for 0.1.7 release
12 years ago