Commit Graph

122 Commits

Author SHA1 Message Date
Yuri Baburov
86cde20ba6 Merge pull request #84 from lwm/add-travis-file
Add travis file
2016-07-15 21:10:40 +07:00
Luke Murphy
a1d6bbcd3f add travis file 2016-07-14 22:52:13 +02:00
Yuri Baburov
75f2ea0d00 Version bump to 0.6.2 2016-04-09 14:34:00 +06:00
Yuri Baburov
47e473fb91 Merge pull request #73 from alphapapa/patch-1
Using Mozilla User-Agent by default
2016-04-09 14:29:07 +06:00
alphapapa
8443a87f5c Update readability.py 2016-04-03 21:38:17 -05:00
alphapapa
5fc2d3684a Use Mozilla User-Agent
Use a "Mozilla" user-agent to avoid HTTP 403 errors.  Fixes #71.
2016-04-03 21:32:36 -05:00
Yuri Baburov
65d1ebb06d Fixed #70 and added xpath option 2015-09-29 18:40:17 +02:00
Yuri Baburov
fae95bad45 Bump to v0.6.1 -- fixed logging 2015-08-26 16:14:04 +06:00
Yuri Baburov
c0d794fdd8 Update readability.py
Fixed logging namespace
2015-08-26 15:11:12 +05:00
Yuri Baburov
398f6ad748 Bump to 0.6.0.5 2015-07-27 12:00:58 +06:00
Yuri Baburov
8ff11e68a6 Debugging improvements. Bump to 0.6.0.5 2015-07-27 11:59:17 +06:00
Yuri Baburov
fcdbe563a5 Fixed #49. Bump to 0.6.0.4 2015-07-27 10:06:28 +06:00
Yuri Baburov
c51886d923 Bump to 0.6.0.3
Fixed installation.
2015-07-27 09:38:44 +06:00
Yuri Baburov
24bb20c761 Added dev branch features.
Bumped to version 0.6
2015-07-27 00:22:45 +06:00
Yuri Baburov
154658798b Merge pull request #64 from martinth/master
Added python 3 support (Supported: python 2.6, 2.7, 3.3, 3.4).
Thanks a lot to @martinth
2015-07-26 14:11:37 +05:00
Yuri Baburov
83a7ce67c1 Merge pull request #68 from digitaldavenyc/python3
fix for setup, convert print to python 3 compatible format
2015-07-24 11:47:41 +05:00
Dave Padovano
1ac3e019bd fix for setup, convert print to python 3 compatible format 2015-07-23 14:05:12 -04:00
Yuri Baburov
1aabdb3d27 Merge pull request #67 from horva/fix-logging-config
Move logging.basicConfig to main function
2015-07-13 09:10:35 +05:00
Marko Horvatic
f0ff9b2425 Move logging.basicConfig to main function 2015-06-24 16:21:04 +02:00
Yuri Baburov
e2bc1ea055 Improved #65 which has given warning, added cssselect lib, bumped to 0.5.1 2015-05-06 14:33:14 +06:00
Yuri Baburov
1cb17d919b Merge pull request #65 from avalanchy/best_elem_is_root
Failure if best_elem is root (fix #58)
Thanks a lot @avalanchy and @jnothman !
2015-05-06 12:44:41 +05:00
Mariusz Osiecki
bf9e7404fa Failure if best_elem is root (fix #58) 2015-05-06 09:34:55 +02:00
Martin Thurau
386e48d29b Fixes checking of declared encodings in get_encoding.
In PYthon 3 .decode() on bytes requires the name of the encoding to be a str type which means we have to convert the extracted encoding before we can use it.
2015-04-30 11:47:32 +02:00
Martin Thurau
046d2c10c3 Fixes regex declaration in get_encoding.
Since get_encoding() is only called when the input is *not* already unicode we need to declare the regexs as byte type so they continue to work in Python 3.
2015-04-29 23:36:50 +02:00
Martin Thurau
ce7ca26835 Adds compatibility raise_with_traceback method to support different raise syntax
Unfortunately the Python 2 `raise` syntax is not supported in Python 3.3 and not all 3.4.x versions so we deal with that by using conditional imports and a compatibility layer.
2015-04-29 23:35:18 +02:00
Martin Thurau
3ac56329e2 Corrects some things were 2to3 did to much. 2015-04-29 19:33:43 +02:00
Martin Thurau
aa4132f57a Adds Python 3.4 support.
Code now supports Python 2.6, 2.7 and 3.4. PYthon 3.3 isn't support
because of some issues with the parser and the difference between old and
new `raise` syntax.
2015-04-29 16:18:21 +02:00
Martin Thurau
13cca1dd19 Adds tox configuration.
Adds tox.ini to support running the tests on multiple versions. Adds
requirements.txt to support dependency installtion via pip.
2015-04-29 16:16:46 +02:00
Yuri Baburov
1d4ee9d421 Releasing as version 0.5 2015-04-27 16:00:08 +06:00
Yuri Baburov
987570bef0 Updated package links for Python 2.7 and Python 3 support 2015-04-27 15:59:31 +06:00
Yuri Baburov
dc648e7d0b Added a test for issue #48 but can't reproduce it -- seems to work fine. 2015-04-27 15:59:18 +06:00
Yuri Baburov
c715426584 Releasing as version 0.4 2015-04-27 14:54:13 +06:00
Yuri Baburov
1fac7e685a Added a feature to allow more images per article (with a test) 2015-04-27 14:35:00 +06:00
Yuri Baburov
c6796195a7 Fixed makefile testing. 2015-04-27 14:32:40 +06:00
Miguel Galves
d04d41b749 Insert text inside iframe for correct output 2015-04-27 14:05:31 +06:00
Miguel Galves
be2a1c4646 Let width and height attributes 2015-04-27 13:52:25 +06:00
Miguel Galves
f1759c1404 Allows iframes containing youtube or vimeo videos. People like them 2015-04-27 13:52:01 +06:00
Yuri Baburov
332ad810de Bumped to 0.3.0.6 2015-03-16 21:38:17 +05:00
Yuri Baburov
e4bcbe57d7 Fixes #53 2015-03-16 22:19:36 +06:00
Yuri Baburov
aeb4f4c782 Merge pull request #59 from seomoz/mac_10_10
Fix mac version comparison in setup.py for 10.10
2015-01-13 17:41:30 +05:00
Matthew Peters
c8c2f8809c Fix mac version comparison in setup.py for 10.10 2015-01-12 22:19:09 -08:00
Yuri Baburov
2d4cfdb2c8 Merge pull request #56 from nathanathan/patch-1
Defaulting to utf-8 when chardet returns None
2014-12-20 02:11:53 +05:00
Nathan Breit
75e2e0cb3a Defaulting to utf-8 when chardet returns None
On articles like this one chardet returns None:
http://news.zing.vn/nhip-song-tre/thay-giao-gay-sot-tung-bo-luat-tinh-yeu/a291427.html
This causes exceptions later on when encoding.lower() is called
2014-12-18 18:48:22 -08:00
Yuri Baburov
0c2f29ed0d Version bump. 2014-09-22 15:32:46 +07:00
Yuri Baburov
638f73f6a2 Fix for #52: <input type="hidden"> are not counted any more for "form removal" heuristic. 2014-09-22 15:31:31 +07:00
Yuri Baburov
2fab5ffa6b Merge pull request #48 from mperdomo1/master
Added code to check declared encodings first
2014-05-18 15:27:06 +07:00
Mark Perdomo
3a43a3fe7e Added code to check declared encodings first and check them
from kennethreitz/requests/utils.py.  Also I added some superset
encodings I have found in Chinese pages that are mishandled by
chardet/character declarations.
2014-05-13 15:09:47 +08:00
Yuri Baburov
1a4d3697bc Allow latest lxml on Mac OS X 10.9, see issue #39 for comments and setup instructions 2014-04-02 15:16:19 +07:00
Yuri Baburov
d8595b7103 Quickfix for #41 2013-10-10 13:47:58 +07:00
Yuri Baburov
318f25c577 Minor fix in encoding guessing. Claiming it v0.3.0.1 2013-10-10 02:57:53 +07:00