Author | Commit | Message | Age
Andrey Popp | 95852d5c18 | readability.htmls: some docs do not have title elem | 13 years ago
Yuri Baburov | c2ec1d1c38 | Sorted out unicode issues, thanks to Lee Semel. | 13 years ago
Lee Semel | f3d0a8d842 | Allow passing unicode objects | 13 years ago
Yuri Baburov | 43c34bacc1 | Renamed encodings to encoding to avoid conflicts with system module. | 14 years ago
Yuri Baburov | 96f476181c | Improved title shortener method, and added it to the Document class. | 14 years ago
Yuri Baburov | dada82099b | Moved to lxml (based on decruft version); better encoding recognition. | 14 years ago