You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
15 lines
498 B
Plaintext
15 lines
498 B
Plaintext
This code is under the Apache License 2.0. http://www.apache.org/licenses/LICENSE-2.0
|
|
|
|
This is a python port of a ruby port of arc90's readability project
|
|
|
|
http://lab.arc90.com/experiments/readability/
|
|
|
|
Given a html document, it pulls out the main body text and cleans it up.
|
|
|
|
Ruby port by starrhorne and iterationlabs
|
|
Python port by gfxmonk
|
|
|
|
This port uses BeautifulSoup for the HTML parsing. That means it can be
|
|
a little slow, but will work on Google App Engine (unlike libxml-based
|
|
libraries)
|