Richard Harding
|
19a38a2cea
|
Add support for sibling detection, need to figure out how to test it well still
|
13 years ago |
Richard Harding
|
4455ec226d
|
Fix logic in the changing of body -> div
|
13 years ago |
Richard Harding
|
5c1765a6ef
|
Update cmd line client/interface, update doc builders
- For now we're always getting a div back from the parser
- Update the client code, not all flags are enabled, but basic passing a url
works
|
13 years ago |
Richard Harding
|
5b3ef916ef
|
Update to add link density scoring adjustments, prep for sibling checks
|
13 years ago |
Richard Harding
|
e843940549
|
Garden
|
13 years ago |
Richard Harding
|
8e96cb7844
|
Update tests for scoring, returning div/html doc depending on the found content
|
13 years ago |
Richard Harding
|
60ab4a96b0
|
Fix tests to pass again
|
13 years ago |
Richard Harding
|
8f28e7c947
|
Add processing of content per the algorithm with some base tests
|
13 years ago |
Richard Harding
|
7960264c3b
|
Make sure we return body with our css class on it
|
13 years ago |
Richard Harding
|
e93a52a748
|
Start to add some processing for the readable contnet
- Add removal of style, script, etc bits in the content
|
13 years ago |
Richard Harding
|
2e7fb0aa89
|
Rework document into its own file
|
13 years ago |
Richard Harding
|
ac053979a9
|
Add support for links, absoluting links
- Add a test that we absolute correctly
- Add a links cached attribute to get all links in the doc
|
13 years ago |
Richard Harding
|
590a94345f
|
Start to add some basic tests and layout to use for breaking down documents.
|
13 years ago |
Richard Harding
|
5e95f531bc
|
start to add some initial target test articles
|
13 years ago |
Richard Harding
|
31c4439155
|
Start to add makefile for running life
|
13 years ago |
Richard Harding
|
b70dec4332
|
adding bits...ignore these commits for a while
|
13 years ago |
Richard Harding
|
1b95af78c5
|
Initial bootstrap of modern package template
|
13 years ago |
Rick Harding
|
84de8f5078
|
initial commit
|
13 years ago |