Richard Harding
745598dff9
Update news file with initial release
2012-05-06 20:47:24 -04:00
Richard Harding
279788c003
Update the readme for install info
2012-05-06 20:45:44 -04:00
Richard Harding
9e6835bd92
Work on tweaking our parser algorithm to help find the right candidate: fixes #2
2012-05-06 20:34:42 -04:00
Richard Harding
b78ea49c5a
Update readme so people don't misunderstand
2012-05-06 19:57:03 -04:00
Richard Harding
454e283850
Add link to readability
2012-05-06 19:55:04 -04:00
Richard Harding
d52d99f6b0
More readme tweaks
2012-05-06 19:53:59 -04:00
Richard Harding
773361efd9
Update readme with some real content
2012-05-06 19:52:59 -04:00
Richard Harding
7d2eec8f52
Add the conditional node checking during node cleaning
2012-05-06 19:41:30 -04:00
Richard Harding
14bbe701eb
Add some more debugging to support tracing wtf we did and why
2012-05-06 13:46:01 -04:00
Richard Harding
00ba7e5164
Start to add debugging process for the library/client
2012-05-06 09:04:15 -04:00
Richard Harding
e7873d3d92
Profile and adjust for performance, add bugfix to parse out mitechie blog post
2012-05-06 00:38:47 -04:00
Richard Harding
6b16b7b21f
Start to add scoring file specific tests
2012-05-05 23:26:30 -04:00
Richard Harding
ab79d9632b
Some refactoring starts to help us org tests/code
2012-05-05 21:31:36 -04:00
Richard Harding
ccac04e567
Add some cleaning/post processing of our target
- Starting to look decent
- Still need to port their cleanConditionally but going to have to think on that
- Removes spare paragraphs, does some other cleaning tweaks
2012-05-05 20:52:15 -04:00
Richard Harding
19a38a2cea
Add support for sibling detection, need to figure out how to test it well still
2012-05-05 14:41:12 -04:00
Richard Harding
4455ec226d
Fix logic in the changing of body -> div
2012-05-05 13:09:45 -04:00
Richard Harding
5c1765a6ef
Update cmd line client/interface, update doc builders
- For now we're always getting a div back from the parser
- Update the client code; not all flags are enabled, but basic passing of a url works
2012-05-05 13:08:24 -04:00
Richard Harding
5b3ef916ef
Update to add link density scoring adjustments, prep for sibling checks
2012-05-05 08:07:13 -04:00
Richard Harding
e843940549
Garden
2012-05-04 22:54:30 -04:00
Richard Harding
8e96cb7844
Update tests for scoring, returning div/html doc depending on the found content
2012-05-04 22:46:37 -04:00
Richard Harding
60ab4a96b0
Fix tests to pass again
2012-05-04 17:18:30 -04:00
Richard Harding
8f28e7c947
Add processing of content per the algorithm with some base tests
2012-05-04 16:07:52 -04:00
Richard Harding
7960264c3b
Make sure we return body with our css class on it
2012-05-04 13:54:58 -04:00
Richard Harding
e93a52a748
Start to add some processing for the readable content
- Add removal of style, script, etc bits in the content
2012-05-03 22:11:46 -04:00
Richard Harding
2e7fb0aa89
Rework document into its own file
2012-05-03 21:42:58 -04:00
Richard Harding
ac053979a9
Add support for links, absoluting links
- Add a test that we absolute correctly
- Add a links cached attribute to get all links in the doc
2012-05-03 21:36:48 -04:00
Richard Harding
590a94345f
Start to add some basic tests and layout to use for breaking down documents.
2012-05-02 23:57:49 -04:00
Richard Harding
5e95f531bc
start to add some initial target test articles
2012-05-02 23:22:36 -04:00
Richard Harding
31c4439155
Start to add makefile for running life
2012-05-02 23:06:05 -04:00
Richard Harding
b70dec4332
adding bits...ignore these commits for a while
2012-05-02 21:52:31 -04:00
Richard Harding
1b95af78c5
Initial bootstrap of modern package template
2012-05-02 21:43:58 -04:00
Rick Harding
84de8f5078
initial commit
2012-05-02 18:37:00 -07:00