Commit Graph

82 Commits

Author SHA1 Message Date
Richard Harding
745598dff9 Update news file with initial release 2012-05-06 20:47:24 -04:00
Richard Harding
279788c003 Update the readme for install info 2012-05-06 20:45:44 -04:00
Richard Harding
9e6835bd92 Work on tweaking out parser algorithm to help find the right candidate: fixes #2 2012-05-06 20:34:42 -04:00
Richard Harding
b78ea49c5a Update readme so people don't misunderstand 2012-05-06 19:57:03 -04:00
Richard Harding
454e283850 Add link to readability 2012-05-06 19:55:04 -04:00
Richard Harding
d52d99f6b0 More readme tweaks 2012-05-06 19:53:59 -04:00
Richard Harding
773361efd9 Update readme with some real content 2012-05-06 19:52:59 -04:00
Richard Harding
7d2eec8f52 Add the conditional node checking during node cleaning 2012-05-06 19:41:30 -04:00
Richard Harding
14bbe701eb Add some more debugging to support tracing wtf we did and why 2012-05-06 13:46:01 -04:00
Richard Harding
00ba7e5164 Start to add debugging process for the library/client 2012-05-06 09:04:15 -04:00
Richard Harding
e7873d3d92 Profile and adjust for performance, add bugfix to parse out mitechie blog post 2012-05-06 00:38:47 -04:00
Richard Harding
6b16b7b21f Start to add scoring file specific tests 2012-05-05 23:26:30 -04:00
Richard Harding
ab79d9632b Some refactoring starts to help us org tests/code 2012-05-05 21:31:36 -04:00
Richard Harding
ccac04e567 Add some cleaning/post processing of our target
- Starting to look decent
- Still need to port their cleanConditionally but going to have to think on
that
- Removes spare paragraphs, does some other cleaning tweaks
2012-05-05 20:52:15 -04:00
Richard Harding
19a38a2cea Add support for sibling detection, need to figure out how to test it well still 2012-05-05 14:41:12 -04:00
Richard Harding
4455ec226d Fix logic in the changing of body -> div 2012-05-05 13:09:45 -04:00
Richard Harding
5c1765a6ef Update cmd line client/interface, update doc builders
- For now we're always getting a div back from the parser
- Update the client code, not all flags are enabled, but basic passing a url
works
2012-05-05 13:08:24 -04:00
Richard Harding
5b3ef916ef Update to add link density scoring adjustments, prep for sibling checks 2012-05-05 08:07:13 -04:00
Richard Harding
e843940549 Garden 2012-05-04 22:54:30 -04:00
Richard Harding
8e96cb7844 Update tests for scoring, returning div/html doc depending on the found content 2012-05-04 22:46:37 -04:00
Richard Harding
60ab4a96b0 Fix tests to pass again 2012-05-04 17:18:30 -04:00
Richard Harding
8f28e7c947 Add processing of content per the algorithm with some base tests 2012-05-04 16:07:52 -04:00
Richard Harding
7960264c3b Make sure we return body with our css class on it 2012-05-04 13:54:58 -04:00
Richard Harding
e93a52a748 Start to add some processing for the readable contnet
- Add removal of style, script, etc bits in the content
2012-05-03 22:11:46 -04:00
Richard Harding
2e7fb0aa89 Rework document into its own file 2012-05-03 21:42:58 -04:00
Richard Harding
ac053979a9 Add support for links, absoluting links
- Add a test that we absolute correctly
- Add a links cached attribute to get all links in the doc
2012-05-03 21:36:48 -04:00
Richard Harding
590a94345f Start to add some basic tests and layout to use for breaking down documents. 2012-05-02 23:57:49 -04:00
Richard Harding
5e95f531bc start to add some initial target test articles 2012-05-02 23:22:36 -04:00
Richard Harding
31c4439155 Start to add makefile for running life 2012-05-02 23:06:05 -04:00
Richard Harding
b70dec4332 adding bits...ignore these commits for a while 2012-05-02 21:52:31 -04:00
Richard Harding
1b95af78c5 Initial bootstrap of modern package template 2012-05-02 21:43:58 -04:00
Rick Harding
84de8f5078 initial commit 2012-05-02 18:37:00 -07:00