2016-08-24 14:00:15 +00:00
|
|
|
Next: Continue working on paragraphize; move p tags outside other p tags (do this when not converting br)
|
2016-08-23 17:06:29 +00:00
|
|
|
|
|
|
|
- `extract` (this kicks it all off)
|
|
|
|
x `node_is_sufficient`
|
|
|
|
- `_extract_best_node`
|
|
|
|
x `get_weight`
|
2016-08-23 19:15:12 +00:00
|
|
|
x `_strip_unlikely_candidates`
|
2016-08-24 14:51:20 +00:00
|
|
|
x `_convert_to_paragraphs`
|
2016-08-24 14:00:15 +00:00
|
|
|
x `_brs_to_paragraphs`
|
|
|
|
x `_paragraphize`
|
2016-08-23 17:06:29 +00:00
|
|
|
|
|
|
|
## Scoring
|
|
|
|
|
2016-08-24 19:30:16 +00:00
|
|
|
x `_get_score`
|
|
|
|
x `_set_score`
|
|
|
|
x `_add_score`
|
2016-08-23 17:06:29 +00:00
|
|
|
- `_score_content`
|
2016-08-24 19:30:16 +00:00
|
|
|
x `_score_node`
|
|
|
|
x `_score_paragraph`
|
2016-08-23 17:06:29 +00:00
|
|
|
|
|
|
|
## Top Candidate
|
|
|
|
|
|
|
|
- `_find_top_candidate`
|
|
|
|
- `extract_clean_node`
|
|
|
|
- `_clean_conditionally`
|
2016-08-24 19:35:27 +00:00
|
|
|
|
|
|
|
|
|
|
|
Add helper methods to clean up tests
|