mirror of
https://github.com/postlight/mercury-parser
synced 2024-11-05 12:00:13 +00:00
508 B
508 B
Next: Continue working on paragraphize; move p tags outside other p tags (do this when not converting br)
extract
(this kicks it all off) xnode_is_sufficient
_extract_best_node
xget_weight
x_strip_unlikely_candidates
_convert_to_paragraphs
x_brs_to_paragraphs
x_paragraphize
Scoring
_get_score
_set_score
_add_score
_score_content
_score_node
_score_paragraph
Top Candidate
_find_top_candidate
extract_clean_node
_clean_conditionally