Commit Graph

24 Commits

Author SHA1 Message Date
Marc Abonce Seguin
343e555ee9 [fix] append http if no scheme is provided in xpath's extact_url
This solves a bug with Yahoo where some results don't specify
a protocol.
2018-04-08 20:35:34 -05:00
Adam Tauber
1972a044a3 [fix] produce valid urls if scheme is missing 2017-05-22 15:48:37 +02:00
Adam Tauber
52e615dede [enh] py3 compatibility 2017-05-15 12:02:30 +02:00
David A Roberts
7492997c51 [fix] allow empty content 2017-01-17 21:14:33 +10:00
Alexandre Flament
90e1db3e5c [fix] extract_text: use html.tostring instead html_to_text. Fix #711 2016-12-31 13:56:09 +01:00
David A Roberts
1e9dab08e6 [fix] behaviour for page_size>1 and first_page_num>0
eg. pageno=1,21,41,... instead of 20,40,60,...
2016-08-14 22:10:25 +10:00
Kirill Isakov
bacc9a3df1 Add paging support to XPath & Erowid engines 2016-03-28 19:15:03 +06:00
Adam Tauber
bd22e9a336 [fix] pep8 compatibilty 2016-01-18 12:47:31 +01:00
Cqoicebordel
44c9216c49 Sanitize extract_text 2015-01-25 20:04:44 +01:00
potato
6f535b6fae [fix] error when xpath_results in extraxt_text is _ElementUnicodeResult instead of _ElementStringResult 2014-03-04 19:43:41 +01:00
asciimoo
c1d7d30b8e [mod] len() removed from conditions 2014-02-11 13:13:51 +01:00
asciimoo
b647244abf [fix] function parameters 2014-01-30 03:10:20 +01:00
asciimoo
3dcb835910 [fix] function parameters 2014-01-30 02:36:05 +01:00
asciimoo
fe82637eac [enh] importable url extractor 2014-01-30 02:32:58 +01:00
asciimoo
59eeeaab87 [fix] html tag removal 2014-01-23 11:08:08 +01:00
asciimoo
b2492c94f4 [fix] pep/flake8 compatibility 2014-01-20 02:31:20 +01:00
asciimoo
060ea4d2f5 [fix] whitespaces removed 2014-01-12 18:48:38 +01:00
Dalf
3dc3fc7770 [mod][fix] xpath engine simplified, yahoo engine never returns truncated urls 2014-01-05 14:06:52 +01:00
dalf
664c039b38 xpath engine: bug fix 2013-12-30 22:34:35 +01:00
asciimoo
e50a72b0e3 [enh] suggestion support for xpath engine 2013-11-13 19:33:09 +01:00
asciimoo
17bf00ee42 [enh] removing result html tags 2013-11-09 18:39:20 +01:00
asciimoo
7965da55a7 [fix] urlparsing fix 2013-10-27 12:01:03 +01:00
asciimoo
5d764f95cf [enh] xpath engine absolute xpath support 2013-10-26 13:45:43 +02:00
asciimoo
badd988545 [enh] xpath engine added 2013-10-26 02:22:20 +02:00