Commit Graph

925 Commits (9f5cd28dba3c923ade7e482615611ac055c2918a)

Author SHA1 Message Date
rinpatch e3bd2670d7
Traling whitespace 6 years ago
rinpatch 86f47689fe
Hopefully fix code style errors(again)(again) 6 years ago
rinpatch b8b23d8006
Hopefully fix code style errors(again) 6 years ago
rinpatch fb364ffae7
Hopefully fix code style errors 6 years ago
rinpatch 3ec35a06f7
Added Engine 6 years ago
rinpatch cd1374d2f2 added whitespaces 6 years ago
rinpatch a305dae9d8 switched currency convert to duckduckgo and added Bitcoin to currency list 6 years ago
Noémi Ványi 25b48143ea fix findx engine 6 years ago
Marc Abonce Seguin f7f9c50393 [fix] force English results in Google when using en-US 6 years ago
Marc Abonce Seguin b12857a70d [fix] make search requests on wikidata more accurate 6 years ago
Marc Abonce Seguin 835d1edd58 [fix] google news xpath 6 years ago
Marc Abonce Seguin 343e555ee9 [fix] append http if no scheme is provided in xpath's extact_url
This solves a bug with Yahoo where some results don't specify
a protocol.
6 years ago
Adam Tauber 283f6c9053
Merge pull request #1252 from MarcAbonce/search-languages
[mod] Refactor engine's search language handling
6 years ago
Marc Abonce Seguin 772c048d01 refactor engine's search language handling
Add match_language function in utils to match any user given
language code with a list of engine's supported languages.

Also add language_aliases dict on each engine to translate
standard language codes into the custom codes used by the engine.
6 years ago
Noémi Ványi 8cc529e9a3 forward category to engine without highlighting on the ui 6 years ago
Noémi Ványi d04e471ce5 add findx engine for general, images and videos 6 years ago
Marc Abonce Seguin d1eae9359f fix fetch_langauges to be more accurate
Add languages supported by either all default general engines or 10 engines.
6 years ago
Marc Abonce Seguin 216716db70 [fix] change duckduckgo url to avoid error response 6 years ago
Adam Tauber 03d7a8f065 [fix] pep8 6 years ago
Adam Tauber b4c657d227 [enh] add asksteem engine - closes #1163 6 years ago
Noémi Ványi 988cf38196 fix Microsoft Academic engine 6 years ago
Adam Tauber 2f69eaeb2f [fix] fix engine initialization 6 years ago
Noémi Ványi c645915171 fix bing videos engine 6 years ago
Marc Abonce Seguin 829032f306 [fix] read utf-8 files (settings, languages, currency) with python3.5
Related to discussion in #1124
The io.open import is necessary for python2
7 years ago
Adam Tauber 8511e64f35
Merge pull request #1124 from JosephKiranBabu/python3-unicode-support
Make Python 3 able to read settings files with Unicode characters
7 years ago
Noémi Ványi f3aa5ea47d fix yacy result url
Closes #1064
7 years ago
Joseph Nuthalapati bdc803e185
Make Python 3 able to read settings files with Unicode characters
SearX currently doesn't start up when run with Python 3 as it tries to parse the
settings.yml file with ASCII codecs.
There are similar problems with engines_languages.json and currencies.json
Python 3 requires that files with Unicode characters be read with a 'b' flag.
This also works with Python 2 and hence can be integrated into the main source
code.

Tested with the latest Python 3.6.4rc1 on Debian unstable.

Signed-off-by: Joseph Nuthalapati <njoseph@thoughtworks.com>
7 years ago
Noémi Ványi 2d5eed9b59 send constant cookie with query to Google 7 years ago
Adam Tauber 7752b0a807 [fix] gigablast url parameter change - fixes #1107 7 years ago
marc 4d1770398a remove 'all' option from search languages 7 years ago
Adam Tauber 1613c6319e [fix] handle /sorry redirects 7 years ago
Adam Tauber 1088c2a75d [fix] do not crash if publication date is missing in pubmed engine 7 years ago
Adam Tauber 801b5a56ad [fix] multiple fixes in pdbe engine 7 years ago
Adam Tauber 0969e50c5b [fix] convert json engine result attributes to string - closes #1006 7 years ago
Adam Tauber 6eb9503896 [fix] use english in google engine if no language was set - this prevents guessing the language by the IP of the instance 7 years ago
Adam Tauber 6fdb6640d9 [fix] revert language changes to prevent CAPTCHAs 7 years ago
Adam Tauber 9ab8536479 [fix] fix language support of google 7 years ago
pyrrh0n1c 4340c0b16c Fixed typos. 7 years ago
pyrrh0n1c 2231b02add Fixed the currency_convert engine. 7 years ago
Adam Tauber 00a7041daa
Merge pull request #1068 from Apply55gx/genius
Continuation of PR #1004 (seems abandoned)
7 years ago
Adam Tauber 7a9b18e9e9 [mod] add more error handling to json engine II. 7 years ago
Adam Tauber 181f1c6305 [mod] add more error handling to json engine 7 years ago
Noémi Ványi 5954a8e16a minor fix of BASE engine 7 years ago
Noémi Ványi d20bba6dc7 minor fixes of pubmed engine
Closes #1045
7 years ago
jibe-b df0d915806 [add] pubmed engine 7 years ago
Noémi Ványi 9c2b7a82f0 minor fixes of arxiv
Closes #1050
7 years ago
jibe-b e391b2d970 [fix] remove .encode for python3 compatibility 7 years ago
jibe-b 5278fa666c [enh] use format to concatenate strings 7 years ago
jibe-b 3e3672e079 [add] arxiv engine 7 years ago
Apply55gx d800e3fcfa Merge pull request #1 from asciimoo/master
-
7 years ago
Apply55gx 18a4e7035f removed unused indicies array 7 years ago
marc 44085e31d0 update engines_languages.json and languages.py
Also, fix fetch_languages.py so it can run on python3.
7 years ago
marc a524dbb823 [fix] language support for bing images and videos 7 years ago
misnyo 33fd938016 [mod] int_or_zero refactored to searx_utils 7 years ago
misnyo c3232b0e1a Merge branch 'master' into nyaa 7 years ago
Adam Tauber 234366b900 Merge pull request #1018 from misnyo/generalfile
[mod]generalfile engine removed
7 years ago
Adam Tauber c505c3c1ca Merge pull request #1019 from misnyo/gigablast
[fix] gigablast api fix
7 years ago
misnyo b91772eff1 [fix] gigablast pep8 fix 7 years ago
misnyo 01330f71cd [fix] nyaa.si fixed 7 years ago
misnyo 1e2b60f380 [mod] tokyotoshokan dependecy to other engine removed 7 years ago
misnyo 3182ba7069 [fix] google news dom xpath fix 7 years ago
misnyo 8849606e5f [fix] gigablast api fix 7 years ago
misnyo 2d66df0548 [mod]generalfile engine removed 7 years ago
Adam Tauber e74aaa781e Merge pull request #1016 from misnyo/blekko
[mod]blekko images removed
7 years ago
Adam Tauber 610117d054 Merge pull request #1015 from misnyo/digg
[fix] digg now requires cookie for search
7 years ago
misnyo 6e92fb383a [fix] digg cookie characters created in module import 7 years ago
misnyo 20d2d4dab1 [fix] pep8 fix for faroo engine 7 years ago
misnyo 0607b167f8 [fix] faroo json api and image layout fixed 7 years ago
misnyo a49cc18f9c [fix] digg now requires cookie for search 7 years ago
misnyo f139f9cfdb [mod]blekko images removed 7 years ago
misnyo d14e2781b2 [fix]torrentz search engine fixed for new version 7 years ago
woorst 2434c29dc5 New engine: Genius (lyrics) 7 years ago
marc 3ca9cad927 add bing videos engine 7 years ago
marc 856dfc3018 add google videos 7 years ago
Adam Tauber 077d8efed8 [fix] use poolrequests in ddg engines 7 years ago
Adam Tauber 0f6612bb40 [mod] separate engine load and initialization 7 years ago
Adam Tauber 1794f6a4d3 [enh] add "inactive" attribute to engines
This modification allows us to deactivate engines in settings.yml
without commenting them out
7 years ago
marc 405e5c8f24 [fix] duckduckgo images doesn't fail with countryless language 7 years ago
potato 9b82cb1908 [fix] is_valid_lang fixed for new languages.py + dictzone engine encoding 7 years ago
vache asatryan 5835ebf955 fix github url 7 years ago
Noémi Ványi c361811cb5 [fix] fix xpath of google images 7 years ago
Adam Tauber 343ac7197d [fix] pep8 7 years ago
Adam Tauber 78365ffb8a [enh] add init function to engines which loads parallel 7 years ago
marc 3b950929b1 [fix] fix duckduckgo's offset
First page now starts with 0 offset,
rather than starting on the 30th result.

DuckDuckGo returns 30 results on each page.
7 years ago
marc c65a409f0d add duckduckgo images engine 7 years ago
Adam Tauber 1972a044a3 [fix] produce valid urls if scheme is missing 7 years ago
Adam Tauber 8db527c1d2 [fix] use raw response with etree.parsefromstring - Unicode strings with encoding declaration are not supported 7 years ago
Adam Tauber a4f7046b8f [fix] pep8 7 years ago
Noémi Ványi 2a9339f4b7 [fix] add more description on possible search tpyes 7 years ago
Noémi Ványi 2db807368c [enh] make search type of mediawiki configurable 7 years ago
marc 9ee8e552da [fix] bing images 7 years ago
Dalf c233bf0df9 [fix] framalibre : remove result['thumbnail'] (not used) 7 years ago
Alexandre Flament f5128c7cb9 [mod] add/modify image fetching for bing_news, qwant and twitter engines 7 years ago
Adam Tauber 52e615dede [enh] py3 compatibility 7 years ago
Adam Tauber 46a2c63f8e [fix] yahoo news date parsing 7 years ago
Alexandre Flament 12d91c1d67 [mod] searx doesn't crash at startup when an engine can't be loaded (see #884) 7 years ago
François Revol 45d15bd6f0 Add framalibre engine
framalibre.org is a catalogue of Free Software, edited by Framasoft.

For now we pass the thumbnail as img_src as it doesn't seem to be used
for IT...
7 years ago
marc fd65c12921 make search language handling less strict
languages.py can change, so users may query on a language that is not
on the list anymore, even if it is still recognized by a few engines.

also made no and nb the same because they seem to return the same,
though most engines will only support one or the other.
7 years ago
marc 805fb02ed1 add language support for qwant
closes issue #863
7 years ago
Noémi Ványi 742e4dfdcc [fix] fix paging of duckduckgo 8 years ago
Adam Tauber 52d1087202 [enh] add result number parsing to google engine 8 years ago
Adam Tauber 57149661e4 Merge pull request #830 from davidar/se
Semantic Scholar
8 years ago
Adam Tauber b1d49bacb0 Merge pull request #827 from davidar/spell
[enh] show spelling corrections
8 years ago
David A Roberts 7492997c51 [fix] allow empty content 8 years ago
David A Roberts 1d30141c20 [enh] show spelling corrections 8 years ago
jcherqui 39a97825d9 Add torrent infos 8 years ago
jcherqui 4776abd1ac Remove categories 8 years ago
jcherqui a320328dda Fix test 8 years ago
jcherqui 0549fb40d2 Add 1337x.to engine 8 years ago
Adam Tauber 108392f8da [fix] skip non-complete google news results 8 years ago
Adam Tauber 0d4da30c7f [enh] add instant answers to google engine 8 years ago
Adam Tauber 525e5d275c Merge pull request #801 from dalf/extract_text
[fix] extract_text: use html.tostring instead html_to_text.
8 years ago
Alexandre Flament 90e1db3e5c [fix] extract_text: use html.tostring instead html_to_text. Fix #711 8 years ago
Adam Tauber 6bf9c398a7 [fix] use english as default language in bing
If no language is specified, bing returns results with multiple languages
for one query which isn't really useful. Setting english as default
insted if nothing.
8 years ago
marc 1175b3906f change language list to only include languages with a minimum of engines
that support them.
users can still query lesser supported through the :lang_code bang.
8 years ago
Adam Tauber 8bff42f049 Merge branch 'master' into languages 8 years ago
Adam Tauber ea034fafa9 [fix] proper engine init 8 years ago
Adam Tauber a605377c40 [enh] explicit engine init 8 years ago
Adam Tauber b6fc154616 [enh] add searx engine 8 years ago
Adam Tauber 0171db5c3f [fix] handle missing images in google news 8 years ago
marc 7fba3d1179 Merge branch 'master' into languages 8 years ago
marc 4a1ff56389 minor fixes in utils/fetch_languages.py 8 years ago
Adam Tauber 19cd2bf3b5 [fix] re-enable ssl verification for searchcode
closes #784
8 years ago
marc af35eee10b tests for _fetch_supported_languages in engines
and refactor method to make it testable without making requests
8 years ago
marc e0c270bd72 tests for language support in engines 8 years ago
marc f62ce21f50 [mod] fetch supported languages for several engines
utils/fetch_languages.py gets languages supported by each engine and
generates engines_languages.json with each engine's supported language.
8 years ago
marc 92c6e88ad3 small fixes 8 years ago
marc a11948c71b Add language support for more engines. 8 years ago
marc c677aee58a filter langauges 8 years ago
marc 149802c569 [enh] add supported_languages on engines and auto-generate languages.py 8 years ago
Noémi Ványi b034356825 add year filter to engines with time range support && tests
Following engines does not support "Last year":
 * Bing News
 * DeviantArt
 * DuckDuckGo
 * Yahoo
 * YouTube (noapi)
8 years ago
Noémi Ványi c59c76e6ee add year to time range to engines which support "Last year"
Engines:
 * Bing images
 * Flickr (noapi)
 * Google
 * Google Images
 * Google News
8 years ago
Thomas Renard 16c40ffefd htmlize yacy contents 8 years ago
Noémi Ványi 553f3a3a1a fix xpath of yandex 8 years ago
Adam Tauber 4676e1dcde [fix] remove result escaping from engine 8 years ago
Adam Tauber 1a82ed6f54 [mod] 500px rewrite 8 years ago
Adam Tauber f6e9c074bb [fix] vimeo engine change follow-up 8 years ago
Adam Tauber a2c94895c1 [fix] google news engine change follow-up 8 years ago
Adam Tauber c3dcebb778 [fix] remove html tags from qwant results 8 years ago
Adam Tauber e12a137a2f [fix] unicode response in deezer and spotify 8 years ago
Alexandre Flament e48f07a367 Merge branch 'master' into searchpy2 8 years ago
Adam Tauber 72a217f983 [mod] return empty string on missing osm title
Sometimes openstreetmap returns with None as title. In these cases use an empty
string instead.
8 years ago
Adam Tauber fdf63940e8 [fix] tests ++ flickr error 8 years ago
Adam Tauber 16bdc0baf4 [mod] do not escape html content in engines 8 years ago
Adam Tauber 55dc538398 [mod] move load_module function to utils 8 years ago
Adam Tauber 43ddbc60da [fix] pep8 8 years ago
Adam Tauber 16f2e346b3 [fix] bing unicode issue part III. 8 years ago
Adam Tauber 1176505fa4 [fix] bing character encoding - closes #760 8 years ago
Adam Tauber 17b08d096c [fix] unicode search expression for bing 8 years ago
Adam Tauber 16ff8d06c7 [fix] bing paging and language support
see https://msdn.microsoft.com/en-us/library/ff795620.aspx for bing
specific search operators

closes #755
8 years ago
Alexandre Flament 01e2648e93 Simplify search.py, basically updated PR #518
The timeouts in settings.yml is about the total time (not only the HTTP request but also the prepare the request and parsing the response)
It was more or less the case before since the threaded_requests function ignores the thread after the timeout even the HTTP request is ended.

New / changed stats :
* page_load_time : record the HTTP request time
* page_load_count: the number of HTTP request
* engine_time : the execution total time of an engine
* engine_time_count : the number of "engine_time" measure

The avg response times in the preferences are the engine response time (engine_load_time / engine_load_count)

To sum up :
* Search.search() filters the engines that can't process the request
* Search.search() call search_multiple_requests function
* search_multiple_requests creates one thread per engine, each thread runs the search_one_request function
* search_one_request calls the request function, make the HTTP request, calls the response function, extends the result_container
* search_multiple_requests waits for the the thread to finish (or timeout)
8 years ago
Noémi Ványi 1490d6bc93 add time range search for flickr 8 years ago
Noémi Ványi 5c02b9ef31 add time range support for bing images 8 years ago
Noémi Ványi d2e0cbafb2 add time range search for youtube noapi 8 years ago
Noémi Ványi f70d405202 add time range search for bing news 8 years ago
Noémi Ványi 53c9fde992 fix kickass torrents engine 8 years ago
Alexander Minges 3c5883408c initial commit of pdbe engine
Adds support for queries to the Protein Data Bank Europe (PDBe).
8 years ago
Pydo 55a5b686ed Merge branch 'master' of https://github.com/asciimoo/searx into feature/seedpeer-engine-integration
Resolved conflict searx/settings.yml
8 years ago
Adam Tauber 86daef2063 [fix] do not allow underscore in engine names - closes #708 8 years ago
marc d1d4ed4376 [fix] results with digbit don't truncate anymore 8 years ago
firebovine e145fdb86d #607 - noapi fix 8 years ago
Lorenzo J. Lucchini a8907224a1 Improving Wolfram Alpha search hit content
Making WA search hits contain
- the (parsed) input inside the "title" instead of just "Wolfram|Alpha", to better match other hit titles and to confirm correct parsing of input to the user
- the first output field that contains any text (skipping ones that are only pictures; this is usually the most meaningful "result" field) instead of the raw input as the "content", making it additionally possible to obtain WA computations from JSON API calls
8 years ago
marc 09ee2aa69d [fix] Result text in Wolfram|Alpha (#607) 8 years ago
Adam Tauber 3144ec1d59 [fix] unicode urls 8 years ago
potato 983415bc38 [enh] is_valid_lang moved to utils 8 years ago
potato b7d578ae80 [enh] engine header comments 8 years ago
potato 8c72a22757 [enh] api_key usage, disable the engine by default 8 years ago
potato c051e6a2c3 [fix] pep8 8 years ago
potato ab471fd13b [enh] mymemory translated engine added for multi-word translations 8 years ago
potato 22bd39fd42 [fix] only 1-word search triggers the engine 8 years ago
potato 7bf1013c15 [enh] removed missing params; [fix] pep8 8 years ago
potato bc806bfab1 [fix] no lambda anymore, cgi.escape 8 years ago
potato 5416f0f248 [enh] dictionary engine renamed to dictzone 8 years ago
potato 84ff6e289e [enh] filter non-existing language code/name containing requests 8 years ago
potato b808a2e266 [fix] don't merge with suggestions 8 years ago
potato 3f4cc2146c [enh] return results instead of answers 8 years ago
potato f4525880ed [enh] dictionary engine added 8 years ago
Pydo 2c2123b2e8 Added seepeer to config and added seepeer search parser 8 years ago
Adam Tauber 13bed1f872 Merge pull request #639 from kvch/digbt-engine
add digbt engine - fixes #638
8 years ago
David A Roberts 1e9dab08e6 [fix] behaviour for page_size>1 and first_page_num>0
eg. pageno=1,21,41,... instead of 20,40,60,...
8 years ago
Noemi Vanyi 3a1c5876b1 add digbt engine
Unfortunately, it is quite slow so it is disabled.
Furthermore, the display of number of files is wrong
on digbt.org, so it is not displayed on searx.
8 years ago
Adam Tauber 104cdb7d03 [fix] remove unused imports ++ int token timestamp by default 8 years ago
Adam Tauber eb57481450 [fix] google images paging - closes #571 8 years ago
David A Roberts 61fe5daf30 Add Microsoft Academic search engine 8 years ago
Adam Tauber 31866d9f33 Merge pull request #629 from stepshal/E305
Add missing blank lines after class or function definition.
8 years ago
stepshal 7e16c9d360 Add missing blank lines after class or function definition. 8 years ago
David A Roberts 117d36b66e Add Crossref search engine and DOAI rewrite plugin 8 years ago
marc ad58b14be7 [fix] merge infoboxes based on weight
also minor changes in attributes and images from wikidata
8 years ago
marc c2e4014287 [fix] urls merge in infobox (#593)
TODO:
    merge attributes
8 years ago
marc a0a1284998 wikidata refactor and more attributes (see issue #560) 8 years ago
a01200356 93ef11adc0 [enh] multilingual wikidata
disambiguation and tags are in local language

TOFIX:
    needs to query the api every time to know each label's name
8 years ago
Adam Tauber 350a84520d [fix] time range detection 8 years ago
Adam Tauber 7d9c898170 Merge pull request #634 from kvch/advanced-search
support time range search
8 years ago
Adam Tauber 54d987636e [fix] do not load engines which cannot be initialized - closes #585 8 years ago
Noemi Vanyi a7c8d5882c fix pep8 8 years ago
Noemi Vanyi e7baf24ec1 add time range search for deviantart 8 years ago
Noemi Vanyi e9a78f1434 add time range search for google images 8 years ago
Noemi Vanyi f13b9fa36a add time range search for duckduckgo 8 years ago
Noemi Vanyi 2e5839503f add time range search for google 8 years ago
Noemi Vanyi 93c0c49e9a add time range search with yahoo 8 years ago