Commit Graph

686 Commits (fdc63b862e2979814c82e1226012d539bc25cab8)
 

Author SHA1 Message Date
Ben Busby fdc63b862e
Autoload `whoogle.env` if it exists
The whoogle.env file previously needed to be created and enabled using
the WHOOGLE_DOTENV var. This removes the second step and loads the env
file if it's found during app init.

The Dockerfile has also been updated to copy in whoogle.env if it
exists.

Fixes #909
1 year ago
Ben Busby aa54491ae0
Log rate-limiting errors
Rate limiting is now reported to the console as an error message.

Fixes #914
1 year ago
Charles Zawacki cec10e81d3
Don't prepend to services that have schemes with '//' (#925) 1 year ago
Ben Busby 2827a4ef47
Update whoogle.lunar.icu instance details [skip ci] 1 year ago
Charles Zawacki a760476d1b
Omit 'mobile.' and 'm.' in site alt replacements (#922)
Resolves #921
1 year ago
Dominick Peluso 4f77f3680d
Add whoogle.ungovernable.men instance [skip ci] (#919) 1 year ago
fiestasiesta 253ea62f8f
[Mobile] Add line break between header options (#918)
Fixes an issue where "Time Period" shows up on a separate
line from its dropdown
1 year ago
Ben Busby c24caceb03
Omit "www." in site alt replacements
Fixes #913
1 year ago
Ben Busby 4f85076a2b
Restrict Tor outbound ports w/ `WHOOGLE_TOR_FF`
Setting `WHOOGLE_TOR_FF` will attempt to modify the existing torrc file
with the "FascistFirewall 1" config, which restricts outbound ports to
reachable values.

Fixes #907
1 year ago
Ben Busby 76c78d8584
Add public instance [skip ci]
https://whoogle.lunar.icu
1 year ago
Ahmad Alkadri 3dda8b25ef
Escape html text in result body (#912)
Moved the cleaner functions to app/utils/escaper.py

Removed unused import 're'

Moved the cleaner functionalities to the "search.py" and "routes.py"

Making sure escaped chars stay escaped during process

Replaced "<" and ">" with "andlt;" and "andgt;", respectively. This way,
when the 'response' object get loaded to bsoup (which happens several times
throughout the process between search.py and routes.py), bsoup will not
unescape them.
1 year ago
MoistCat 08aa1ab8f1
Handle missing result div in filter (#911)
Changed "find_all()[0]" for find; which yields only one result.

Added check to ensure result_div exists before searching
for results.
1 year ago
fiestasiesta 7041b43db9
Add time constraint to search options (#888)
Introduces the ability to refine searches by time period:
- Past hour
- Past 24 hours
- Past week
- Past month
- Past year

Co-authored-by: Ben Busby <contact@benbusby.com>
1 year ago
Ben Busby 424e6dd341
Add script for updating translations
Still a manual process, but better than updating the json line by line.
I'm undecided on automating this anytime the translation json gets
updated, since it could possibly ratelimit lingva instances.
1 year ago
Ben Busby c9c197bb5f
Bump version to 0.8.1 1 year ago
Ben Busby 7a852aa876
Allow HTTP-exclusive proxies for all requests
Proxies that only support HTTP were causing request timeouts due to an
invalid upgrade to HTTPS when creating the request. This update restores
the ability to have an HTTP-only proxy for all requests.

Fixes #906
1 year ago
Cx 8fbbdf2cec
Update Kurdish translation (#903) 1 year ago
Ben Busby 3dc6d14377
Only extract domain+ext when using site alts
Parent sites using a 'www' subdomain or something similar were not
redirecting properly. This updates the hostname check to only validate
against the primary domain, except for Wikipedia since the subdomain is
used for interface translation in that case.

Fixes #901
1 year ago
dependabot[bot] cd7fce2822
Bump certifi from 2020.4.5.1 to 2022.12.7 (#902)
Bumps [certifi](https://github.com/certifi/python-certifi) from 2020.4.5.1 to 2022.12.7.
- [Release notes](https://github.com/certifi/python-certifi/releases)
- [Commits](https://github.com/certifi/python-certifi/compare/2020.04.05.1...2022.12.07)

---
updated-dependencies:
- dependency-name: certifi
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
1 year ago
Ben Busby fd85f1573a
Refactor site alt link replacement
Replacing result links and text when site alts are enabled is now part
of its own function, and handles replacement of link location and link
description separately.

Fixes #880
1 year ago
Ben Busby 0310f0f542
Use app init enc key by default for all queries
This can be updated later to allow users with cookies enabled to use a
key that is unique to their session (if they want, not mandatory), but
for now it makes more sense to just use a single key for all queries
from all users. This should eliminate a lot of issues that users have
reported where they are unable to decrypt queries or page elements due
to an expired/renewed session key.
1 year ago
PrivacyDevel 1226b8db9c
Update privacydev.net details, add onion instance [skip ci] (#899) 1 year ago
Raistlingru cde05ea55d
Add new public instance to readme [skip ci] (#897)
Add https://whoogle.hostux.net

Co-authored-by: Ben Busby <contact@benbusby.com>
1 year ago
Ben Busby 3bd785b9b7
Update sponsored result filter for german results
Adds 'gesponsert' to ad keyword blacklist

Fixes #892
1 year ago
Ben Busby 33742ce247
Revert change to light theme contrast text color
The change made to whoogle-contrast-text in #873 wasn't the right
decision, since whoogle-contrast-text is meant to contrast with darker
UI elements. whoogle-text already contrasts with the default white
background.
1 year ago
Anna 08b16f5a0c
Switch to PEP517 standard for builds (#887)
* Sync setup.cfg with requirements.txt

* Include tests in PyPI tarballs

And exclude them from setuptools

* Set version number only once

Switch to PEP517 standard (pyproject.toml) for builds
1 year ago
Ben Busby d099b46336
Bump version to 0.8.0 2 years ago
Ben Busby 09a90ec46a
Match only "//medium" and ".medium.com" for scribe links
Closes #885
2 years ago
Xabi 6bd48e40a7
Include new ad filter keyword (#879)
Adds "sponsored" result keyword for Spanish language
2 years ago
curlpipe 2d23e0e952
Add Welsh translation (#876) 2 years ago
xatier 1a66b195d4
Update zh-tw translation (#875) 2 years ago
Ben Busby a7fe1fd0df
Add public instance [skip ci]
https://www.indexia.gq

Closes #868
2 years ago
Ben Busby abbf037115
Remove dead public instance [skip ci]
whooglesearch.ml has apparently been broken for several weeks now, so
it's been removed from the readme and the instances list.

Closes #874
2 years ago
Ben Busby 06fd29f663
Update ad filter keywords
New changes to google search now include ads prefixed with the keyword
"sponsored". This update should remove these from appearing in search
results.

Fixes #871
2 years ago
Ben Busby 7494a14bc2
Remove `py` dependency, update `pytest` version
See https://github.com/advisories/GHSA-w596-4wvx-j9j6
2 years ago
Ben Busby 6696f2b12b
Escape word in term-bolding regex
Fixes #869
2 years ago
João 77884d05f2
Fix color for light contrast text (#873)
The color for the variable whoogle-contrast-text should be black or gray;
otherwise it will not be shown with white background.
2 years ago
João 3e39e0e041
Fix missing args in docstring [skip ci] (#872)
Update docstring with new arg
2 years ago
João 2a37619028
Replace error query params w/ preferences param (#867) 2 years ago
Abir10101 75682de892
Fix regex for bolding search terms (#865)
Updated regex to not remove chinese letters in bolding regex
2 years ago
João e99db8db26
Add country and interface lang to autocomplete (#866) 2 years ago
HPD47 6ca51ecdcb
Update readme (heroku + dead instance) (#864)
Heroku is no longer free, readme updated to reflect this change.

Removed whoogle.lunar.icu instance.

Co-authored-by: files18 <107342491+files18@users.noreply.github.com>
2 years ago
Shrirang Kahale a3fa999b0d
Removal of search.wef.lol [skip ci] (#860)
wef.lol redirects to a sketchy website and has been removed
2 years ago
Ben Busby 70df88b825
Add sourcehut to readme [skip ci] 2 years ago
Arya K 4d7254e74d
Add ~vern instances (clearnet, onion, i2p) [skip ci] (#858)
Co-authored-by: Ben Busby <contact@benbusby.com>
2 years ago
watchakorn-18k 4b2b0bf3c9
Include thai keyword in ads blacklist (#857) 2 years ago
watchakorn-18k 3943b2bc2c
Add thai translations (#856) 2 years ago
João 219fc58401
Fix handling of bangs (#851)
Changed the implementation to work if the bang is at anyplace in the query.

Added a check to not spend time looking for an operator if a "!" is not present
in the query.

No longer allowed to have the bang at the "!" char at the end, since this may
cause some conflicts like the issue cited before, where the ! is after a word
in the query, which is natural in most languages.
2 years ago
João 74503d542e
Encode config params in URL (#842)
Adds support for encoding (and optionally encrypting) user config values as
a single string that can be passed to any endpoint with the "preferences" url
param.

Co-authored-by: Ben Busby <contact@benbusby.com>
2 years ago
Biên 11275a7796
Add filter for ads in Vietnamese (#847) 2 years ago