2
0
mirror of https://github.com/carlostrub/sisyphus synced 2024-10-31 09:20:15 +00:00
Go to file
2018-05-22 23:02:39 +02:00
docs add slides 2018-02-28 23:43:19 +01:00
sisyphus Add dry run mode that does not move mails after classification (fixes #8) 2018-02-03 22:08:12 +01:00
test/Maildir Only accept large unicode characters as individual words 2017-09-16 22:07:36 +02:00
.coveralls.yml add coveralls 2017-03-14 21:11:39 +00:00
.gitignore gitignore 2017-09-16 21:49:52 +02:00
.travis.yml fix travis build 2018-04-06 22:06:09 -03:00
appveyor.yaml add line above appveyor.yaml 2017-03-08 07:54:51 +00:00
CHANGELOG.md changelog for dep change 2018-02-07 23:08:29 +01:00
classify_test.go if word list is too long, only take a random subsample (fixes #11) 2018-04-05 22:30:18 -03:00
classify.go please gometalinter 2018-04-22 21:36:57 +02:00
CODE_OF_CONDUCT.md Create CODE_OF_CONDUCT.md 2017-09-21 23:26:11 +02:00
CONTRIBUTING.md Create CONTRIBUTING.md 2017-09-21 23:30:13 +02:00
database_test.go make gometalinter happier 2017-05-13 22:34:54 +00:00
database.go Use path/filepath for cleaner and safer path generation 2018-01-11 21:35:27 +01:00
Gopkg.lock Update dependencies 2018-05-22 23:02:39 +02:00
Gopkg.toml replace glide by dep (fixes #10) 2018-02-07 22:17:04 +01:00
info.go move info into the library 2018-01-18 20:50:16 +01:00
learn_test.go fix tests 2017-06-05 14:13:27 +00:00
learn.go please gometalinter 2017-09-17 19:17:43 +02:00
LICENSE Welcome 2018 2018-01-11 21:14:13 +01:00
mail_test.go improve memory footprint 2017-09-17 00:56:17 +02:00
mail.go if word list is too long, only take a random subsample (fixes #11) 2018-04-05 22:30:18 -03:00
Makefile replace glide by dep (fixes #10) 2018-02-07 22:17:04 +01:00
mkdocs.yaml update initial docs 2017-03-08 08:23:38 +00:00
README.md Add dry run mode that does not move mails after classification (fixes #8) 2018-02-03 22:08:12 +01:00
sisyphus_suite_test.go separate out command and package 2017-04-15 20:23:26 +00:00

Sisyphus: Intelligent Junk Mail Handler

As we all know too well, many mails we receive are undesired for various reasons. Sometimes, we just do not want to be part of a scam, sometimes we really prefer no to have this latest joke mail sent by our beloved friends -- even though we are happy to exchange serious messages with them.

Sisyphus is a junk mail handler of the latest generation. It has the following features:

  • requires zero configuration, neither on the server nor on the client
  • works with any MTA and any client
  • learns about your preferences based on all messages in your inbox and your junk folder
  • can handle multiple mail accounts with independant junk mail preferences
  • requires minimal resources, e.g. learning over 50000 mails and keeping track of roughly 90000 words requires only 10MB of storage

Build Status Go Report Card GoDoc Documentation Codebeat Coverage

How it works

Sisyphus analyzes each mail in the inbox and the junk folder and uses its subject and text to improve the learning algorithm. Whenever a new mail arrives in the Maildir/new directory, Sisyphus classifies this mail based on its content. Junk mails are then moved automatically to the Maildir/.Junk directory, while good mails are left untouched. See the following blog post on a rather non-technical explanation.

Technically, Sisyphus applies a classic Bayesian Update algorithm to classify mails. However, in contrast to many traditional junk mail filters, classification is based on all mails ever received. This includes mails that are classified by the user as junk by moving them manually into the junk folder, or mails that have been correctly classified by Sisyphus previously. This is only possible with limited resources by applying the HyperLogLog algorithm to store the learned mails.

The learned information is stored in a local database called sisyphus.db which is located in each Maildir directory.

Install

Sisyphus can be installed by downloading the released binary package.

To build from source, you can

  1. Clone this repository into $GOPATH/src/github.com/carlostrub/sisyphus and change directory into it
  2. Run make build

This will leave you with ./sisyphus in the sisyphus directory, which you can put in your $PATH. (You can also take a look at make install to install for you.)

Usage

First, set the environment variable necessary for operation:

$ setenv SISYPHUS_DIRS PATHTOMAILDIR

or

$ export SISYPHUS_DIRS=PATHTOMAILDIR

or for Windows

$ set SISYPHUS_DIRS=PATHTOMAILDIR

For all other configuration options, please consult the help. It can be started by running

$ sisyphus help

To start sisyphus, do

$ sisyphus run

To display various statistics, do

$ sisyphus stats

(caveat: run at least one learning cycle)

See the help for more details.

License

Sisyphus is licensed under the 3-Clause BSD license. See the LICENSE file for detailed information.