Commit Graph

1055 Commits

Author SHA1 Message Date
Soner Tari
dff3f90c62 Add DEBUG_PROXY around debug func params 2021-10-08 01:17:50 +03:00
Soner Tari
f056f699c1 Add port option to all site specs, fix precedences in filtering rules
Now, all of the 'to' site fields in filtering rules can specify a port,
not just the dstip sites.

Fix the precedence of sites in the same type of rules. For example, if
we find a match with an sni site, we should not stop searching for a
match in cn, because a matching cn site may have a higher precedence
than the matching sni site. We should apply the action of the cn site,
although sni rules have precedence over cn. The same applies to http
host and uri rules too.

Fix the precedence of dstip rules.

Improve and update unit and e2e tests accordingly.
2021-10-07 22:22:23 +03:00
Soner Tari
114f01fa19 Fix missing all desc rules without user spec
If no user specified in an all desc (desc *) rule, we should set
all_user, otherwise the rule cannot be translated to data structs, they
go missing.
2021-10-07 12:22:36 +03:00
Soner Tari
7ed1396366 Add do {} while(0) around macros for semicolon termination 2021-10-07 01:20:03 +03:00
Soner Tari
017f0f8631 Improve and clean up code and documentation 2021-10-06 19:41:00 +03:00
Soner Tari
4602d0109b Update third-party licenses and documentation
aho_corasick_template*.h library is licensed under GPLv3. The developer
has been contacted for a license change to the LGPL.
2021-10-06 12:56:27 +03:00
Soner Tari
9d2e523cd0 Use Aho-Corasick machines for substring matching
Now, the filter uses B-trees for exact string matching and Aho-Corasick
machines for substring matching. B-trees and AC machines are exported to
linked lists for debug logging only.

Also,
- Separate all_sites and all_ports filters from substring filters. They
are not related with substring filters actually, and ACM keywords cannot
be empty strings anyway. So now they should be handled separately too.
- Improve debug logging of filtering rules.
- Update unit tests accordingly, and improve.
- Fix pxyconn_filter(), keep searching for a match in substring filters
if exact match does not have a matching site rule.
- Increase common names max len and tokens. weather.gov has 73 tokens.
- Rename keyword to desc.
- Update documentation.
- Clean up.
2021-10-05 23:00:17 +03:00
Soner Tari
97117d4e50 Fix and update documentation
We use B-tree not BST
2021-10-04 02:18:44 +03:00
Soner Tari
96ba8557d6 Add unit tests for substring 'from' fields in filtering rules
Also, improve code
2021-10-03 23:50:41 +03:00
Soner Tari
640558863c Add NLORNONE macro 2021-10-03 21:37:58 +03:00
Soner Tari
f6e6b25221 Never pass NULL as rule param to filter_set() 2021-10-03 20:54:29 +03:00
Soner Tari
477bb239a0 Do not tokenize ssl_names if there is no rule to match exact common names 2021-10-03 15:18:36 +03:00
Soner Tari
9959bb48e9 Fix argc inc in cn token loop 2021-10-03 15:06:48 +03:00
Soner Tari
765c0dac05 Fix site precedence handling, and use all_sites and all_ports
Actually, no need to check all_sites or all_ports, because strstr(3) on
OpenBSD reads that "If little is an empty string, big is returned", and
if all_sites or all_ports is set, site or port (little/needle) is empty.
But using all_sites and all_ports should improve performance by avoiding
the strstr() call.
2021-10-03 14:35:02 +03:00
Soner Tari
e654ca4e2c Fix memory leaks in filter
Add attributes
Update documentation
2021-10-03 13:08:27 +03:00
Soner Tari
f44f12456c Fix unit tests with WITHOUT_USERAUTH
And update documentation
2021-10-03 00:56:45 +03:00
Soner Tari
2ff0f728e5 Use template macros for code reuse, and append to linked lists
Add to the end of linked lists for correct list ordering, but btrees
cannot obey this ordering.
Also, update the unit tests accordingly.
And fix compile with WITHOUT_USERAUTH.
2021-10-02 23:13:56 +03:00
Soner Tari
4f36a21c78 Use kbtree BST for exact match in site and port 'to' fields
So, for 'to' fields too, we use two separate data structures: binary
search trees (BST) for exact match and linked lists for substring match.

Now all 'from' and 'to' fields in filtering rules use these two data
structures.

To repeat, filtering rules should be written with exact matches instead
of substring matches, as much as possible. Because BST search must be
much faster than substring search over linked lists.

To repeat, we have modifed kbtree to support complex data structures in
from fields.

Also, update the unit tests accordingly.
2021-10-02 21:21:24 +03:00
Soner Tari
15991dfb93 Use kbtree BST for exact match in user, keyword, and ip 'from' fields
So, now we use two separate data structures: binary search trees (BST)
for exact match and linked lists for substring match.

Currently, only user, keyword, and ip 'from' fields in filtering rules
use these two data structures. This also means that now we support exact
and substring matches in 'from' fields.

Filtering rules should be written with exact matches instead of
substring matches, as much as possible. Because BST search must be much
faster than substring search over linked lists.

We have modifed kbtree to support complex data structures in from
fields.
2021-10-02 12:20:28 +03:00
Soner Tari
95dd3bb9f8 Update khash, fixes typos 2021-09-29 12:02:27 +03:00
Soner Tari
ec816e7db6 Fix documentation of user desc keyword exact search 2021-09-29 12:01:18 +03:00
Soner Tari
2b4cbd27fd Increase max tokens for filter rules to 17
+= port + serverport

And improve documentation
2021-09-27 19:31:53 +03:00
Soner Tari
e844f30886 Convert warning and info logs in filtering rules to fine debug logs 2021-09-27 17:23:09 +03:00
Soner Tari
66f7a88374 Move DivertUsers/PassUsers options to filter.c
The DivertUsers and PassUsers options will be deprecated in favor of
filtering rules in the future.
2021-09-27 16:03:32 +03:00
Soner Tari
21fed37a92 Rename tmp_global_opts to global_tmp_opts 2021-09-27 14:35:16 +03:00
Soner Tari
fd6c852355 Move filtering rules to filter.c/h
Also, fix certain and possible memory leaks in debug printing
And improve code
2021-09-27 14:17:25 +03:00
Soner Tari
14f68457fb Fix the ordering of sites, ports, and macro values in filtering rules
all_sites and all_ports rules should be at the end of their lists, they
should be searched last, because they are the least specific rules in
their lists, hence have lower precedences.

Also, obey the order of rules in conf files by adding sites, ports, and
macro values to their lists in the same order they are in conf files.

Update the unit and e2e tests accordingly, and improve.
2021-09-26 21:11:48 +03:00
Soner Tari
c8f09d162a Add port field to Dst Host filter rules, and refactor for code reuse
Now the target IP address filters can use port specs too.
Refactor for code reuse, create filter_action struct used by rules,
sites, and ports.
Also, improve code and documentation.
2021-09-26 13:50:14 +03:00
Soner Tari
dc34bc1ccf Add e2e tests for filtering rules
End-to-end tests now require testproxy v0.0.4, which supports the new
Reconnect command for the Pass filtering rule.

Split mode with the -n option also supports filtering rules, so the
Divert rule can enable the divert mode even with the -n option. This is
because the purpose of the -n option is to convert sslproxy into an
sslsplit, and we want to support filtering rules in sslsplit-like
sslproxy too.
2021-09-25 11:40:44 +03:00
Soner Tari
39e1d87783 Fix deferred pass filter action in SSL filter
Do not call protopassthrough_engage() twice. If we call it to apply the
deferred pass action and also set the ctx->pass flag, it will be called
again in protossl_setup_src_ssl(). So, just set the flag and leave the
rest to protossl_setup_src_ssl().
2021-09-24 11:31:31 +03:00
Soner Tari
db9974617f Improve documentation
The Include and Define options, the ordering in config options, etc.
2021-09-23 22:03:32 +03:00
Soner Tari
712733b329 Reuse debug print of filter rule
Actually, the PassSite option will be deprecated, but reuse for
correctness.
2021-09-23 20:08:35 +03:00
Soner Tari
17630340d2 Add unit tests for filtering rules
for both the parser and the translator

TODO: Add reject tests
2021-09-23 18:23:50 +03:00
Soner Tari
e4d27bee99 Return 2 to indicate the end of structured proxyspec, not 1
Return value of 1 is for ending macro expansion. Otherwise, it coincided
with closing brace retval, which was wrongly breaking out of the while
loop in load_proxyspec_struct().

TODO: Use enums for return values.
2021-09-23 01:06:53 +03:00
Soner Tari
41dccaff14 Increase max tokens for filter rules to 15
+= master + cert
2021-09-22 23:01:38 +03:00
Soner Tari
f5bb67f581 Error out if struct proxyspec is not complete
The default proto is tcp, so we allow the Proto option to be omitted.
2021-09-22 21:11:06 +03:00
Soner Tari
c7e43f359d Do not allow empty option values except for the closing brace in struct proxyspecs
It is better to error out earlier than to pass the option down to be
rejected by its handler (the handler may fail to reject, as was the case
with a couple of options).
2021-09-22 21:09:28 +03:00
Soner Tari
abb301c418 Fail if struct ProxySpec has no arguments or closing brace 2021-09-22 16:42:38 +03:00
Soner Tari
d877b9a635 Fix and improve get_name_value() and unit tests
Fix possible segfault if name has leading white space
Pass the name param to get_name_value() as char *, so it cannot be
modified ever

Improve unit tests for get_name_value and proxyspec_parse
2021-09-22 15:29:09 +03:00
Soner Tari
ab7674b652 Do not allow Include option within include files, no recursive include files
Prevent infinitely recursive include files
2021-09-22 01:50:48 +03:00
Soner Tari
01ff382267 Do not allow macro within macro, no recursive macro definitions
Prevent infinite macro expansion
2021-09-22 01:23:00 +03:00
Soner Tari
206734e4b6 Return -1 on error from all configuration functions, never exit()
And use all those return values.

Since we support include files now, we should be able to report in which
include file the error has occured. This is not possible if functions
just bail out calling exit(), because the user has to scroll back stderr
lines to find which include file has failed loading (a line starting
with 'Conf: ').

Plus, calling exit() on errors reduces unit testability of functions.

Also, handle all possible out of memory conditions in opts.c.
2021-09-22 00:51:36 +03:00
Soner Tari
c38c065923 Add Include option for loading configuration from an include file 2021-09-21 19:20:43 +03:00
Soner Tari
0f5ed122fb Add Define option for macro definitions and macro expansion to filtering rules
The new Define option can be used for defining macros to be used in
filtering rules. Macro names must begin with a '$' char. Macro values
must be separated with spaces.

Macros are expanded by rewriting the rule with the values of macro.

PassSite rules do not support macros (the PassSite option will be
deprecated in favor of filtering rules in the future).
2021-09-20 00:58:00 +03:00
Soner Tari
446cc29f5f Don't change log action if not specified, and obey the order of filtering rules
Filtering rules can enable/disable or don't change logging. If a rule
does not mention a log action, its logging should not change. So, binary
log action fields were not enough to represent those 3 possibilities,
hence we have increased the size of those fields to 2-bits.

We should obey the order of rules as they are written in the conf file,
because latter rules should be able to override the log actions of
earlier rules. So, keep the order.
2021-09-19 16:24:35 +03:00
Soner Tari
ac3607a841 Add deferred pass and block actions
We should defer pass and/or block actions as long as possible, because a
higher precedence rule in SSL filter should be able to override (cancel)
deferred pass and block actions taken by a lower precedence rule in Dst
Host filter. And in HTTP filter the same applies to deferred block
actions taken by Dst Host and SSL filters.

Also, thanks to this new deferred actions, now HTTP filter can keep
enabled divert and split modes. In other words, a higher precedence HTTP
filter rule can cancel a deferred block action set by a lower precedence
rule earlier, which was not possible before without deferred actions and
rule precedence.

And other improvements.
2021-09-19 01:35:48 +03:00
Soner Tari
11884271fd Add negation prefix ! to log actions
Now filtering rules can disable log actions too. This is possible thanks
to the newly added precedence field of rules. Log actions of filtering
rules at higher precedence can modify logging now. In other words, more
specific rules can change the log actions of more general rules.
HTTP filtering rules can only disable logging.
2021-09-18 18:18:30 +03:00
Soner Tari
cc7bd4a332 Add precedence to filtering rules
Now we assign precedence to each filtering rule. More specific rules
have higher precedence. So, filtering rules at lower precedence cannot
override the actions applied to a conn by filtering rules at higher
precedence.
The other precedence rules still apply.
2021-09-18 16:58:42 +03:00
Soner Tari
0c9fe83bb5 Update unit tests with master and cert log actions 2021-09-18 01:01:47 +03:00
Soner Tari
8ec97d779f Do not take log actions in HTTP filtering rules
Log actions specified in HTTP filter rules can never enable disabled
logging, because their loggers would not be initialized.

Perhaps we should initialize them in the log submit function, if they
are initialized yet.
2021-09-18 00:43:18 +03:00