Term with payloads #280

LCBH · 2023-09-12T07:39:34Z

See [#279 ]
One approach to integrate bit-level mutations (like HAVOC) to tlspuffin and DY fuzzing in general is as follows:

Have a mutation make_bitstring that:
a. randomly choose a sub-term t in a recipe in the current trace (similar to what is done in this clone,
b. evaluate it to a bitstring b,
c. associate a mutable version of b, amenable to bit-level mutations, to t in fields payload and initial_payload of t.
The idea is that, from now on, future evaluation of t will use payload instead of using the evaluation of t and bit-level mutations can mutate in-place the field payload.
Have bit-level mutations like HAVOC operating on all payloads of all sub-terms of all recipes in the trace.
When evaluating the trace and one of its recipe t:
a. evaluate it to b_t as currently done.
b. for each sub-term t' of t having a payload b (hence make_message has been used on t'), do the following:
- replace t.initial_payload that must be found in b_t by t.payload (even if the two do not have the same size)
- [Optional] if multiple t.initial_payload can be found at different locations, find the right location corresponding to t'. (One option is to evaluate the first child of t that has t' as descendant, evaluate this child as a bitstring, and find b_t in it. If there is only one matching location, then we win, otherwise, we recursively do the same on this child (instead of t).
Send the modified b_t to the suitable agent.

Pros:

compared to [Constant bitstrings in atoms (for bit-level fuzzing) -- Attempt with Codec::read_bytes #278], this approach allows to accept any kind of bit-level mutations. Approaches based on re-interpreting the mutated bit-strings in the Mapper internal structure (here rustls) rejects a lot of mutations that make this re-interpretation/parsing fail. Problem 1: We are not really interested in fuzzing the Mapper: a mutation that may be gracefully rejected by the Mapper might crash the PUT's "Mapper"/parsing routines. Problem 2: even if we used the PUT as Mapper, the context of evaluation in our fuzzer will not be the same as the one of the agent we fuzz, hence some bugs may be missed because of we reject a mutation on the fuzzer side (parsing fails) but this would still crash the PUT.
--> This approach does not suffer from those problems, any bit-level mutation will go through.
we are able to fuzz any sub-terms with this approach. On the contrary, approaches based on fuzzing rustls payloads (like this clone) are only able to fuzz certain parts of messages (such as certificates). Problem 3: structures of messages (header, length, etc.) cannot be fuzzed this way.

Cons:

quite cumbersome and hacky
tracking down which precise location of the bitstring b to replace is non-trivial (but possible)
we need to re-architecture the interface tlspuffin-puffin since no bitstring-access to the PUT agents are exposed (for sending bitstring), only message-level-access to the PUT is exposed
maybe too inefficient once we have a lot of sub-terms with payloads (?)

Question: some HAVOC sub-mutations are more expressive if we give them the concatenation of all payloads of all sub-terms (e.g., the ones that swap around some parts of the bitstrings). We may want do to that but reconstructing the new mutated payloads after having applied the mutation is non-trivial.

…value)

…el mutations)

TODOs: - correctly evaluate any sub-term in MakeMessage - modify evaluate for terms and replace found payloads_0 by payloads

…igate zoo

…ests

…ture, fully tested (in seeds.rs)

…e sup-terms of non-symbolic, fix many bugs, test to investigate mutation failures TODO: clear understanding of which mutations and replacement can fail

… Countable trait + various fixes

find_relative_node now returns `shift_ancestor_to_search:usize`: this can be non-zero when p is not a sibling but a sibling of an ancestor of to_search. It then corresponds to the position of `to_search` relatively to the evaluation of this ancestor. This can happen for example for append(f, fn_support_group_extension(to_search)) when relative node is f and fn_support_group_extension add some headers in front of to_search. shift_ancestor_to_search will be the length of this header.

However, we end up with inconsistent payload replacements. Example: fn_true -> MakeMessage -> BitFlip -> MakeMessage with payload_0 = 0 instead of 1.

…t-in argument like `HELLO_RETRY_REQUEST_RANDOM` which made replace_payload failed!

…n the HandShakeMewssage) + Fix mutation tests

… adding fn_payload_u16 + Fix heuristic 2

…window = parent.window

… encoding, thus failing payloads replacements. Fix most of them. This requires to add several new function symbols.

…or Payload* types (had to use a hacky new trait) add missing types.

….read.encode. Fix try_read_bytes that made this test fail. Add custom Codec2::read for several types. Remove the identity function fn_opaque_message and use type filtering instead (adapt seeds). Disable some safety checks to explore more messages; can be enabled with `enable-guards`. Fix some offset issue with reading PayloadU8. New Codec::read for HandshakeHash.

…utable! Now the test passes.

…ions, in which case MakeMessage only applies to root messages) + better experiment folder formatting

LCBH and others added 30 commits September 11, 2023 13:48

TermEval: term + 2 optional payloads Vec<u8> (initial value, mutated …

d1d86d5

…value)

Traces now contain TermEval, with optional payloads (allowing bit-lev…

3996312

…el mutations)

MakeMessage and BitFlip mutations

c4cf1cc

TODOs: - correctly evaluate any sub-term in MakeMessage - modify evaluate for terms and replace found payloads_0 by payloads

some corrections for passing tests

26077c2

Update certificates with 100000 day lifetime

99d2aed

Add openssl 312

529b7ed

some corrections for passing tests (all of them now succeed)

0a00d80

cleaning up and add "TODO-bitlevel" todos

82cc4d7

idea of evaluate sub-term by downcasting

7bfa5fd

two types of evaluation

97ed6d0

correct evaluation now, TODO: any_get_encoding

47a73bf

any_get_encoding is implemented for TLS, TODO: macro instead + invest…

e00f229

…igate zoo

Add cross-platform RNG

29d7a54

Set lib dir

c20f4cc

Fix crash if non-git directory

1e62dbf

Add leak test

b7be0a5

Update certificates with 100000 day lifetime

f147e3e

macro for any_get_encoding + more useful Fn error messages + better t…

50852b8

…ests

Merge remote-tracking branch 'origin/certs' into termWithPayloads

5974259

evaluate and lazy_evaluate are fully tested now and work

9659538

new input.input() implem, trace can be executed with the new architec…

b37b56f

…ture, fully tested (in seeds.rs)

evaluate now replace bitstrings from payloads as expected (V1 for now)

185f19b

mutations: new organization for bit-level mutations

0848ef9

bit_mutations.rs: we now have almost full HAVOC

a1d5223

rustfmt pass

29b8dd5

fix error in puffin tests

c58d3c0

integration tests for bit-level mutations

aca4f76

refine choose_term: exclude subterms of non-symbolic, possibly exclud…

729c511

…e sup-terms of non-symbolic, fix many bugs, test to investigate mutation failures TODO: clear understanding of which mutations and replacement can fail

refine opaque filtering for MakeMessage + refine encoding implem with…

3a7c526

… Countable trait + various fixes

clewan up MakeMessage::mutate

5a6f2ad

LCBH added 10 commits April 11, 2024 13:18

Fix MakeMessage: never chooses a non-symbolic term to mutate

5e2eb00

However, we end up with inconsistent payload replacements. Example: fn_true -> MakeMessage -> BitFlip -> MakeMessage with payload_0 = 0 instead of 1.

Fix silly bug in Heuritstic 2

162c3dd

Make sure fn_hello_retry_request takes all its argument, no more buil…

3ec242a

…t-in argument like `HELLO_RETRY_REQUEST_RANDOM` which made replace_payload failed!

fix HEAD~1

6a00889

Add a CLI option to always Skip bit-level mutations

ca88869

Fix test

143aba8

Fix HelloRetryRequest reading (version and random were already read i…

09d072d

…n the HandShakeMewssage) + Fix mutation tests

Fix fn_certificate_verify by removing built-in PayloadU16 encoding so…

84cafcf

… adding fn_payload_u16 + Fix heuristic 2

type alias for mutations

8538c63

LCBH mentioned this pull request Apr 15, 2024

Generic uni tests for fuzzer reachability #312

Open

LCBH added 18 commits April 17, 2024 17:11

fn_key_share_deterministic_extension is opaque

0c2cf76

find_unique_match_rec: Relax heuristic 2 and try to first make it so …

2f1c66d

…window = parent.window

try_read_bytes: no longer owns bitstring

89ab121

many fn_* symbol function were not "atomic" and had Payload* built-in…

87a90b2

… encoding, thus failing payloads replacements. Fix most of them. This requires to add several new function symbols.

Fix any_get_encoding that made test_term_eval fails: add new Encode f…

b5c2923

…or Payload* types (had to use a hacky new trait) add missing types.

Fix seed_cve_2022_39173_minimized

e212e95

oupsi

65d0ff6

disable test_invalid_length_errors when enable-guards is not enabled

edf3182

re-enable a safety check as it is used to detect when to stop deframing

51e230f

disable more invalid tests when enable-guards is disabled

e1d221d

test_term_payloads_eval: do not try to add payload if already no exec…

60fda9b

…utable! Now the test passes.

disable sanity check post payload_replacement in release mode

6c95b3a

cleaning up the PR

50f7d82

add option -wo-dy to disable DY mutations (but keep bit-level mutat…

157e5e1

…ions, in which case MakeMessage only applies to root messages) + better experiment folder formatting

fix tests

58ace6b

fix tests

75d7068

fix tests

a4902db

maxammann self-requested a review May 3, 2024 15:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Term with payloads #280

Term with payloads #280

LCBH commented Sep 12, 2023

Term with payloads #280

Are you sure you want to change the base?

Term with payloads #280

Conversation

LCBH commented Sep 12, 2023