Support using comments to select parts to encrypt #974

mitar · 2021-12-19T23:43:41Z

This PR adds support to annotate comments with a string (e.g., sops:enc) which can then be matched with a regex. If it matches, the corresponding value (the one which follows the comment) is encrypted while other values are not. (There is also the opposite regex available, to select those values which should not be encrypted.)

This enables the YAML file to have the same structure encrypted and decrypted, without having to add suffixes or manage complex regexes to match keys. See #543 for more discussion.

cmd/sops/main.go

config/config.go

config/config_test.go

mitar · 2022-03-04T10:22:56Z

Rebased this as well.

mitar · 2022-09-30T11:46:26Z

Rebased to latest develop.

mitar · 2023-07-17T17:21:51Z

I rebased to latest main.

Gui13 · 2023-08-16T18:55:36Z

Hey guys, we are interested in the merging of this PR, it solves a long-standing issue of the --encrypted-suffix which requires us to perform sed -i 's/encrypted_suffix//g' on all our files after sops decryption.

The linked issue, #543, is 4 years old, and this change elegantly solves the issue.

Is there something I can do to help on this?

hiddeco · 2023-08-16T20:22:44Z

This did not make the cut for v3.8.0, as this version will contain many changes already (especially in the area of rewriting all key source implementations to their latest API clients). Making it risky to add much more on top of it. It is however scheduled to be looked at for v3.9.0.

sops.go

mitar · 2023-09-22T10:04:09Z

This PR now depends on #1300.

mitar · 2023-09-22T19:18:18Z

@felixfontein: I addressed all review comments. I moved unrelated changes to a separate commit which is also part of #1300, so once that is merged it will not be in this PR anymore.

I also rebased to the current main branch.

mitar · 2023-09-25T09:56:53Z

I rebased after #1300 was merged.

Signed-off-by: Mitar <[email protected]>

felixfontein

I noticed a problem: when I store this as x.yml

# x
x: x

and run

sops --encrypt --unencrypted-comment-regex ENC x.yml | sops --decrypt --input-type yaml --output-type yaml /dev/stdin

I get

MAC mismatch. File has A4ABD4448C49562D828115D13A1FCCEA927F52B4D5459297F8B43E42DA89238BC13626E43DCB38DDB082488927EC904FB42057443983E88585179D50551AFE62, computed 9D5B4CB3A99652A79C6149B4C1B9F17CE15E5BBB5E13D2054610BDE0C7BE57B58CAEE9B503F432AF0F27F857D26D549494D2D86CF3E68F3DA78554089C86CD8F

This is caused because ENC matches the encrypted comment.

I'm not really sure what to do here. Potentially shouldBeEncrypted should know whether it is decrypting, and if it is, and the regex for the comment matches, it should first check whether the value looks like an encrypted string (https://github.com/getsops/sops/blob/main/aes/cipher.go#L186) before accepting the regex match. And at the same time, when it encounters such comments (that match the regex) during encryption, it should reject the file. Does that sound reasonable? Do you have another idea?

felixfontein · 2023-09-25T20:38:32Z

README.rst

+
+Conversely, you can opt in to only left certain keys without encrypting by using the
+``--unencrypted-comment-regex`` option, which will leave the values and comments
+unencrypted when they have a preeceding comment that matches the supplied regular expression.


The comment can also be on the same line, like foo: bar # ENC. (Which is the case because that comment is moved before x: y.)

mitar · 2023-09-25T22:13:18Z

Thank you for the review.

This is caused because ENC matches the encrypted comment.

Nice catch. I didn't really expect for people to use such simple regexes. Personally I use sops:enc which does not seem to have the issue you are describing (it cannot appear in encoded string).

Maybe we should just document this issue and suggest a regex (e.g., sops:enc)? So documentation could be something like: "do not pick a regex which can match encrypted values" (with link to the format of encrypted values)?

But it is not really nice for the security tool like this one to be able to be misconfigured.

Potentially shouldBeEncrypted should know whether it is decrypting, and if it is, and the regex for the comment matches, it should first check whether the value looks like an encrypted string.

I think we should do a check like:

Try to match the comment with encryptedValueRegexp (matching the structure of encrypted values) and with given regexp.
If only one of them match, good, we know what to do.
If both match, then we transform the comment by removing content matched by the encryptedValueRegexp and try to match given regexp again.
If it still matches, good, we know what to do.
If it does not match, we abort and complain that the regexp is too broad.

I think this is slightly better in sense that it informs the user about misconfiguration and that they should fix the regexp.

And at the same time, when it encounters such comments (that match the regex) during encryption, it should reject the file.

So during encryption, every time we encrypt any comment, we try to match the result with the given regexp, and if it does match the regexp, we abort and complain that the regexp is too broad. If I understand it correctly, I think this is fine.

So we should just make sure to help the user pick a good regexp by detecting too broad (or should we way "too simple"?) regexp and complain in such case.

mitar · 2023-10-11T16:17:18Z

@felixfontein Do you agree with my proposal above?

felixfontein · 2023-11-05T14:19:01Z

Hmm, reading all this again after some time mainly shows me that this is something we have to be very, very careful with.

One other solution that came to my mind when looking at this again: during encryption, once a comment is encrypted, check whether the encrypted comment matches UnencryptedCommentRegex or EncryptedCommentRegex (when provided). If any of them matches, reject the whole file.

mitar · 2023-11-07T22:20:40Z

One other solution that came to my mind when looking at this again: during encryption, once a comment is encrypted, check whether the encrypted comment matches UnencryptedCommentRegex or EncryptedCommentRegex (when provided). If any of them matches, reject the whole file.

Hm, isn't this the same as we have been discussing already? I wrote:

So during encryption, every time we encrypt any comment, we try to match the result with the given regexp, and if it does match the regexp, we abort and complain that the regexp is too broad. If I understand it correctly, I think this is fine.

So I think this is great. We seems to be in agreement what to do when encrypting. Just check if the result matches the regexp and abort if it does.

But I think the question is what to do when decrypting. So I can imagine a scenario where you encrypted only few lines with a comment like sops:enc. Great. But then in the encrypted file you decide to change sops:enc to enc comment. And then you try to decrypt that with an updated CLI argument. What should happen? This is why I proposed:

I think we should do a check like:

Try to match the comment with encryptedValueRegexp (matching the structure of encrypted values) and with given regexp.

If only one of them match, good, we know what to do.

If both match, then we transform the comment by removing content matched by the encryptedValueRegexp and try to match given regexp again.

If it still matches, good, we know what to do.

If it does not match, we abort and complain that the regexp is too broad.

I think this is slightly better in sense that it informs the user about misconfiguration and that they should fix the regexp.

Maybe a simpler way would be to store (and MAC) used regexp to encrypt into config section of the file. And when decrypting only use the regexp from the config section. So if you want to change the regexp/comment, you have to decrypt and re-encrypt? In general it might be nice user experience to store all those CLI config switches into config section so that you do not have to specify them again when decrypting (and match them exactly, and know what was used when encrypting).

What do you think?

felixfontein · 2023-12-19T13:31:53Z

One other solution that came to my mind when looking at this again: during encryption, once a comment is encrypted, check whether the encrypted comment matches UnencryptedCommentRegex or EncryptedCommentRegex (when provided). If any of them matches, reject the whole file.

Hm, isn't this the same as we have been discussing already? I wrote:

So during encryption, every time we encrypt any comment, we try to match the result with the given regexp, and if it does match the regexp, we abort and complain that the regexp is too broad. If I understand it correctly, I think this is fine.

So I think this is great. We seems to be in agreement what to do when encrypting. Just check if the result matches the regexp and abort if it does.

Yes, this part we agree upon.

But I think the question is what to do when decrypting. So I can imagine a scenario where you encrypted only few lines with a comment like sops:enc. Great. But then in the encrypted file you decide to change sops:enc to enc comment. And then you try to decrypt that with an updated CLI argument. What should happen?

IMO: if you modify the encrypted file so it won't decrypt anymore, it's your own fault if it fails.

This is why I proposed:

I think we should do a check like:

Try to match the comment with encryptedValueRegexp (matching the structure of encrypted values) and with given regexp.

If only one of them match, good, we know what to do.

If both match,

In case both match, I would simply error out. The user did something wrong (assuming we don't have a bug) and they need to fix it manually. TBH I would simply check the encryptedValueRegexp regular expression, and if it matches, assume that the comment is encrypted (and not even check the other regexp).

then we transform the comment by removing content matched by the encryptedValueRegexp and try to match given regexp again.

If it still matches, good, we know what to do.

If it does not match, we abort and complain that the regexp is too broad.

I think this is slightly better in sense that it informs the user about misconfiguration and that they should fix the regexp.

I would avoid that. It makes decryption less efficient (two regular expressions to test instead of potentially only one per comment), and increases complexity for a situation that should not arise if users do not modify sops encrypted files by hand.

Maybe a simpler way would be to store (and MAC) used regexp to encrypt into config section of the file.

Right now the MAC only checks the message, and not the metadata. I would avoid extending it. We could add another MAC which only covers metadata, but also that is something that should not happen in this PR.

And when decrypting only use the regexp from the config section.

That's what should happen anyway.

So if you want to change the regexp/comment, you have to decrypt and re-encrypt? In general it might be nice user experience to store all those CLI config switches into config section so that you do not have to specify them again when decrypting (and match them exactly, and know what was used when encrypting).

If you have to specify any switch again when decrypting, it is a bug that needs to be fixed ASAP. The only options that should affect decryption are general things like --output, --output-type, --ignore-mac.

felixfontein · 2023-12-26T09:29:15Z

I rebased this PR to do some (unrelated) work on top of it; the resulting commit is contained in #1387.

mitar · 2023-12-26T13:59:25Z

Thanks. I will look into this again after holidays.

mitar · 2023-12-26T14:20:07Z

BTW, if you are currently working on this and you are mentally in this logic and willing, feel free to wrap up with this PR yourself. If it is easier for you.

felixfontein · 2023-12-26T15:55:51Z

Not right now, but I might try to invest more time in this during the next days.

felixfontein · 2023-12-27T16:47:01Z

I'll work on it in #1392.

mitar · 2024-01-04T12:57:00Z

This is continued in #1392.

mitar mentioned this pull request Dec 19, 2021

Add ability to strip encrypted or unencrypted suffix on decryption #543

Open

mitar commented Dec 19, 2021

View reviewed changes

cmd/sops/main.go Outdated Show resolved Hide resolved

mitar commented Dec 19, 2021

View reviewed changes

config/config.go Outdated Show resolved Hide resolved

mitar commented Dec 19, 2021

View reviewed changes

config/config_test.go Outdated Show resolved Hide resolved

mitar force-pushed the encrypted-comment-regex branch from 84f864d to 96daaaf Compare December 20, 2021 00:09

mitar force-pushed the encrypted-comment-regex branch from 96daaaf to e7fdf5a Compare March 4, 2022 10:22

mitar force-pushed the encrypted-comment-regex branch from e7fdf5a to 2cacaca Compare September 30, 2022 11:46

mitar force-pushed the encrypted-comment-regex branch 2 times, most recently from da0ec2e to 969b8df Compare July 17, 2023 17:21

hiddeco added this to the v3.9.0 milestone Aug 16, 2023

felixfontein reviewed Sep 16, 2023

View reviewed changes

sops.go Outdated Show resolved Hide resolved

sops.go Outdated Show resolved Hide resolved

sops.go Show resolved Hide resolved

mitar mentioned this pull request Sep 22, 2023

Fix descriptions of unencrypted-regex and encrypted-regex flags, and ensure unencrypted_regex is considered in config validation #1300

Merged

mitar force-pushed the encrypted-comment-regex branch 3 times, most recently from 3cd2bfe to 6bc4a23 Compare September 22, 2023 10:02

mitar force-pushed the encrypted-comment-regex branch from 6bc4a23 to 86f2fe2 Compare September 22, 2023 19:15

mitar force-pushed the encrypted-comment-regex branch from 86f2fe2 to fcd2db2 Compare September 25, 2023 09:56

Support using comments to select parts to encrypt

aa2e39a

Signed-off-by: Mitar <[email protected]>

mitar force-pushed the encrypted-comment-regex branch from fcd2db2 to aa2e39a Compare September 25, 2023 10:00

felixfontein reviewed Sep 25, 2023

View reviewed changes

This was referenced Dec 15, 2023

Invalid --set format error when trying to set deeply nested value #1375

Closed

encrypted_regex for particular nodes in yaml? #1367

Open

felixfontein mentioned this pull request Dec 26, 2023

Move extraction of encryption and rotation options to separate functions #1387

Closed

felixfontein mentioned this pull request Dec 26, 2023

Move extraction of encryption and rotation options to separate functions #1389

Merged

felixfontein mentioned this pull request Dec 27, 2023

Support using comments to select parts to encrypt #1392

Open

mitar closed this Jan 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support using comments to select parts to encrypt #974

Support using comments to select parts to encrypt #974

mitar commented Dec 19, 2021

mitar commented Mar 4, 2022

mitar commented Sep 30, 2022

mitar commented Jul 17, 2023

Gui13 commented Aug 16, 2023

hiddeco commented Aug 16, 2023

mitar commented Sep 22, 2023

mitar commented Sep 22, 2023

mitar commented Sep 25, 2023

felixfontein left a comment

felixfontein Sep 25, 2023

mitar commented Sep 25, 2023

mitar commented Oct 11, 2023

felixfontein commented Nov 5, 2023

mitar commented Nov 7, 2023 •

edited

felixfontein commented Dec 19, 2023

felixfontein commented Dec 26, 2023

mitar commented Dec 26, 2023

mitar commented Dec 26, 2023

felixfontein commented Dec 26, 2023

felixfontein commented Dec 27, 2023

mitar commented Jan 4, 2024

Support using comments to select parts to encrypt #974

Support using comments to select parts to encrypt #974

Conversation

mitar commented Dec 19, 2021

mitar commented Mar 4, 2022

mitar commented Sep 30, 2022

mitar commented Jul 17, 2023

Gui13 commented Aug 16, 2023

hiddeco commented Aug 16, 2023

mitar commented Sep 22, 2023

mitar commented Sep 22, 2023

mitar commented Sep 25, 2023

felixfontein left a comment

Choose a reason for hiding this comment

felixfontein Sep 25, 2023

Choose a reason for hiding this comment

mitar commented Sep 25, 2023

mitar commented Oct 11, 2023

felixfontein commented Nov 5, 2023

mitar commented Nov 7, 2023 • edited

felixfontein commented Dec 19, 2023

felixfontein commented Dec 26, 2023

mitar commented Dec 26, 2023

mitar commented Dec 26, 2023

felixfontein commented Dec 26, 2023

felixfontein commented Dec 27, 2023

mitar commented Jan 4, 2024

mitar commented Nov 7, 2023 •

edited