Zone: reset memory #2086

catap · 2020-12-22T23:26:09Z

This commit moves memset from the alloc macros to zone.alloc, so that all allocated memory is reset in every case.

LeeTibbert · 2020-12-29T17:44:38Z

nativelib/src/main/scala/scala/scalanative/unsafe/Zone.scala

@@ -54,6 +54,7 @@ object Zone {
 if (rawptr == null) {
 throw new OutOfMemoryError(s"Unable to allocate $size bytes")
 }
+ libc.memset(rawptr, 0, size)


You are absolutely correct that having package.scala alloc and z.alloc have the same name but different behaviors when it comes to clearing is a nightmare. I have been
hosed by that more times than I care to admit in public, or even to myself in the dead and quiet of night.

I believe the discussion of memory handling here is larger than proper for a review comment. I will create an Issue to describe my concerns. In short, I think that
one needs to be able to create either raw or cleared memory.

See line 199 for the case which concerns me: why allocate cleared memory
when one knows that it will be immediately overwritten?

Limiting myself to only the suggested change: from my reading, in most C runtime libraries, especially libc, the combination malloc/memset is not directly equivalent to
calloc, and the latter is preferred. At the very least, calloc does a check for
integer overflow (yes, that slows it down by a few cycles). I understand that
calloc for more than 128 KiB can use some memory management techniques to delay the zeroing and otherwise improve performance.

A small improvement, but a well-known one and, IMHO, well worth reaping.
I suspect/believe that you went for a minimal change for existing code.
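
To make the overflow point concrete, here is a minimal C sketch, not code from this PR. The helper names `zeroed_array_malloc` and `zeroed_array_calloc` are hypothetical. It assumes a libc whose `calloc` rejects a `count * elem_size` product that does not fit in `size_t`, as current glibc does; with malloc + memset that check has to be written by hand.

```c
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: zeroed array via malloc + memset.
 * The multiplication can wrap around, so the overflow check is manual. */
void *zeroed_array_malloc(size_t count, size_t elem_size) {
    if (elem_size != 0 && count > SIZE_MAX / elem_size) {
        return NULL; /* count * elem_size would not fit in size_t */
    }
    size_t total = count * elem_size;
    void *p = malloc(total);
    if (p != NULL) {
        memset(p, 0, total);
    }
    return p;
}

/* Hypothetical helper: the same request through calloc, which is
 * assumed here to perform the overflow check itself. */
void *zeroed_array_calloc(size_t count, size_t elem_size) {
    return calloc(count, elem_size);
}
```

Both helpers return NULL for a request such as `(SIZE_MAX, 2)`. Without the manual check, the malloc version would silently allocate a buffer far smaller than the caller expects.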

@LeeTibbert I spent a lot of time before I realised this small difference. And I strongly suggest applying it, because finding this a second time is a nightmare.

Yes, you are correct: two methods with the same name and opposite behavior are at best an
H. P. Lovecraft nightmare and probably worse.

I agree with fixing that and propose that the allocator of cleared memory use calloc instead
of the memset/memcpy of this region. This is for all the reasons that you have described
in your other PR.

This part of the PR is good and, in my opinion, should advance. I believe that it would be better
with the suggested calloc change, if only to calloc(1, size).

It is not a direct concern of this PR, but I believe that it needs a prominent release
note. This change increases the likelihood of correct use, but also has an impact on
performance. Developers should be made aware of that.

My concern is that there should be a raw allocator, as well as a cleared allocator.

There are roughly 141 instances of " = alloc" now in the SN project. To advance things, in
another PR and after discussion I could create an allocRaw() and look at each of those
instances and see if it should clear or not. That would be a companion PR and limit
the performance impact of this PR.
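
A minimal C sketch of the raw/cleared split, assuming nothing about the eventual Scala Native API. The names `alloc_raw`, `alloc_zeroed`, and `copy_bytes` are hypothetical and only illustrate the idea that the name states whether the memory is cleared.

```c
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical: contents are indeterminate; the caller must write
 * every byte before reading it. */
void *alloc_raw(size_t size) {
    return malloc(size);
}

/* Hypothetical: every byte is zero on return. */
void *alloc_zeroed(size_t size) {
    return calloc(1, size);
}

/* A case where clearing would be wasted work: the buffer is fully
 * overwritten immediately after allocation. */
void *copy_bytes(const void *src, size_t n) {
    void *dst = alloc_raw(n);
    if (dst != NULL) {
        memcpy(dst, src, n);
    }
    return dst;
}
```

A caller that builds a structure field by field and relies on unset fields being zero would use `alloc_zeroed`; a caller like `copy_bytes` pays nothing for clearing it does not need.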

Did I mention that your fixing this is a Mercy? Yes, it is, thank you.

@LeeTibbert technically calloc is the same as malloc followed by memset; at least that is true for GNU libc.

Anyway, I see one difference between calloc and malloc + memset: calloc can be optimized to obtain zeroed pages from the kernel, which allows memory to be overcommitted for an SN application.

I have been thinking about this use case, and the more I think, the more I agree that it is bad: it opens a very bad door that lets an SN user allocate more memory than the system has, and when the application tries to use it, it will be killed by the OOM killer.

Basically, this uncertainty is the reason why I created this trivial PR to fix the issue and to defer calloc vs. malloc + memset to a future release.

LeeTibbert · 2020-12-29T17:57:20Z

nativelib/src/main/scala/scala/scalanative/unsafe/package.scala

@@ -238,7 +238,6 @@ package object unsafe {
 val $size = _root_.scala.scalanative.unsafe.sizeof[$T]($tag)
 val $ptr = $z.alloc($size)
 val $rawptr = $runtime.toRawPtr($ptr)


Again, I will discuss this in an Issue.

To express my concern in short:

In full generality, one would have both an allocator for cleared memory (this PR edit) and one
for raw memory (before this PR). I suspect that in almost all cases, memory allocated on the
stack is going to be immediately overwritten by other data. That is, the clearing is wasted.

A developer knows if they want cleared or raw memory and could/should call the proper
method. The key here is that the method have a name which gives a clue and documentation
which says exactly what it does.

To try to describe my thoughts: if there is only one stack allocator (pair?), it should have the existing behavior
to minimize change. It is documented in package.scala as not clearing memory, and there is no Zone
equivalent with the opposite behavior.

The question, at least in my mind, is whether implementing a new stackallocZeroed or stackallocCleared is
worth the added complexity. How one answers that may be a matter of personal programming style, expectation,
and experience.

Perhaps the stackalloc changes could be split out into another PR so that the z.alloc changes can advance quickly?

stackalloc is a tricky one. Technically it is {0}, and with the LLVM that I have and investigated, it adds a memset.

Anyway, I haven't had time to prove that this is the behaviour for all LLVM/clang versions and not some side effect on macOS.

As soon as I have some time for SN I'll do this investigation, and if it is true I'll remove this memset. But I don't like a blind shot without proof in this case, because it is easy to change and very difficult to track.
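
For readers unfamiliar with the `{0}` remark, here is a C analogue, offered only as a sketch: it is not what Scala Native generates, and the function name `stack_buffer_is_zeroed` is hypothetical. In C, an array initialized with `{0}` has every element zero-initialized. Whether a given compiler lowers that to a `memset` call is an implementation detail, which is the part the comment above says still needs proof.

```c
#include <stddef.h>

/* Hypothetical check: returns 1 if a stack buffer initialized with {0}
 * contains only zero bytes, 0 otherwise. */
int stack_buffer_is_zeroed(void) {
    unsigned char buf[64] = {0}; /* remaining elements are zero-initialized */
    for (size_t i = 0; i < sizeof buf; i++) {
        if (buf[i] != 0) {
            return 0;
        }
    }
    return 1;
}
```

If the stack allocation is already emitted with such an initializer, an additional explicit memset would clear the same bytes a second time.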

LeeTibbert reviewed Dec 29, 2020

LeeTibbert mentioned this pull request Dec 29, 2020

Migrated to calloc #2080

Draft

catap force-pushed the reset-memory branch from 73ee9bc to aaf5fc4 Compare December 29, 2020 20:23

catap requested a review from LeeTibbert December 29, 2020 20:25

catap force-pushed the reset-memory branch from aaf5fc4 to 614d7aa Compare April 5, 2021 04:50

Zone: reset memory

244a631

This commit moved `memset` from `alloc` macros to `zone.alloc` to reset all allocated memory at any case.

catap force-pushed the reset-memory branch from 614d7aa to 244a631 Compare April 9, 2021 09:48

ekrich mentioned this pull request May 4, 2021

Fix #2277: Expunge redundant memset calls #2278

Closed
