Improve efficiency of summary metadata aggregation #539

asnyv · 2021-01-13T15:54:36Z

Currently the summary metadata is handled by a separate method in fmu-ensemble, which is about as heavy as the get_smry() itself, as it loads all the summary data.

In ecl2df it is now proposed to add this metadata directly to the pandas DataFrame.attrs for the dataset. The DataFrame.attrs is said to be experimental, so a bit risky to base on it, but there are some good opportunities. An alternative is something like returning a tuple of the df and a metadata dict instead of just the df if you set a flag.
At least, fmu-ensemble is planning to base this part of the code directly on ecl2df, so then we could get the same feature there.

On the webviz-side, the pandas.to_parquet() for portables doesn't support metadata directly, but according to this article the route to combine a df and json-like metadata dict in a parquet file doesn't seem too hard. A feature to combine dfs with metadata in our portables is anyways something I am sure that we can benefit from (unless reading the parquet back into pandas becomes a lot slower).

Think this can be a major gain in build time, and possibly also memory usage during build for apps using data from UNSMRY.

The text was updated successfully, but these errors were encountered:

asnyv · 2021-01-13T15:56:56Z

Alternative path to quicker and more memory efficient aggregation of metadata from SMSPEC is to solve equinor/ecl/issues/796 and utilize that in fmu-ensemble / ecl2df with an implementation of the aggregation close to how it currently is (so a separate function for metadata as today in fmu-ensemble)

anders-kiaer · 2021-06-28T19:58:36Z

Today, from webviz side, we probably want to let the .arrow dump solve efficient metadata aggregation "automatically" (simply be being a good format for arbitrary reads). 🚀

Co-authored-by: Havard Bjerke <[email protected]>

asnyv added enhancement 🚀 New feature or request Data input This issue related to extracting/manipulating or organizing input data to Webviz labels Jan 13, 2021

asnyv added this to Backlog 📝 in Webviz via automation Jan 13, 2021

anders-kiaer mentioned this issue May 31, 2021

Slow libecl reading with large ensembles #418

Closed

anders-kiaer closed this as completed Jun 28, 2021

Webviz automation moved this from Backlog 📝 to Done 🏁 Jun 28, 2021

VincentNevermore pushed a commit to VincentNevermore/webviz-subsurface that referenced this issue Jul 19, 2022

Added missing addon-redux dependency. (equinor#539)

b898010

Co-authored-by: Havard Bjerke <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve efficiency of summary metadata aggregation #539

Improve efficiency of summary metadata aggregation #539

asnyv commented Jan 13, 2021 •

edited

Loading

asnyv commented Jan 13, 2021

anders-kiaer commented Jun 28, 2021 •

edited

Loading

Improve efficiency of summary metadata aggregation #539

Improve efficiency of summary metadata aggregation #539

Comments

asnyv commented Jan 13, 2021 • edited Loading

asnyv commented Jan 13, 2021

anders-kiaer commented Jun 28, 2021 • edited Loading

asnyv commented Jan 13, 2021 •

edited

Loading

anders-kiaer commented Jun 28, 2021 •

edited

Loading