Different loss values for seemingly same forecast #32

tvonich · 2023-12-04T18:58:48Z

I performed "1 Eval Step" forecast with Graphcast small using dataset_source-era5_date-2022-01-01_res-1.0_levels-13_steps-01.nc, steps-12.nc, and steps-40.nc.

The loss is different for each case even though we are only performing a 6 hr forecast in each case. Why might this be? As I understand it, the prediction should be the same in all these cases and the target should also be the same (ERA5 Reanalysis 6hr into the future).

Loss Values:
0.9296875 for 6hr forecast 1-step data 01 Jan 2022
0.69140625 for 6hr forecast 12-step data 01 Jan 2022
0.66015625 for 6hr forecast 40-step data 01 Jan 2022

tewalds · 2023-12-20T08:22:00Z

One confusing thing about the extract_inputs_targets_forcings is it does it right-aligned instead of left-aligned, so your dataset may start from the same dates, but your actual inputs are not the same date.

tvonich · 2024-01-13T19:01:40Z

One confusing thing about the extract_inputs_targets_forcings is it does it right-aligned instead of left-aligned, so your dataset may start from the same dates, but your actual inputs are not the same date.

Hey Timo,
That would explain it. Why is it structured this way? How does the training method deal with it?

tewalds · 2024-01-14T01:37:32Z

I'm not sure why this was chosen as the default, but sometimes you want to know how your error changes as you predict the same time from different points in the past. There should probably be an option for choosing left vs right alignment. @alvarosg may have more context here.

tvonich · 2024-01-14T01:59:46Z

Thanks for the quick response. This work will hopefully get me going on the 1st chapter of my dissertation.

The differences I'm getting are fairly subtle. For example, I just ran a 12 hour forecast with the step-04 netcdf and did the same with the step-40 netcdf. The 6 hr and 12 hr losses are in the jpeg. If alignment was the issue, I'd think the differences would be really large in this case. Would you tend to agree or am I thinking about this the wrong way?

tewalds · 2024-01-14T02:03:23Z

The step-04.nc should be a subset of step-40.nc, I think with the same initial time. That should be pretty easy to verify by loading both and looking at the data. Then just make sure you're extracting the data correctly for your use case. Feel free to send a PR with a left/right alignment option.

tvonich · 2024-01-14T05:25:59Z

Ok. Yep. I see how it picks out the inputs and targets now. I'll make a few changes and try to submit a pull request this week. Thanks!

tvonich changed the title ~~Different loss values seemingly same forecast~~ Different loss values for seemingly same forecast Dec 4, 2023

tvonich mentioned this issue Jan 18, 2024

extract_input_target_forcings add option for left-justification of train/eval #56

Open

tewalds mentioned this issue Feb 22, 2024

when is the prediction result of this demo? #62

Closed

tvonich mentioned this issue Feb 23, 2024

Forecasting beyond 10 days #63

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Different loss values for seemingly same forecast #32

Different loss values for seemingly same forecast #32

tvonich commented Dec 4, 2023

tewalds commented Dec 20, 2023

tvonich commented Jan 13, 2024 •

edited

tewalds commented Jan 14, 2024

tvonich commented Jan 14, 2024

tewalds commented Jan 14, 2024

tvonich commented Jan 14, 2024

Different loss values for seemingly same forecast #32

Different loss values for seemingly same forecast #32

Comments

tvonich commented Dec 4, 2023

tewalds commented Dec 20, 2023

tvonich commented Jan 13, 2024 • edited

tewalds commented Jan 14, 2024

tvonich commented Jan 14, 2024

tewalds commented Jan 14, 2024

tvonich commented Jan 14, 2024

tvonich commented Jan 13, 2024 •

edited