Saga stopped inside loop without throwing any error #1592

Open
joes-code opened this issue Sep 6, 2018 · 38 comments

@joes-code

Steps to reproduce

I've created a small repo and updated the readme with the expected and actual outcomes.
https://github.com/joes-code/redux-saga-beginner-tutorial

Description of the bug/issue

Inside the saga function, I'm simply looping and adding numbers. Depending on the loop size and the browser you use, the saga might finish the process and work as expected, or it might just stop halfway without any error.

Steps to reproduce the bug/issue

  1. Use Google Chrome
  2. npm install
  3. npm start
  4. Follow https://github.com/joes-code/redux-saga-beginner-tutorial#issue

Example

https://github.com/joes-code/redux-saga-beginner-tutorial

Actual results

Hello Sagas!
loop # 0
0 "-" 405450
loop # 1
1 "-" 405450
loop # 2

Expected results

Hello Sagas!
loop # 0
0 "-" 405450
loop # 1
1 "-" 405450
loop # 2
2 "-" 405450
loop # 3
3 "-" 405450
loop # 4
4 "-" 405450

Environment information

  • latest redux-saga version (please see package.json from my sample repo for more information)
  • Google Chrome 68.0.3440.106 (Official Build) (64-bit)
  • Mac
@feichao93
Member

This issue is caused by the yield statement here. Remove the yield and you'll get the expected results.

If you look at the call stack while processBigLoop is running, you can see that every yield adds 3 layers to the call stack. A stack overflow exception is thrown because there are too many yields here, but the exception seems to get swallowed somehow.

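A distilled sketch of the failing pattern (not the exact code from the linked repo, just the shape of it):

// each synchronously resolved `yield` of a plain value adds a few frames to
// redux-saga's internal call stack, so enough consecutive iterations overflow
// the stack and the saga dies silently (the exact count depends on the engine)
export default function* processBigLoopSketch() {
  let total = 0
  for (let k = 1; k <= 100000; k += 1) {
    total = yield k + total // plain value, resolved synchronously by the middleware
  }
  return total
}

// removing the yield keeps the addition out of the saga interpreter entirely:
//   for (let k = 1; k <= 100000; k += 1) total = k + total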

@Andarist Andarist added this to the v1 milestone Sep 7, 2018
@Andarist
Member

Andarist commented Sep 7, 2018

Thanks for the report. I'm planning to look into the issue - to see if we can avoid a stack overflow when processing a large number of consecutive sync effects.

@joes-code
Author

Thanks @shinima @Andarist for confirming the exception. Yes, it would be great if the exception didn't get swallowed, or was avoided altogether. In my real project, I was able to get around this by essentially removing the yield inside the loop, which might freeze the UI if the loop is big, but that shouldn't be a problem in my case.

@mshustov
Contributor

mshustov commented Sep 10, 2018

I checked our source code and it seems that the saga catches the error here:

if (mainTask._isCancelled) {
  env.logError(error)
}
mainTask._isRunning = false
mainTask.cont(error, true)

but because it handles the error in another function call, mainTask.cont(error, true), the browser engine (Chrome) ignores that expression, so the error remains unhandled.

How can we fix that?

  • make the whole process async - no, as it breaks the current contract
  • TCO - no, it is only supported in Safari
  • switch from recursion to iteration - probably, but that could require significant code changes (see the sketch below)
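
To make the third option concrete, here is a toy illustration (not redux-saga's actual proc.js) of driving a generator of synchronously resolved values by recursion versus by iteration:

// toy drivers - hypothetical code, only meant to show the stack behaviour
function runRecursive(iterator, input) {
  const { value, done } = iterator.next(input)
  if (done) return value
  return runRecursive(iterator, value) // one extra frame per yield
}

function runIterative(iterator) {
  let step = iterator.next()
  while (!step.done) {
    step = iterator.next(step.value) // same frame for every yield
  }
  return step.value
}

function* sums() {
  let total = 0
  for (let k = 1; k <= 1000000; k += 1) {
    total = yield k + total
  }
  return total
}

console.log(runIterative(sums())) // 500000500000
// runRecursive(sums()) throws "Maximum call stack size exceeded" in V8

Only the iterative driver survives a long run of sync resolutions; the recursive one hits the engine's stack limit, and as noted above we cannot rely on TCO outside Safari.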

@Andarist
Member

I would like to explore this:

switch from recursion to iteration - probably, but that could require significant code changes

I know it would be a significant change, but it would fix some long-standing issues. I'm not entirely sure yet how feasible it is to do that now, though - I need to make a code assessment.

@mshustov
Contributor

mshustov commented Sep 17, 2018

I'm on vacation, so I'm working on this from time to time. I've had a look at the case; some notes:
On the whole, it seems we cannot get rid of the recursion completely with small changes, as some effects are resolved asynchronously. But I added an iterative approach alongside the recursion for effects that are resolved immediately (sync): https://github.com/redux-saga/redux-saga/blob/master/packages/core/src/internal/proc.js#L227-L264

There are some problems:

  1. next is being mutated for each call, with cancel being rewritten. OK, this can be solved.
  2. I have a failing test: https://github.com/redux-saga/redux-saga/blob/master/packages/core/__tests__/interpreter/takeSync.js#L522 - I need to take a look at the scheduling.

@Andarist
Member

Do you have this on a pushed branch?

@mshustov
Contributor

mshustov commented Sep 18, 2018

https://github.com/redux-saga/redux-saga/compare/flatten-stack-experiment
Edited: it seems that's not a scheduling problem, but the forking model doesn't play well with that implementation :(

@Andarist
Member

Andarist commented Oct 2, 2018

Could you describe the issue you faced in more detail? My idea for a fix would actually be quite similar - tracking isSync in the closure and queueing tasks. Why exactly does it fail? Knowing why might help figure out how to tackle this properly.

An alternative solution would be to use some kind of special completion value, but that would force us to return each sync value (instead of passing it through the completion callback) and would probably make handlers like runPutEffect more complicated.
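
To sketch what that could look like (a standalone toy - all of these names are made up and none of this is redux-saga's code): effect runners would return a tagged sync result instead of invoking the completion callback, and the driver would loop on tagged results, only recursing on async completions.

const SYNC = Symbol('sync')

// toy "effect runners" - hypothetical, not redux-saga's effectRunnerMap
const runners = {
  ADD(payload) {
    return { [SYNC]: payload.a + payload.b } // resolved synchronously
  },
  SLEEP(payload, cb) {
    setTimeout(() => cb(undefined), payload.ms) // resolved asynchronously
  },
}

function drive(iterator, input) {
  while (true) {
    const { value: effect, done } = iterator.next(input)
    if (done) return
    const onAsyncResult = res => drive(iterator, res) // async completion restarts the loop in a fresh frame
    const result = runners[effect.type](effect.payload, onAsyncResult)
    if (result && SYNC in result) {
      input = result[SYNC] // stay in this frame: no stack growth
    } else {
      return // wait for the async callback
    }
  }
}

function* demo() {
  let total = 0
  for (let k = 1; k <= 1000000; k += 1) {
    total = yield { type: 'ADD', payload: { a: k, b: total } }
  }
  yield { type: 'SLEEP', payload: { ms: 0 } }
  console.log('total:', total) // 500000500000
}

drive(demo())

It works, but it is exactly the complication mentioned above: sync and async effects would no longer share the single callback-based contract.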

@mshustov
Contributor

mshustov commented Oct 2, 2018

An alternative solution would be to use some kind of special completion value, but that would force us to return each sync value (instead of passing it through the completion callback) and would probably make handlers like runPutEffect more complicated.

I considered this option, but that approach:

  • complicates the internals, as we need to use different approaches for sync and async effects
  • will be confusing for effect authors once we introduce custom effects

Regarding the problem with the fork model: we have a failing test, https://github.com/redux-saga/redux-saga/blob/master/packages/core/__tests__/interpreter/takeSync.js#L522

How the fork model works without the changes:
finishes s3
finishes s1
finishes s2

How the fork model works with the changes:
finishes s3
finishes s2
finishes s1

It even looks like the more logical sequence, but I'm not sure we wouldn't break any users' code.

As I understand it, the test fails due to internal scheduling, but I don't entirely understand why - all steps are synchronous. It's easy to fix the test by running it as

function* root(){
  yield fork(s1)
}
middleware.run(root)

instead of

middleware.run(s1)

@Andarist
Member

Andarist commented Oct 3, 2018

Well, this should be unrelated - but it reminds me that the root saga should be considered a regular task (it's just being run from the outside instead of being forked inside), and thus normal scheduling should apply to it.

So we should suspend the scheduler before running it. I can dig up a test case for this, as I had one somewhere; I'm going to create a ticket for it.

But going back to the topic here - I'm going to play around with your code when I find the time and try to pinpoint the problem. BTW, does your queue ever become bigger than 1?

@mshustov
Contributor

mshustov commented Oct 3, 2018

BTW, does your queue ever become bigger than 1?

No. It was just easier for the brain to think of it as a 'queue'; actually it holds 0 or 1 elements. We should switch to an ordinary variable later - that's a minor detail.

@mshustov
Contributor

mshustov commented Oct 5, 2018

@Andarist I updated the branch to get rid of the array.
Indeed, I was able to fix the failing test by adding the following in https://github.com/redux-saga/redux-saga/blob/master/packages/core/src/internal/runSaga.js:

+ suspend()
const task = proc(env, iterator, context, effectId, getMetaInfo(saga), null)
+ flush()

@feichao93
Member

feichao93 commented Oct 7, 2018

I've done a few tests against the flatten-stack-experiment branch. Sadly, I found that due to the internal scheduling, all put/take effects are async - the isSync flag is false when these effects are resolved. I pushed my tests to the print-is-sync branch; you can check it out and run the following command under the packages/core/ folder.

`yarn bin`/jest  --testPathPattern='interpreter/takeSync.js' --testNamePattern='deeply nested forks/puts'

As put/take are the most-used effects in applications, it's a pity that only a few effects (select/fork/cancel/channel, etc.) can benefit from the stack reduction.

@Andarist
Member

Andarist commented Oct 7, 2018

Indeed, I was able to fix the failing test by adding the following in https://github.com/redux-saga/redux-saga/blob/master/packages/core/src/internal/runSaga.js

While this may fix the failing test, I don't think it should be considered a proper fix - it means the internal order of things has changed, and this refactor should be completely transparent; otherwise I'm not confident it doesn't break some other real-world use cases that aren't currently covered by our test suite.

@shinima takes are indeed async, as they act as watchers for actions, but their resolution is sync.

puts are completely sync; the only hard part of this problem is that they are not necessarily executed right away when they are scheduled, but we have certainty that they will get executed in the same frame in which they are scheduled.

So in theory it should be possible, at least for puts, to benefit from the stack reduction; it's just tricky to pull off for some reason, probably because they might be handled by different tasks.

@mshustov
Contributor

mshustov commented Oct 7, 2018

@Andarist I'm a bit confused, since it's exactly the same fix as #1628

@Andarist
Member

Andarist commented Oct 8, 2018

The fix is the same, but the other PR targets a specific situation - it makes the behaviour of root sagas the same as forked ones.

But when it comes to "stack flattening", I believe the change should be transparent to the scheduling; nothing should really change, as our desired change is solely about avoiding a stack overflow, not about changing how things behave.

@mshustov
Contributor

mshustov commented Oct 8, 2018

@Andarist I completely agree with this point of view; I'll put this on hold until #1628 is merged.

@Andarist
Member

Andarist commented Oct 8, 2018

I believe we should still try (if possible) to implement stack flattening on top of the older code, without #1628 merged in. That way we'd be more confident that the change is indeed transparent to current use cases.

@omegdadi

Hello! Any updates on this issue?

@Andarist
Member

Stack flattening seems hard to implement, we couldn't do it quickly - it would require further exploration.

Maybe what we could do is implement a DEV-only counter and warn when encountering such a pattern in the code, well before it actually throws on its own? 🤔
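
Roughly, the idea would look like this (a sketch only - these names do not exist in the codebase): count consecutive synchronously resolved effects and warn well before the engine's own stack limit.

let syncRunLength = 0
const WARN_THRESHOLD = 1000 // assumption: comfortably below typical engine stack limits

function trackEffectResolution(resolvedSynchronously) {
  if (process.env.NODE_ENV !== 'production') {
    syncRunLength = resolvedSynchronously ? syncRunLength + 1 : 0
    if (syncRunLength === WARN_THRESHOLD) {
      console.warn(
        'A very long run of synchronously resolved effects was detected. ' +
          'This can overflow the call stack and terminate the saga silently; ' +
          'consider yielding a real async effect (e.g. delay(0)) inside the loop.'
      )
    }
  }
}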

@danenania

danenania commented Apr 17, 2019

@Andarist This one has been biting us pretty badly in production--we have a yield call(... containing some simple logic inside a while loop that started occasionally failing silently and stalling after ~100+ iterations. We spent about a week debugging before finally zeroing in on this issue as the cause. While you all figure out a solution, I'd recommend adding some prominent warnings about this to the README/docs so others can be saved from a similar fate.

@nwpray

nwpray commented May 2, 2019

I was running a test to see the performance difference between a regular function call and a yield call of the same function. I saw the stack overflow at about 1170 iterations, even when only using a function that returns null. Here is my sandbox:
https://codesandbox.io/s/x0r91y1x4

@oyshan

oyshan commented Aug 20, 2019

@Andarist What is the status on this?
We have code that loops over possibly thousands of objects, yield calling a saga that performs async operations on each iteration. But after about 170 iterations it seems to fail silently and stall, like in @danenania's case.

My case:

import { call } from 'redux-saga/effects'

function* a() {
    yield call(loop)
    return true // this never returns
}

function* loop() {
    // objects: array of thousands++ objects
    for (const obj of objects) {
        yield call(anotherAsyncSaga, obj) // stalls after ~170 iterations
    }
}

Is there a possible workaround for this? This seems like a major drawback of redux-saga if you cannot loop for more than ~100 iterations, and it is something that should be documented and communicated to all users.

@Andarist
Member

The current workaround is to implement simple backpressure in the loop. You could even use microtasks to make sure no other code executes in between.

I've tried to change the internal implementation, but couldn't figure out how to do that properly while keeping all existing tests passing. If you have the time and resources to work on improving the story around this, I would gladly accept a PR fixing it.

@oyshan

oyshan commented Aug 21, 2019

Thanks for the fast reply, @Andarist. Do you have any code examples of implementing simple backpressure in loops and/or using microtasks?

@Andarist
Member

Andarist commented Aug 21, 2019

Promises (if you have access to native ones) can be used to schedule things on the microtask queue. One of the alternatives in a browser is to use a MutationObserver, but that's slightly more complex to implement. You can check out how core-js implements it here.

Anyhow, with Promises you could do this:

import { call } from 'redux-saga/effects'

const deferred = Promise.resolve()
const microTick = deferred.then.bind(deferred)

function* loop() {
    // objects: array of thousands++ objects
    for (const obj of objects) {
        yield call(anotherAsyncSaga, obj)
        yield call(microTick)
    }
}
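
If you are on redux-saga v1, where delay is exported from 'redux-saga/effects', a rough variation of the same back-pressure idea (sketched here with the same assumed objects / anotherAsyncSaga names as above) is to only pay the async hop every N iterations:

import { call, delay } from 'redux-saga/effects'

function* loop() {
    let i = 0
    for (const obj of objects) {
        yield call(anotherAsyncSaga, obj)
        i += 1
        if (i % 50 === 0) {
            // a real async effect every 50 iterations lets the call stack unwind;
            // the safe interval is an assumption and depends on the engine
            yield delay(0)
        }
    }
}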

@oyshan

oyshan commented Aug 21, 2019

Thanks again, @Andarist. That workaround solved the problem for now :)
Is there any ongoing work to prevent it from failing silently? That would make it easier to identify what's going on.

@Andarist
Member

There is no ongoing work on this right now. I've tried to fix it once but failed, and I don't have much time right now to get back to it. I've got some new ideas on how I might try to solve it, but it really would be an experiment - I have no idea if my ideas are any good.

I encourage anyone interested to try to fix this and prepare a PR for that.

@nabeelItilite

Same issue. Any solution?

@Andarist
Member

I'll be looking into fixing this in October, but don't get your hopes up - can't promise that I will actually succeed.

@lnunno

lnunno commented Jan 8, 2020

Hey @Andarist, I wanted to check and see if you've had the chance to look into this. Even surfacing the error would be a big help; debugging this issue was pretty challenging, and the error would only surface intermittently when placing try/catches at the right scope in the saga hierarchy.

@Andarist
Member

Andarist commented Jan 8, 2020

I've tried to fix it but, unfortunately, I've failed. Every fix I've attempted has changed the order in which certain things happen; it has impacted the internal scheduler a little bit too much.

It would certainly be possible to introduce a log for this. Would you be willing to implement it?

@geydson

geydson commented Mar 11, 2020

@Andarist Am I facing the same problem? Any solution?

I'm making a call to an API that returns more than 1000 records, and it just stays in an infinite loop calling the API :/

@Andarist
Member

It's a long-standing issue that would be awesome to fix. I don't have time to do this and I've already failed in the past - maybe somebody smarter than me could take a stab at it. I think it's reasonable to put a bounty on this issue - whoever fixes the problem can get 1000 USD from our OpenCollective (assuming it has enough funds to pay this out).

@EarlOld

EarlOld commented May 3, 2022

@joes-code why do you use yield on this line? Just remove the yield and everything works fine - `k + total` is not an asynchronous operation.

@EarlOld

EarlOld commented May 3, 2022

If you want to use an async operation in a loop, just add yield delay(0); to make it work correctly:

import { delay } from "redux-saga";

export default function* processBigLoop() {
    const LOOP_COUNT = 5;
    const ADD_UP_TO = 900;
    try {
        for (let i = 0; i < LOOP_COUNT; i += 1) {
            console.log('loop #', i);
            let total = 0;
            yield delay(0);
            for (let k = 1; k <= ADD_UP_TO; k += 1) {
                total = yield k + total;
            }
            console.log(i, '-', total);
        }
    } catch (error) {
        console.log('error', error);
    }
    return 'finished';
}

@stevenpal

I encountered this issue as well while using a yield call effect to invoke another iterator from within a for loop (similar to the example that @joes-code provided). The called iterator then made an asynchronous call using yield call. But just doing a yield call in a big loop to any iterator (whether it does anything or not) also causes the issue.

It was looping through roughly 500-600 items and was silently crashing the saga, which basically brought my app to a halt with no errors being thrown, because I have everything wired up through sagas. The silent crash certainly made it very difficult to narrow down. I initially solved this by just reducing the number of items being iterated through, but I can't really control the amount of data my app needs to process, so that seemed like a stop-gap. I then also tried the workaround @EarlOld suggested:

If you want to use an async operation in a loop, just add yield delay(0); to make it work correctly

That actually worked even when iterating through many more cycles.

@EarlOld - can you share why putting the delay(0) effect into the loop works? I'm trying to understand what's going on under the hood to see if this is a robust solution or just a stop-gap until this issue is fixed. Thanks again!

neurosnap added a commit to thefrontside/continuation that referenced this issue Mar 20, 2023
Previously, inside a `shift()` instruction we would always return the
value of `k()`. This caused another tick in the stack frame, which
caused the stack to grow until an overflow was reached. Previously
it only took about 1495 `shift()` calls inside a loop to cause a stack
overflow.

Now we provide the ability for the end user to decide whether or not
they care about the return value of `k()` with the addition of
`k.tail()`, which will continue in the same stack frame instead of creating
a new one, reducing the likelihood of a stack overflow.

The reason we started investigating this inside `continuation` was
a previous issue in `redux-saga`:
redux-saga/redux-saga#1592

Co-authored-by: Charles Lowell <[email protected]>