Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve reliability of long requests #26

Open
smithara opened this issue Mar 2, 2019 · 1 comment
Open

Improve reliability of long requests #26

smithara opened this issue Mar 2, 2019 · 1 comment

Comments

@smithara
Copy link
Member

smithara commented Mar 2, 2019

Sometimes a long request (probably chunked) can fail because the server is down for redeployment, or the connection is otherwise lost. This causes the whole chunked request to fail with a ConnectionRefusedError. It would be good to handle this case by waiting and resubmitting the job and continuing.

@pacesm
Copy link
Collaborator

pacesm commented Mar 2, 2019

FYI, I case of asynchronous requests it might be a bit more complicated.

When a connection fails during posting of a new job the job is not created. But when the connection failure happens while polling the job status the job still exists on the server. The asynchronous jobs are never lost and they are always executed until they either successfully finish or fail with an error.

In case of a server restart, the running jobs are interrupted but they get automatically reloaded when the server is up and running again.

In other words, once an asynchronous job was accepted by the server the client should assume it still exists even if its status cannot be temporally retrieved.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants