Aggressive Retry Behavior with Non-Existent SQS Queue for S3 Source #20428

Open
Shadow6363 opened this issue May 3, 2024 · 0 comments
Labels
domain: reliability Anything related to Vector's reliability source: aws_s3 Anything `aws_s3` source related type: bug A code related bug.

A note for the community

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment

Problem

When the SQS queue for an aws_s3 source is deleted, the source retries at an extremely high frequency, seemingly without any backoff. The resulting DNS lookups first exceeded the limits of an AWS VPC DNS resolver. Once that potential issue was addressed, the retries instead exceeded the account-level AWS SQS API limits, which cannot be increased.
The retry frequency is also high enough to elevate the service's CPU usage and trigger upward autoscaling, which further increases the request volume until the upper autoscaling limit is hit.

We would expect the behavior to be similar to when the queue is simply empty. If a special case is needed, we would expect a randomized exponential backoff with an eventual retry limit.
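
To illustrate the kind of backoff described above, here is a minimal sketch in Python. It is not Vector's implementation and says nothing about how the aws_s3 source is structured internally; the function name `backoff_delays` and the parameters `base`, `cap`, and `max_retries` are hypothetical and chosen only for this example. It uses the common "full jitter" variant, where each delay is drawn uniformly between zero and an exponentially growing ceiling.

```python
import random
from typing import Iterator, Optional


def backoff_delays(
    base: float = 1.0,
    cap: float = 300.0,
    max_retries: int = 8,
    rng: Optional[random.Random] = None,
) -> Iterator[float]:
    """Yield one randomized delay (in seconds) per retry attempt, then stop.

    All names and defaults here are illustrative assumptions, not
    Vector configuration options.
    """
    rng = rng or random.Random()
    for attempt in range(max_retries):
        # The ceiling doubles on every attempt until it reaches `cap`.
        ceiling = min(cap, base * (2 ** attempt))
        # Full jitter: pick uniformly in [0, ceiling] so that many
        # instances do not retry in lockstep.
        yield rng.uniform(0.0, ceiling)
```

A caller would sleep for each yielded delay before retrying the queue request, and treat exhaustion of the generator as the retry limit, at which point it could surface a persistent error instead of polling again.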

Configuration

[sources.s3_logs_ads]
type = "aws_s3"
region = "us-east-1"
compression = "gzip"
sqs.queue_url = "https://sqs.us-east-1.amazonaws.com/000000000000/vector-logs-ads-bucket"

Version

0.29.1

Debug Output

No response

Example Data

No response

Additional Context

No response

References

No response

@Shadow6363 Shadow6363 added the type: bug A code related bug. label May 3, 2024
@jszwedko jszwedko added source: aws_s3 Anything `aws_s3` source related domain: reliability Anything related to Vector's reliability labels May 3, 2024