You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The spark-k8s-operator example rely on the taxi-trip-execute.sh script to populate test data in an S3 bucket. The current version of this script downloads a 36M file locally and duplicates it 100 times (3.6gb). It then uploads those copies to an S3 bucket. If this script is run from an EC2 instance it takes about 2 minutes. Run from a laptop on wifi, this takes forever.
Rather than doing a sync from local to S3 we can upload the file (local to s3) one time. Then do background S3 to S3 copies in about 40seconds without using the local wifi network at all.
Additionally, there are currently 6 duplicate copies of this script.
[ x] ✋ I have searched the open/closed issues and my issue is not listed.
The text was updated successfully, but these errors were encountered:
raykrueger
changed the title
taxi-trip-execute.sh has poor performance and is duplicated many times
taxi-trip-execute.sh has poor performance and is 6 many times
Apr 4, 2024
raykrueger
changed the title
taxi-trip-execute.sh has poor performance and is 6 many times
taxi-trip-execute.sh has poor performance and is duplicated six times
Apr 4, 2024
Description
The spark-k8s-operator example rely on the taxi-trip-execute.sh script to populate test data in an S3 bucket. The current version of this script downloads a 36M file locally and duplicates it 100 times (3.6gb). It then uploads those copies to an S3 bucket. If this script is run from an EC2 instance it takes about 2 minutes. Run from a laptop on wifi, this takes forever.
Rather than doing a sync from local to S3 we can upload the file (local to s3) one time. Then do background S3 to S3 copies in about 40seconds without using the local wifi network at all.
Additionally, there are currently 6 duplicate copies of this script.
The text was updated successfully, but these errors were encountered: