Hello, I am using Metaflow and scheduling parallel jobs on AWS Step Functions. My flow script is structured as follows:
start() -> run() -> join() -> end()
In start(), I use foreach to call run() in parallel. I tried using the Client API to access the data from the run() step, as follows:
from metaflow import Flow, Step
flow = Flow('MyFlow')
step = Step('MyFlow/sfn-id/run')
list(step.tasks())  # there are multiple tasks available; I choose one below:
step.task  # is something like: MyFlow/sfn-id/run/task-id
step.task.data  # gives the output: <MetaflowData: >
Since I assigned an attribute called "self.results" in the run() step, I expected to access the results by calling the following:
step.task.data.results, but this raises a KeyError.
Correct, you might be running into this expected behavior. The link has details on how to correctly move state through a foreach. Let me know if that works!
I also tried to see which artifacts are available by calling:
step.task.data._artifacts
but that comes back empty.
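For anyone debugging the same symptom, here is a minimal sketch of checking every task of the foreach step for the artifact before reading it, instead of relying on whichever task step.task happens to return. The helper names collect_artifact and load_step_results are hypothetical and not part of Metaflow. The sketch assumes only that the task's data object supports the "in" operator and attribute access, and 'MyFlow/sfn-id/run' is the placeholder pathspec from the question.

```python
def collect_artifact(tasks, name):
    """Return {task pathspec: artifact value} for the tasks that stored `name`."""
    found = {}
    for task in tasks:
        data = task.data
        # Checking membership first avoids the KeyError raised when the
        # artifact is missing from a task's data.
        if data is not None and name in data:
            found[task.pathspec] = getattr(data, name)
    return found


def load_step_results(step_pathspec, name="results"):
    """Hypothetical convenience wrapper around the Metaflow Client API."""
    # Imported lazily so collect_artifact can be used without Metaflow installed.
    from metaflow import Step

    # Step.tasks() yields one Task per foreach split.
    return collect_artifact(Step(step_pathspec).tasks(), name)
```

Calling load_step_results('MyFlow/sfn-id/run') would then return a dict with one entry per task that actually persisted "results". If it comes back empty, none of the tasks stored the artifact, which points at how the state is being written rather than at how it is being read.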