pipelines: Kubeflow / AI Platform Pipelines runtime context missing when output is taken from cache
What steps did you take:
- deploy pipeline with one component
- run pipeline with one component (π works)
- add another component
- run the pipeline (this time the output is taken from cache) (π fails)
What happened:
The pipeline runs into the following error
Traceback (most recent call last):
File "/opt/conda/lib/python3.7/runpy.py", line 193, in _run_module_as_main
"__main__", mod_spec)
File "/opt/conda/lib/python3.7/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/opt/conda/lib/python3.7/site-packages/tfx/orchestration/kubeflow/container_entrypoint.py", line 360, in <module>
main()
File "/opt/conda/lib/python3.7/site-packages/tfx/orchestration/kubeflow/container_entrypoint.py", line 353, in main
execution_info = launcher.launch()
File "/opt/conda/lib/python3.7/site-packages/tfx/orchestration/launcher/base_component_launcher.py", line 198, in launch
self._exec_properties)
File "/opt/conda/lib/python3.7/site-packages/tfx/orchestration/launcher/base_component_launcher.py", line 167, in _run_driver
component_info=self._component_info)
File "/opt/conda/lib/python3.7/site-packages/tfx/dsl/components/base/base_driver.py", line 270, in pre_execution
driver_args, pipeline_info)
File "/opt/conda/lib/python3.7/site-packages/tfx/dsl/components/base/base_driver.py", line 158, in resolve_input_artifacts
producer_component_id=input_channel.producer_component_id)
File "/opt/conda/lib/python3.7/site-packages/tfx/orchestration/metadata.py", line 948, in search_artifacts
pipeline_info)
RuntimeError: Pipeline run context for PipelineInfo(pipeline_name: sentiment4, pipeline_root: gs://sascha-playground-doit-kubeflowpipelines-default/sentiment4, run_id: sentiment4-qnknl) does not exist
Assume the second component doesnβt find the cached data because the component did not exist in the first run.
First run:

Second run with additional component

What did you expect to happen:
Pipeline run completes without errors
Environment:
AI Platform Pipelines
How did you deploy Kubeflow Pipelines (KFP)? AI Platform Pipelines
KFP version: https://github.com/kubeflow/pipelines/commit/d79071c0bef19442483abc101769a0d893e72f42
KFP SDK version: no pip in AI Platform Pipelines
Anything else you would like to add:
[Miscellaneous information that will assist in solving the issue.]
/kind bug
About this issue
- Original URL
- State: closed
- Created 3 years ago
- Comments: 24 (14 by maintainers)
Commits related to this issue
- fix(tfx): fix missing mlmd data when sdk label is overridden. Fixes #5303 (#6035) — committed to kubeflow/pipelines by Bobgy 3 years ago
@AlirezaSadeghi did you read the docs: https://www.kubeflow.org/docs/distributions/gke/pipelines/upgrade/?
Hi @Bobgy here is the example you can use https://github.com/SaschaHeyer/Sentiment-Analysis-TFX