kubernetes 在Kubeflow中部署管道时出现“ERROR:root:Failed to get healthz info attempt 1 of 5.”

q5iwbnjs  于 2023-08-03  发布在  Kubernetes
关注(0)|答案(1)|浏览(115)

我正在尝试使用Kubeflow v2 quickstart
首先,我通过以下方式将Kubeflow部署到本地Kubernetes集群:

export PIPELINE_VERSION="2.0.0-alpha.4"

kubectl apply -k "github.com/kubeflow/pipelines/manifests/kustomize/cluster-scoped-resources?ref=$PIPELINE_VERSION"
kubectl wait --for condition=established --timeout=60s crd/applications.app.k8s.io
kubectl apply -k "github.com/kubeflow/pipelines/manifests/kustomize/env/dev?ref=$PIPELINE_VERSION"

字符串
我港口转发

kubectl port-forward service/ml-pipeline-ui --namespace=kubeflow 38620:80


我可以在http://localhost:38620看到UI
x1c 0d1x的数据
接下来,我安装了kfp 2.0.1。下面是我的代码:

from kfp import client, dsl

@dsl.component
def addition_component(num1: int, num2: int) -> int:
    return num1 + num2

@dsl.pipeline(name="addition-pipeline")
def my_pipeline(a: int, b: int, c: int = 10):
    add_task_1 = addition_component(num1=a, num2=b)
    add_task_2 = addition_component(num1=add_task_1.output, num2=c)

endpoint = "http://localhost:38620"  # <- Not entirely sure if it is correct as it is missing in the quickstart document.
kfp_client = client.Client(host=endpoint)
run = kfp_client.create_run_from_pipeline_func(
    my_pipeline,
    arguments={"a": 1, "b": 2},
)
url = f"{endpoint}/#/runs/details/{run.run_id}"
print(url)


然而,我得到了错误

python src/main.py

/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp/client/client.py:158: FutureWarning: This client only works with Kubeflow Pipeline v2.0.0-beta.2 and later versions.
  warnings.warn(
ERROR:root:Failed to get healthz info attempt 1 of 5.
Traceback (most recent call last):
  File "/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp/client/client.py", line 435, in get_kfp_healthz
    return self._healthz_api.get_healthz()
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp_server_api/api/healthz_service_api.py", line 63, in get_healthz
    return self.get_healthz_with_http_info(**kwargs)  # noqa: E501
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp_server_api/api/healthz_service_api.py", line 134, in get_healthz_with_http_info
    return self.api_client.call_api(
           ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp_server_api/api_client.py", line 364, in call_api
    return self.__call_api(resource_path, method,
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp_server_api/api_client.py", line 188, in __call_api
    raise e
  File "/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp_server_api/api_client.py", line 181, in __call_api
    response_data = self.request(
                    ^^^^^^^^^^^^^
  File "/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp_server_api/api_client.py", line 389, in request
    return self.rest_client.GET(url,
           ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp_server_api/rest.py", line 230, in GET
    return self.request("GET", url,
           ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp_server_api/rest.py", line 224, in request
    raise ApiException(http_resp=r)
kfp_server_api.exceptions.ApiException: (404)
Reason: Not Found
HTTP response headers: HTTPHeaderDict({'X-Powered-By': 'Express', 'Content-Security-Policy': "default-src 'none'", 'X-Content-Type-Options': 'nosniff', 'Content-Type': 'text/html; charset=utf-8', 'Content-Length': '159', 'Date': 'Mon, 03 Jul 2023 03:59:33 GMT', 'Connection': 'keep-alive', 'Keep-Alive': 'timeout=5'})
HTTP response body: <!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Error</title>
</head>
<body>
<pre>Cannot GET /apis/v2beta1/healthz</pre>
</body>
</html>


endpoint中,我使用的是http://localhost:38620,然而,我不完全确定我是否使用了正确的一个,因为它在quickstart document中缺失。
我试着进入

  • http://localhost:38620/apis/v2beta1/healthz
  • http://localhost:38620/#/apis/v2beta1/healthz

他们并不存在
我在https://github.com/kubeflow/kubeflow/issues/5989上发现了类似的问题,但里面没有任何有用的信息。
任何导游都会感激!

xzv2uavs

xzv2uavs1#

UPDATE:我已经打开了一个pull request来更新文档。今天应该不错。

我仔细查看了Python脚本日志。它有一条线:
未来警告:此客户端仅适用于Kubeflow Pipeline v2.0.0-beta.2及更高版本。
原来Kubeflow v2 quickstart已经过时了。
在我将2.0.0-alpha.4替换为2.0.0(您可以在https://github.com/kubeflow/pipelines/releases?q=Version上找到最新版本)并通过以下方式重新部署之后

export PIPELINE_VERSION="2.0.0"

kubectl apply -k "github.com/kubeflow/pipelines/manifests/kustomize/cluster-scoped-resources?ref=$PIPELINE_VERSION"
kubectl wait --for condition=established --timeout=60s crd/applications.app.k8s.io
kubectl apply -k "github.com/kubeflow/pipelines/manifests/kustomize/env/dev?ref=$PIPELINE_VERSION"

字符串
现在管道脚本成功了:

python src/main.py

/Users/hongbo-miao/Library/Caches/pypoetry/virtualenvs/hm-kubeflow-calculate-PriecqfA-py3.11/lib/python3.11/site-packages/kfp/client/client.py:158: FutureWarning: This client only works with Kubeflow Pipeline v2.0.0-beta.2 and later versions.
  warnings.warn(
Experiment details: http://localhost:38620/#/experiments/details/c9d82bc3-712e-422a-804f-fcbe5d2b8acb
Run details: http://localhost:38620/#/runs/details/87398dae-6488-446b-b650-7bbc580aab56
http://localhost:38620/#/runs/details/87398dae-6488-446b-b650-7bbc580aab56


x1c 0d1x的数据
健康端点http://localhost:38620/apis/v2 beta1/healthz也返回

{
  "buildDate": "Tue Jun 20 16:56:00 UTC 2023",
  "frontendCommitHash": "e03e31219387b587b700ba3e31a02df486aa364f",
  "frontendTagName": "2.0.0",
  "apiServerReady": true,
  "apiServerCommitHash": "e03e31219387b587b700ba3e31a02df486aa364f",
  "apiServerTagName": "2.0.0",
  "apiServerMultiUser": false,
  "multi_user": false
}

相关问题