kubernetes服务无法与kafka一起按预期工作

nfzehxib  于 2021-06-07  发布在  Kafka
关注(0)|答案(1)|浏览(435)

我正在尝试将zookeeper和kafka设置为共享命名空间中独立的kubernetes部署/pods。我已经通过kubeadm在我的ubuntu沙盒上引导了本地的k8s1.8和calico。。。
对于Zookeeper,我用的是图片zookeeper:3.4 from 我和hub.docker.com创建了一个kubernetes部署和服务,在这里我公开了端口:2181 2888 3888。服务名是zookeeper,我假设我应该能够通过名称空间中的pod中的这个主机名来使用它。
对于Kafka1.0,我创建了自己的容器映像,可以用环境变量控制它。。。我正在设置zookeeper.connect到zookeeper:2181. 我假设kubernetes dns会解决这个问题并打开到服务的连接。
不幸的是我得到:

  1. [2018-01-03 15:48:26,292] INFO Waiting for keeper state SyncConnected (org.I0Itec.zkclient.ZkClient)
  2. [2018-01-03 15:48:32,293] INFO Terminate ZkClient event thread. (org.I0Itec.zkclient.ZkEventThread)
  3. [2018-01-03 15:48:46,286] INFO Opening socket connection to server zookeeper.sandbox.svc.cluster.local/10.107.41.148:2181. Will not attempt to authenticate using SASL (unknown error) (org.apache.zookeeper.ClientCnxn)
  4. [2018-01-03 15:48:46,299] INFO Socket connection established to zookeeper.sandbox.svc.cluster.local/10.107.41.148:2181, initiating session (org.apache.zookeeper.ClientCnxn)
  5. [2018-01-03 15:48:46,319] INFO Session establishment complete on server zookeeper.sandbox.svc.cluster.local/10.107.41.148:2181, sessionid = 0x10000603c560001, negotiated timeout = 6000 (org.apache.zookeeper.ClientCnxn)
  6. [2018-01-03 15:48:46,331] INFO Session: 0x10000603c560001 closed (org.apache.zookeeper.ZooKeeper)
  7. [2018-01-03 15:48:46,333] FATAL Fatal error during KafkaServer startup. Prepare to shutdown (kafka.server.KafkaServer)
  8. org.I0Itec.zkclient.exception.ZkTimeoutException: Unable to connect to zookeeper server 'zookeeper:2181' with timeout of 6000 ms
  9. at org.I0Itec.zkclient.ZkClient.connect(ZkClient.java:1233)
  10. at org.I0Itec.zkclient.ZkClient.<init>(ZkClient.java:157)
  11. at org.I0Itec.zkclient.ZkClient.<init>(ZkClient.java:131)
  12. at kafka.utils.ZkUtils$.createZkClientAndConnection(ZkUtils.scala:115)
  13. at kafka.utils.ZkUtils$.withMetrics(ZkUtils.scala:92)
  14. at kafka.server.KafkaServer.initZk(KafkaServer.scala:346)
  15. at kafka.server.KafkaServer.startup(KafkaServer.scala:194)
  16. at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:38)
  17. at kafka.Kafka$.main(Kafka.scala:92)
  18. at kafka.Kafka.main(Kafka.scala)

所以我假设我的集群中有一个通用的网络问题,然后我注意到一些更让我困惑的事情。。。如果我将zookeeper.connect设置为10.107.41.148:2181(zookeeper服务的当前地址),则连接正常(至少从kafka到zookeeper)。

  1. [2018-01-03 15:51:31,092] INFO Waiting for keeper state SyncConnected (org.I0Itec.zkclient.ZkClient)
  2. [2018-01-03 15:51:31,094] INFO Opening socket connection to server 10.107.41.148/10.107.41.148:2181. Will not attempt to authenticate using SASL (unknown error) (org.apache.zookeeper.ClientCnxn)
  3. [2018-01-03 15:51:31,105] INFO Socket connection established to 10.107.41.148/10.107.41.148:2181, initiating session (org.apache.zookeeper.ClientCnxn)
  4. [2018-01-03 15:51:31,134] INFO Session establishment complete on server 10.107.41.148/10.107.41.148:2181, sessionid = 0x10000603c560005, negotiated timeout = 6000 (org.apache.zookeeper.ClientCnxn)

通过这个设置,我可以使用kubernetes集群主机上的zookeeper服务来执行例如“bin/kafka-topics.sh--list--zookeeper 10.107.41.148:2181”。产生一个信息是行不通的。。。我假设一旦网络正常工作,我需要添加Kafka广告地址。。。

  1. kafka-console-producer.sh --broker-list 10.100.117.196:9092 --topic test1
  2. >test-msg1
  3. >[2018-01-03 17:05:35,689] WARN [Producer clientId=console-producer] Connection to node 0 could not be established. Broker may not be available. (org.apache.kafka.clients.NetworkClient)

任何提示我的kubernetes网络设置有什么问题,或者至少从哪里开始故障排除?
谢谢并致以最诚挚的问候,帕维尔

new9mtju

new9mtju1#

如果使用statefulset,则需要首先部署服务。
这是我的服务。

  1. apiVersion: v1
  2. kind: Service
  3. metadata:
  4. name: zookeeper
  5. labels:
  6. app: zookeeper
  7. spec:
  8. clusterIP: None
  9. ports:
  10. - port: 2181
  11. name: client
  12. - port: 2888
  13. name: server
  14. - port: 3888
  15. name: leader-election
  16. selector:
  17. app: zookeeper

这里是configmap(稍后使用):

  1. apiVersion: v1
  2. kind: ConfigMap
  3. metadata:
  4. name: zookeeper-cm
  5. data:
  6. jvm.heap: "1G"
  7. tick: "2000"
  8. init: "10"
  9. sync: "5"
  10. client.cnxns: "60"
  11. snap.retain: "3"
  12. purge.interval: "0"

这是我的状态集:

  1. apiVersion: apps/v1beta1
  2. kind: StatefulSet
  3. metadata:
  4. name: zookeeper
  5. spec:
  6. serviceName: zookeeper
  7. replicas: 1
  8. template:
  9. metadata:
  10. labels:
  11. app: zookeeper
  12. spec:
  13. affinity:
  14. podAntiAffinity:
  15. requiredDuringSchedulingIgnoredDuringExecution:
  16. - labelSelector:
  17. matchExpressions:
  18. - key: "app"
  19. operator: In
  20. values:
  21. - zookeeper
  22. topologyKey: "kubernetes.io/hostname"
  23. containers:
  24. - name: zookeeper
  25. imagePullPolicy: IfNotPresent
  26. image: sorintdev/zookeeper:20171204m
  27. resources:
  28. requests:
  29. memory: 500Mi
  30. cpu: 200m
  31. ports:
  32. - containerPort: 2181
  33. name: client
  34. - containerPort: 2888
  35. name: server
  36. - containerPort: 3888
  37. name: leader-election
  38. env:
  39. - name : ZK_REPLICAS
  40. value: "1"
  41. - name : ZK_HEAP_SIZE
  42. valueFrom:
  43. configMapKeyRef:
  44. name: zookeeper-cm
  45. key: jvm.heap
  46. - name : ZK_TICK_TIME
  47. valueFrom:
  48. configMapKeyRef:
  49. name: zookeeper-cm
  50. key: tick
  51. - name : ZK_INIT_LIMIT
  52. valueFrom:
  53. configMapKeyRef:
  54. name: zookeeper-cm
  55. key: init
  56. - name : ZK_SYNC_LIMIT
  57. valueFrom:
  58. configMapKeyRef:
  59. name: zookeeper-cm
  60. key: tick
  61. - name : ZK_MAX_CLIENT_CNXNS
  62. valueFrom:
  63. configMapKeyRef:
  64. name: zookeeper-cm
  65. key: client.cnxns
  66. - name: ZK_SNAP_RETAIN_COUNT
  67. valueFrom:
  68. configMapKeyRef:
  69. name: zookeeper-cm
  70. key: snap.retain
  71. - name: ZK_PURGE_INTERVAL
  72. valueFrom:
  73. configMapKeyRef:
  74. name: zookeeper-cm
  75. key: purge.interval
  76. - name: ZK_CLIENT_PORT
  77. value: "2181"
  78. - name: ZK_SERVER_PORT
  79. value: "2888"
  80. - name: ZK_ELECTION_PORT
  81. value: "3888"
  82. command:
  83. - bash
  84. - -c
  85. - zkGenConfig.sh && zkServer.sh start-foreground
  86. readinessProbe:
  87. exec:
  88. command:
  89. - "zkOk.sh"
  90. initialDelaySeconds: 10
  91. timeoutSeconds: 5
  92. livenessProbe:
  93. exec:
  94. command:
  95. - "zkOk.sh"
  96. initialDelaySeconds: 10
  97. timeoutSeconds: 5
  98. volumeMounts:
  99. - name: data
  100. mountPath: /var/lib/zookeeper
  101. securityContext:
  102. runAsUser: 1000
  103. fsGroup: 1000
  104. volumeClaimTemplates:
  105. - metadata:
  106. name: data
  107. spec:
  108. resources:
  109. requests:
  110. storage: 1Gi
  111. accessModes:
  112. - ReadWriteOnce
  113. storageClassName: zookeeper-class

在您部署了一个正常工作的zookeeper配置并且他们选择了一个主机之后,您可以继续kafka部署。
一旦zookeeper部署完毕,您的kafka配置必须通过服务引用zookeeper StatefolSet。在Kafka中,必须定义/覆盖此属性:

  1. --override zookeeper.connect=zookeeper-0.zookeeper:2181

从一个豆荚里你应该 ping zookeeper-0.zookeeper 成功。
希望这有帮助。

展开查看全部

相关问题