I am testing a data flow with Kafka, Druid, and Superset.
I have some data in Druid (see pic 1).
After that, I can generate the Druid datasources in Superset via the "Refresh Druid Metadata" option (see pic 2). The problem is that when I want to query the data, I get the following error message:
URLError: <urlopen error [Errno -2] Name or service not known>
Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/superset/viz.py", line 329, in get_df_payload
    df = self.get_df(query_obj)
  File "/usr/lib/python2.7/site-packages/superset/viz.py", line 142, in get_df
    self.results = self.datasource.query(query_obj)
  File "/usr/lib/python2.7/site-packages/superset/connectors/druid/models.py", line 1238, in query
    client=client, query_obj=query_obj, phase=2)
  File "/usr/lib/python2.7/site-packages/superset/connectors/druid/models.py", line 959, in get_query_str
    return self.run_query(client=client, phase=phase, **query_obj)
  File "/usr/lib/python2.7/site-packages/superset/connectors/druid/models.py", line 1126, in run_query
    client.timeseries(**qry)
  File "/usr/lib/python2.7/site-packages/pydruid/client.py", line 167, in timeseries
    return self._post(query)
  File "/usr/lib/python2.7/site-packages/pydruid/client.py", line 484, in _post
    res = urllib.request.urlopen(req)
  File "/usr/lib64/python2.7/urllib2.py", line 154, in urlopen
    return opener.open(url, data, timeout)
  File "/usr/lib64/python2.7/urllib2.py", line 431, in open
    response = self._open(req, data)
  File "/usr/lib64/python2.7/urllib2.py", line 449, in _open
    '_open', req)
  File "/usr/lib64/python2.7/urllib2.py", line 409, in _call_chain
    result = func(*args)
  File "/usr/lib64/python2.7/urllib2.py", line 1244, in http_open
    return self.do_open(httplib.HTTPConnection, req)
  File "/usr/lib64/python2.7/urllib2.py", line 1214, in do_open
    raise URLError(err)
URLError: <urlopen error [Errno -2] Name or service not known>
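For context, [Errno -2] is the resolver error EAI_NONAME: the hostname that Superset handed to urllib for the Druid broker could not be resolved by DNS at all, so the query never even reached Druid. A minimal sketch that reproduces the same error, using the hypothetical unresolvable hostname no-such-broker:

import socket

try:
    # 'no-such-broker' is a made-up hostname for illustration;
    # resolving any unknown name fails the same way.
    socket.getaddrinfo('no-such-broker', 8082)
except socket.gaierror as err:
    print(err)  # [Errno -2] Name or service not known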
See also pic 3.
Any idea what the problem is?
I feed Kafka via NiFi, and then I connect the Kafka source to the Druid sink in SAM.
Thank you!
[pic 1]
[pic 2]
[pic 3: no data in Superset]
2 Answers
pu3pd22g#1
It seems Superset has trouble connecting to the broker node. Check the cluster health, in particular the broker and coordinator node logs.
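A quick way to check, without digging through logs first, is the /status endpoint that every Druid node exposes over HTTP. A minimal sketch, assuming the stock Druid ports (8082 for the broker, 8081 for the coordinator) and that both run on localhost; substitute your actual hosts:

import json
import urllib.request  # on the Python 2.7 stack in the traceback, use urllib2 instead

# Probe the Druid broker and coordinator; a healthy node answers
# /status with a JSON document that includes its version.
for name, url in [('broker', 'http://localhost:8082/status'),
                  ('coordinator', 'http://localhost:8081/status')]:
    resp = urllib.request.urlopen(url, timeout=5)
    info = json.load(resp)
    print(name, 'is up, Druid', info.get('version'))

If the broker probe fails with the same [Errno -2], the hostname configured in Superset is the culprit rather than Druid itself.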
ivqmmu1c#2
Problem solved: the broker host was simply not defined in the cluster configuration in the Superset UI. I set it to the value localhost, and now it is up and running.
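To confirm the fix outside Superset, you can point pydruid (the client Superset itself uses, as the traceback shows) at the same broker. A minimal sketch, assuming the broker answers on localhost:8082 and using a hypothetical datasource name my_datasource:

from pydruid.client import PyDruid

# Same client class that appears in the traceback above.
client = PyDruid('http://localhost:8082', 'druid/v2')

# timeBoundary is the cheapest possible query: it returns only the
# earliest and latest event timestamps in the datasource.
query = client.time_boundary(datasource='my_datasource')
print(query.result)

If this prints a result instead of raising URLError, the broker host configured in Superset's cluster settings points at a reachable broker.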