查询hiveserver2时，impyla从org.apache.hadoop.hive.ql.exec.mr.mapredtask返回代码1

wh6knrhe 于 2021-05-29 发布在 Hadoop

关注(0)|答案(1)|浏览(412)

我正在使用impyla查询hive的一些结果，但是遇到了以下问题：
来自impyla：

impala.error.OperationalError: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask

从hiveserver2：

WARNING: Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases.
Query ID = hduser_20180827031927_fdb148b0-725b-434c-a0f8-98b6843d4348
Total jobs = 1
Launching Job 1 out of 1
Number of reduce tasks not specified. Defaulting to jobconf value of: 1
In order to change the average load for a reducer (in bytes):
  set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
  set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
  set mapreduce.job.reduces=<number>
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask

我的源代码是：

from impala.dbapi import connect
import sys

dbName = sys.argv[1:][0]
query = sys.argv[1:][1]

conn = connect(host='192.168.0.10', port=10000, database=dbName , auth_mechanism='NOSASL' , use_ssl=True)
cursor = conn.cursor()
cursor.execute(query, configuration={'hive.exec.reducers.bytes.per.reducer': '100000', 'hive.auto.convert.join.noconditionaltask':'false','mapreduce.job.reduces':'1','hive.auto.convert.join':'false'})
returnData = []
for row in cursor:
    returnData.append(row[0])
pprint(returnData)

如您所见，我添加了许多配置，但它不起作用

hadoop Hive hiveql impyla

来源：https://stackoverflow.com/questions/52029159/impyla-return-code-1-from-org-apache-hadoop-hive-ql-exec-mr-mapredtask-when-quer