我的剧本不断地出错
java.lang.RuntimeException: native snappy library not available: this version of libhadoop was built without snappy support
代码本身如下所示
WITH step1 AS(
SELECT columns
FROM t1 stg
WHERE time_key < '2017-04-08' AND time_key >= DATE_ADD('2017-04-08', -31)
GROUP BY columns
HAVING conditions1
)
, step2 AS(
SELECT columns
FROM t2
WHERE conditions2
)
, step3 AS(
SELECT columns
FROM stg
JOIN comverse_sub
ON conditions3
)
INSERT INTO TABLE t1 PARTITION(time_key = '2017-04-08')
SELECT columns
FROM step3
WHERE conditions4
我检查了是否安装了snappy
hadoop checknative -a
得到了
snappy: true /usr/hdp/2.5.0.0-1245/hadoop/lib/native/libsnappy.so.1
我对tez的设置是
set tez.queue.name=adhoc;
set hive.execution.engine=tez;
set hive.tez.container.size=4096;
set hive.auto.convert.join=true;
set hive.exec.parallel=true;
set hive.tez.auto.reducer.parallelism=true;
SET hive.exec.compress.output=true;
SET tez.runtime.compress=true;
SET tez.runtime.compress.codec=org.apache.hadoop.io.compress.SnappyCodec;
我还应该注意到,并不是所有关于tez的脚本都会失败。一些工作。就像这个
WITH hist AS(
SELECT columns
FROM t1
WHERE conditions1
)
INSERT INTO TABLE t1 PARTITION(time_key)
SELECT columns
FROM hist
INNER JOIN t2
on conditions2
INNER JOIN t3
ON conditions3
WHERE conditions4
为什么会这样?
我检查了这个还有这个还有这个。没用。而且,当我在mr上运行脚本时,它们都能工作。
1条答案
按热度按时间dba5bblo1#
我解决了。我一直在添加设置,直到成功。还是不知道是什么问题,解决方案不是很好,但暂时可以。