Hive Python UDF error

rggaifut  posted on 2021-06-02  in  Hadoop

I have a simple Python script:


#!/usr/local/bin/python

import sys
import datetime
for line in sys.stdin:
line = line.strip()
fname , lname = line.split('\t')
l_name = lname.lower()
print '\t'.join([fname, str(l_name)])

The Hive table data looks like this:

Akash   Gupta
Ashish  Agarwal
Aarav   Kedia
Rajesh  Lakhia
Sunita  Patel
Raj     Dutta
Nadeem  Siddiqui

The table structure is:

hive> desc fullName;
OK
fname                   string
lname                   string

I added my Python script with:

add FILE /full-path-to-the-script/convertToLowerCase.py;

Now I run the transform with the script:

SELECT TRANSFORM(fname, lname) USING 'python convertToLowerCase.py' AS (fname, l_name) FROM fullName;

However, the MapReduce job throws an error: FAILED: Execution Error, return code 20003 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask. An error occurred when trying to close the Operator running your custom script. What am I doing wrong?

agxfikkp1#

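The problem is in the Python code: the body of the for loop is not indented. With the loop written as shown in the question, Python raises an IndentationError before it reads any input, so the script exits and Hive reports the failure. Fixing the indentation of the for loop solved the problem.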
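
To illustrate the fix, here is a minimal sketch of the script with the loop body indented. It is not the asker's final script: the helper names `transform_line` and `main` are hypothetical, and it writes with `sys.stdout.write` instead of the Python 2 `print` statement so that the same file should run under either Python 2 or Python 3. It also assumes every row has exactly two tab-separated fields, as in the table from the question.

```python
#!/usr/bin/env python
import sys


def transform_line(line):
    """Return 'fname<TAB>lowercased lname' for one tab-separated input row."""
    # Assumption: each row holds exactly two fields; otherwise split() raises ValueError.
    fname, lname = line.rstrip("\n").split("\t")
    return "\t".join([fname, lname.lower()])


def main(stream=sys.stdin, out=sys.stdout):
    for line in stream:
        # Everything that runs once per row must be indented under the for loop.
        out.write(transform_line(line) + "\n")


if __name__ == "__main__":
    main()
```

Piping a tab-separated line into the script outside Hive is a quick way to check it before running the query: a missing indent then shows up as a Python traceback rather than as return code 20003.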
