我有一个简单的python脚本
# !/usr/local/bin/python
import sys
import datetime
for line in sys.stdin:
line = line.strip()
fname , lname = line.split('\t')
l_name = lname.lower()
print '\t'.join([fname, str(l_name)])
配置单元表数据如下所示:
Akash Gupta
Ashish Agarwal
Aarav Kedia
Rajesh Lakhia
Sunita Patel
Raj Dutta
Nadeem Siddiqui
表结构为:
hive> desc fullName;
OK
fname string
lname string
我将我的python脚本添加为:
add FILE /full-path-to-the-script/convertToLowerCase.py;
现在,我正在为脚本运行转换操作:
SELECT TRANSFORM(fname, lname) USING 'python convertToLowerCase.py' AS (fname, l_name) FROM fullName;
但是,map reduce作业正在抛出错误: FAILED: Execution Error, return code 20003 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask. An error occurred when trying to close the Operator running your custom script.
我做错什么了?
1条答案
按热度按时间agxfikkp1#
python代码有问题。这个
Indentation of the For Loop
.解决了问题。