pig脚本因gc失败:超出开销限制

iugsix8n  于 2021-06-24  发布在  Pig
关注(0)|答案(0)|浏览(237)

我在大约1GB的数据上运行一个pig脚本,它涉及几个groupby和foreach语句。以下是示例清管器代码:
ab=y组(y1、y2、y3、y4、y5、y6);
xy=foreach ab{
=每个y
生成
x1、x2、x3、x4、x5、x6、rel1、rel2;
生成
集团;
} ;
注:rel1和rel2是按组生成的,它们本身也是行李,对于一个[y1、y2、y3、y4、y5、y6],行李大小包含大约448条记录,大小为700mb,pig未通过xy关系,表示超出了gc开销限制。
Yarn原木

2018-08-08 15:01:13,299 INFO [Service Thread] org.apache.pig.impl.util.SpillableMemoryManager: first memory handler call - Collection threshold init = 1148190720(1121280K) used = 5726479864(5592265K) committed = 5726797824(5592576K) max = 5726797824(5592576K), toFree = 3046581752
2018-08-08 15:04:22,192 FATAL [ResponseProcessor for block BP-1779694772-10.xxx.xx.17-1533341581987:blk_1074055963_315162] org.apache.hadoop.yarn.YarnUncaughtExceptionHandler: Thread Thread[ResponseProcessor for block BP-1779694772-10.xxx.xx.17-1533341581987:blk_1074055963_315162,5,main] threw an Error.  Shutting down now...
java.lang.OutOfMemoryError: GC overhead limit exceeded
2018-08-08 15:05:24,112 INFO [ResponseProcessor for block BP-1779694772-10.xxx.xx.17-1533341581987:blk_1074055963_315162] org.apache.hadoop.util.ExitUtil: Halt with status -1 Message: HaltException

暂无答案!

目前还没有任何答案,快来回答吧!

相关问题