在下面的pig脚本中,两个表是相继读取还是并行读取?
a = LOAD 'sampledb1.tb1' USING org.apache.hcatalog.pig.HCatLoader();
a_filter = FILTER a BY cpd_dt == '20150602';
b = LOAD 'sampledb2.tb2' USING org.apache.hcatalog.pig.HCatLoader();
b_filter = FILTER b BY cpd_dt == '20150602';
/* do some analysis on the data in the above tables*/
1条答案
按热度按时间p5cysglq1#
从我创建的脚本和表中,我注意到表一个接一个地读取。