Pig的分层抽样

cwxwcias  于 2021-06-21  发布在  Pig
关注(0)|答案(0)|浏览(208)

我正尝试使用以下代码在pig中实现分层抽样:

  1. REGISTER datafu-1.2.0.jar
  2. DEFINE SRS datafu.pig.sampling.SimpleRandomSample('0.01');
  3. pop = LOAD 'pop';
  4. grouped = GROUP pop BY metroid;
  5. strsampled = FOREACH grouped GENERATE FLATTEN(SRS(pop));
  6. strsampled2 = FOREACH (GROUP strsampled all) GENERATE FLATTEN(strsampled);
  7. STORE strsampled2 INTO 'strsample';

但我得到以下错误:

  1. ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2997: Encountered IOException. Call From pdnhwhdplinc04.xxxxx.local/0.0.0.0 to pnnhwhdplinc01.xxxxx.local:8020 failed on connection exception: java.net.ConnectException: Connection refused; For more details see: http://wiki.apache.org/hadoop/ConnectionRefused

有人能提供一些见解吗?
谢谢!

暂无答案!

目前还没有任何答案,快来回答吧!

相关问题