java编写hadoop减少elasticsearch的输出

hl0ma9xz 于 2021-05-30 发布在 Hadoop

关注(0)|答案(1)|浏览(432)

我在理解如何将简单hadoop的输出写回elasticsearch时遇到了一些困难。
作业配置为：

job.setOutputFormatClass(EsOutputFormat.class);
job.setOutputKeyClass(NullWritable.class);
job.setOutputValueClass(MapWritable.class);

减速机有：

final DoubleWritable average = new DoubleWritable(sum / size);

final MapWritable output = new MapWritable();
output.put(key, average);
context.write(NullWritable.get(), output);

但我有一个莫名其妙的例外：

14/08/15 16:59:54 INFO mapreduce.Job: Task Id : attempt_1408106733881_0013_r_000000_2, Status : FAILED Error: 
  org.elasticsearch.hadoop.EsHadoopIllegalArgumentException: 
  [org.elasticsearch.hadoop.serialization.field.MapWritableFieldExtractor@5796fabe] cannot extract value from object [org.apache.hadoop.io.MapWritable@dcdb8e97]    
    at org.elasticsearch.hadoop.serialization.bulk.TemplatedBulk$FieldWriter.write(TemplatedBulk.java:49)   
    at org.elasticsearch.hadoop.serialization.bulk.TemplatedBulk.writeTemplate(TemplatedBulk.java:101)  
    at org.elasticsearch.hadoop.serialization.bulk.TemplatedBulk.write(TemplatedBulk.java:77)   
    at org.elasticsearch.hadoop.rest.RestRepository.writeToIndex(RestRepository.java:130)   
    at org.elasticsearch.hadoop.mr.EsOutputFormat$EsRecordWriter.write(EsOutputFormat.java:161)     
    at org.apache.hadoop.mapred.ReduceTask$NewTrackingRecordWriter.write(ReduceTask.java:558)

在context.write（）调用中。
我有点困惑，知道吗？

Java hadoop elasticsearch elasticsearch-hadoop

来源：https://stackoverflow.com/questions/25329016/writing-hadoop-reduce-output-to-elasticsearch

1条答案

按热度按时间

gv8xihay1#

原来我在工作配置上犯了个错误。我补充了以下内容：

configuration.set("es.mapping.id", "_id");

没有在输出mapwritable中实际添加\u id字段；这导致es抛出异常。
如果mapwritablefleldextractor记录了它失败的字段，这将非常有用。

赞(0）回复(0）举报 2021-05-30

我来回答

java编写hadoop减少elasticsearch的输出

1条答案

相关问题

热门标签

最新问答