更正Logstash Elasticsearch输出中document_id的语法

t3psigkw 于 2022-12-09 发布在 Logstash

关注(0)|答案(1)|浏览(164)

因此，我尝试使用logstash将数据从MongoDB移动到elasticsearch。我不希望写入重复的数据，所以我在输出中使用doc_as_upsert => true沿着document_id参数。这是我的logstash配置文件

input {
  jdbc{
    jdbc_driver_class => "com.dbschema.MongoJdbcDriver"
    jdbc_driver_library => "/path/to/mongojdbc1.8.jar"
    jdbc_user => ""
    jdbc_password => ""
    jdbc_connection_string => "jdbc:mongodb://127.0.0.1:27017/db1"
    statement => "db1.coll1.find({ },{'_id': false})"
  }
}

output {
  elasticsearch {
    hosts => ["http://127.0.0.1:9200"]
    index => "test"
    user => ""
    password => ""
    doc_as_upsert => true
    document_id => "%{datetime}"
  }
}

正如您所看到的，我尝试使用MongoDB文档的 datetime 字段（一个字符串）作为elasticsearch的文档id。

{
    "_index" : "test",
    "_type" : "_doc",
    "_id" : "%{datetime}",
    "_score" : 1.0,
    "_source" : {
        "@timestamp" : "2020-05-28T08:53:28.244Z",
        "document" : {
            # .. some fields ..
            "datetime" : "2020-05-28 14:22:29.133363",
            # .. some fields ..
        },
        "@version" : "1"
    }
}

将字符串%{datetime}用作ID，而不是将datetime字段的值用作_id。如何修复此问题？

logstash

来源：https://stackoverflow.com/questions/62060896/correct-syntax-for-document-id-in-logstash-elasticsearch-output

1条答案

按热度按时间

hpcdzsge1#

document_id字段不在根级别，因此需要将语法更改为以下格式：

document_id => "%{[document][datetime]}"

赞(0）回复(0）举报 2022-12-09

我来回答

更正Logstash Elasticsearch输出中document_id的语法

1条答案

相关问题

热门标签

最新问答