如何将流式查询的结果写入多个数据库表？

7rfyedvj 于 2021-05-29 发布在 Spark

关注(0)|答案(1)|浏览(357)

我正在使用spark结构化流媒体和Kafka主题的阅读。目标是将消息写入多个表的postgresql数据库。
消息架构为：

root
  |-- id: string (nullable = true)
  |-- name: timestamp (nullable = true)
  |-- comment: string (nullable = true)
  |-- map_key_value: map (nullable = true)
    |-- key: string
    |-- value: string (valueContainsNull = true)

在删除map\u key\u值后写入一个表时，使用以下代码：
我的代码是：

message.writeStream.foreachBatch { (batchDF: DataFrame, batchId: Long) =>
    batchDF.write.format("jdbc").option("url", "url")
      .option("user", "username")
      .option("password", "password")
      .option(JDBCOptions.JDBC_TABLE_NAME, "table_1')
      .mode(SaveMode.Append).save();
  }.outputMode(OutputMode.Append()).start().awaitTermination()

我想将消息写入表1（id、name、comment）和表2的两个db表中，表2需要有map\u key\u值。

apache-spark apache-spark-sql spark-structured-streaming

来源：https://stackoverflow.com/questions/62192115/how-to-write-dataframe-streaming-in-mysql-scala