如何用spark中的其他值替换数组中的项?

yfwxisqw  于 2021-05-16  发布在  Spark
关注(0)|答案(1)|浏览(682)
  1. +--------------------------------+
  2. |Subject |
  3. +--------------------------------+
  4. |[English, Math, Science, Spark] |
  5. +--------------------------------+
  6. |[English, History, Art] |
  7. +--------------------------------+

我们如何用两排的英语来代替英语?

mzillmmw

mzillmmw1#

使用自定义自定义项替换单词:

  1. val replace = udf{ x: Seq[String] => x.map(y => if(y == "English") "ENGLISH" else y) }
  2. val df2 = df.select(replace($"Subject").alias("Subject"))
  3. df2.show(false)
  4. +-------------------------------+
  5. |Subject |
  6. +-------------------------------+
  7. |[ENGLISH, Math, Science, Spark]|
  8. |[ENGLISH, History, Art] |
  9. +-------------------------------+

相关问题