如何在配置单元中的分区表之间移动数据

mrfwxfqh 于 2021-06-28 发布在 Hive

关注(0)|答案(1)|浏览(392)

我有两张table emp1 以及 emp2 具有字段-
用户ID
名称
职业
国家 emp1 在国家和地区上有分歧 emp2 在职业上有分歧
如何从中移动数据 emp1 至 emp2

Hive partitioning

来源：https://stackoverflow.com/questions/39721429/how-to-move-data-between-partitioned-tables-in-hive

1条答案

按热度按时间

l7mqbcuq1#

用emp1 plus中的数据集覆盖目标表( union all )emp2表中的旧数据。注意 distribute by 在查询的最后-这是为了优化分区创建，最终的缩减器将只接收它们的分区数据，这将减少内存消耗。

insert overwrite table emp2 partition(occupation) 
select userid, name, country, occupation from emp1 
union all
select userid, name, country, occupation from emp2
distribute by occupation;

此外，您还可以使用行\ u number（）添加或删除重复项。

赞(0）回复(0）举报 2021-06-28

我来回答

如何在配置单元中的分区表之间移动数据

1条答案

相关问题

热门标签

最新问答