sql server—在同一数据集上按不同级别对sql进行分组

ou6hu8tu 于 2021-05-27 发布在 Hadoop

关注(0)|答案(2)|浏览(398)

我有下面的数据集，我希望创建不同的组来计算name下的值的出现次数。
有：（县在串）

name   state  county 
apple   MD      1
apple   DC      1
pear    VA      1
pear    VA      2
pear    CA      5
peach   CO      3
peach   CO      3
peach   CO      2
peach   CO      2

想要：

name   state  county freq_name  freq_state  freq_county
apple   MD      1     2            1            2
apple   DC      1     2            1            2
pear    VA      1     3            2            3
pear    VA      2     3            2            3
pear    CA      5     3            1            3
peach   CO      3     4            4            2
peach   CO      2     4            4            2

我相信通过sql，over partition将允许按不同级别进行计数，例如：

count(name) over (partition by name) as freq_name,
count(name) over (partition by state) as freq_state,
count(name) as freq_county
from have
group by name,state, county;

由于某些原因，这段代码没有为freq\u name提供正确的计数。我还想检查我的freq\u state和freq\u county的代码是否正确。谢谢！

sql hadoop Hive sql-server cloudera

来源：https://stackoverflow.com/questions/60200014/sql-group-by-different-levels-on-the-same-dataset

2条答案

按热度按时间

8hhllhi21#

你似乎想要：

select name, state, county, count(*) as this_count,
       sum(count(*)) over (partition by name) as freq_name,
       sum(count(*)) over (partition by state) as freq_state,
       sum(count(*)) as freq_county
from have
group by name, state, county;

赞(0）回复(0）举报 2021-05-27

bgibtngc2#

对于 freq_name ，使用 count(*) 而不是 count(name) ```
count(*) over (partition by name) as freq_name,
count(name) over (partition by state) as freq_state,
count(name) as freq_county
from have
group by name,state, county;

赞(0）回复(0）举报 2021-05-27

我来回答

sql server—在同一数据集上按不同级别对sql进行分组

2条答案

相关问题

热门标签

最新问答