我试图找到前5名的关键字评论的每一类产品,我有以下代码
# Group by category and count keyword frequencies
keyword_counts <- filtered_data %>%
group_by(category, keyword) %>%
summarise(n = n()) %>%
arrange(desc(n))
# Find the top 5 keywords in each category
top_keywords_by_category <- keyword_counts %>%
group_by(category) %>%
top_n(5, wt = n) %>%
ungroup() # Ungroup the data
# Print the table
print(top_keywords_by_category)
字符串
它提供的输出
category keyword n
<chr> <chr> <int>
1 Computers&Accessories|Accessories&Peripherals|Cables&Accessori… product 354
2 Computers&Accessories|Accessories&Peripherals|Cables&Accessori… cable 277
3 Computers&Accessories|Accessories&Peripherals|Cables&Accessori… chargi… 200
4 Computers&Accessories|Accessories&Peripherals|Cables&Accessori… quality 179
5 Computers&Accessories|Accessories&Peripherals|Cables&Accessori… nice 147
6 Electronics|WearableTechnology|SmartWatches watch 129
7 Electronics|Mobiles&Accessories|Smartphones&BasicMobiles|Smart… phone 127
8 Electronics|HomeTheater,TV&Video|Televisions|SmartTelevisions tv 117
9 Electronics|WearableTechnology|SmartWatches product 102
10 Electronics|HomeTheater,TV&Video|Televisions|SmartTelevisions product 80
型
虽然我想要的结果是
Category Computers&Accessories
Keyword n
1 Product 354
2 Cable 277
3 Chargi... 200
4 Quality 179
5 Nice 147
型
1条答案
按热度按时间1sbrub3j1#
虽然这些数据并不有趣,但它应该向您展示如何使用
tidyr::separate_rows
。字符串
从这里,你可以做你的前5个过滤和旋转:
型