R: Problem grouping data for a bar chart with multiple variables in ggplot2

yjghlzjz, posted 2022-12-20 in Other

I have a data frame df:

    df <- data.frame(
      temp = c(60.80, 46.04, 26.96, 24.98),
      humid = c(93.79, 53.33, 50.34, 54.65),
      wind_speed = c(40.27, 39.12, 14.96, 13.81),
      date = c("2013-01-01", "2013-01-03", "2013-02-01", "2013-02-02")
    )
    df$date <- as.Date(df$date, "%Y-%m-%d")

       temp humid wind_speed       date
    1 60.80 93.79      40.27 2013-01-01
    2 46.04 53.33      39.12 2013-01-03
    3 26.96 50.34      14.96 2013-02-01
    4 24.98 54.65      13.81 2013-02-02

I used the following line of code to reshape it into the long format shown below:

    # stack() the three measurement columns and keep the date alongside them
    df_mod <- cbind(df[4], stack(df[1:3]))

    metric     values date
    temp       60.80  2013-01-01
    temp       46.04  2013-01-03
    temp       26.96  2013-02-01
    temp       24.98  2013-02-02
    humid      93.79  2013-01-01
    humid      53.33  2013-01-03
    humid      50.34  2013-02-01
    humid      54.65  2013-02-02
    wind_speed 40.27  2013-01-01
    wind_speed 39.12  2013-01-03
    wind_speed 14.96  2013-02-01
    wind_speed 13.81  2013-02-02

Then I extracted the month:

    # month() comes from lubridate; the result has to be assigned back to df_mod
    df_mod <- transform(df_mod, month = month(date, label = TRUE))

    metric values month
    temp   60.80  Jan
    temp   46.04  Jan
    temp   26.96  Feb
    temp   24.98  Feb
    ...

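Now I am trying to build a grouped bar chart from this data.

The height of each bar should be a mean, so I want to group by month and by variable and then take the mean within each month.
I am trying the code below, but it gives me an error.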

    df_mod %>%
      group_by(metric) %>%
      summarize(mean = mean(values)) %>%
      ggplot(aes(fill = metric, y = mean, x = month)) +
      geom_bar(position = "dodge", stat = "identity") +
      theme_bw() +
      labs(title = "Weather metrics",
           x = "", y = "values")

    Don't know how to automatically pick scale for object of type function. Defaulting to continuous.
    Error in `f()`:
    ! Aesthetics must be valid data columns. Problematic aesthetic(s): x = month.
    Did you mistype the name of a data column or forget to add after_stat()?
    Run `rlang::last_error()` to see where the error occurred.

I also tried group_by(month, metric), which prints:

    `summarise()` has grouped output by '.groups'. You can override using the `.groups` argument.

Can anyone help me with this?

nwnhqdif  #1

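The problem is that you have to group_by both metric and month. Otherwise you only compute the mean per metric, and the month column is dropped from the summarized result. That is why you get the error: with no month column in the data, ggplot2 thinks you want to map the month function onto x.
Note: I also switched to a tidyverse approach for reshaping the data.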

    library(tidyr)
    library(ggplot2)
    library(lubridate)
    library(dplyr)

    # Reshape to long format and add the month as a labelled factor
    df_mod <- df %>%
      tidyr::pivot_longer(-date, names_to = "metric", values_to = "values") %>%
      mutate(month = month(date, label = TRUE))

    # Group by month AND metric so that month survives summarize()
    df_mod %>%
      group_by(month, metric) %>%
      summarize(mean = mean(values)) %>%
      ggplot(aes(fill = metric, y = mean, x = month)) +
      geom_col(position = "dodge") +
      theme_bw() +
      labs(
        title = "Weather metrics",
        x = "", y = "values"
      )
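
To see the difference between the two groupings without plotting, here is a minimal sketch that rebuilds the data from the question and compares the column names after summarizing. The object names by_metric and by_month_metric are only illustrative. It also passes .groups = "drop", which should silence the `summarise()` message quoted in the question; that message is informational, not an error.

```r
library(dplyr)
library(tidyr)
library(lubridate)

# Data frame from the question
df <- data.frame(
  temp = c(60.80, 46.04, 26.96, 24.98),
  humid = c(93.79, 53.33, 50.34, 54.65),
  wind_speed = c(40.27, 39.12, 14.96, 13.81),
  date = as.Date(c("2013-01-01", "2013-01-03", "2013-02-01", "2013-02-02"))
)

# Long format with a month column, as in the answer above
df_mod <- df %>%
  pivot_longer(-date, names_to = "metric", values_to = "values") %>%
  mutate(month = month(date, label = TRUE))

# Grouping by metric only: the result keeps the grouping column and the
# summary column, so month is no longer available to ggplot()
by_metric <- df_mod %>%
  group_by(metric) %>%
  summarise(mean = mean(values))
names(by_metric)

# Grouping by month and metric keeps month; .groups = "drop" returns an
# ungrouped tibble and suppresses the grouping message
by_month_metric <- df_mod %>%
  group_by(month, metric) %>%
  summarise(mean = mean(values), .groups = "drop")
names(by_month_metric)
```

The second result has one row per month and metric combination, which is the shape that geom_col(position = "dodge") needs for a grouped bar chart.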
