pandas 如何用区间数据绘制百分位数图

bzzcjhmw  于 2023-11-15  发布在  其他
关注(0)|答案(2)|浏览(128)

如何用区间数据绘制百分位数图?
请参阅下面的代码,以根据特定的间隔计算数据的百分比。

  1. idx = pd.IntervalIndex.from_breaks([39.9, 42.9,45.9,48.9,51.9,54.9,57.9])
  2. df = pd.DataFrame({"Bin": idx, "Frequency": [2,2,5,5,12,3]})
  3. n = df["Frequency"].sum()
  4. df['cumulativeSumFreq'] = df["Frequency"].cumsum()
  5. df['cumulativePercent'] = (df["Frequency"]/n)*100
  6. df
  7. bins = [39.9, 42.9,45.9,48.9,51.9,54.9,57.9]
  8. df.hist(column='cumulativePercent', bins=bins)
  9. plt.show()

字符串
出于某种原因,df.hist()不排除bins=idx
我得到下面的图,它没有遵循正确的装箱。我如何实现这一点?


的数据


frebpwbc

frebpwbc1#

使用pyplot.stairsAxes.stairs

  1. edges = [39.9, 42.9,45.9,48.9,51.9,54.9,57.9]
  2. plt.stairs(df['cumulativePercent'], edges, fill=True)

字符串

  1. edges = [39.9, 42.9,45.9,48.9,51.9,54.9,57.9]
  2. fig, ax = plt.subplots()
  3. ax.stairs(df['cumulativePercent'], edges, fill=True)


输出量:


的数据

展开查看全部
sxissh06

sxissh062#

出现这个问题是因为hist方法试图从数据中创建自己的bin,而不是严格使用您提供的bin。您可以使用bar来绘制预分仓数据的直方图,而不是使用hist方法。这里是更新的完整代码:

  1. import pandas as pd
  2. import matplotlib.pyplot as plt
  3. # Create the dataframe
  4. idx = pd.IntervalIndex.from_breaks([39.9, 42.9,45.9,48.9,51.9,54.9,57.9])
  5. df = pd.DataFrame({"Bin": idx, "Frequency": [2,2,5,5,12,3]})
  6. n = df["Frequency"].sum()
  7. # Calculate cumulative sum and cumulative percent
  8. df['cumulativeSumFreq'] = df["Frequency"].cumsum()
  9. df['cumulativePercent'] = (df["Frequency"]/n)*100
  10. # Plotting the histogram using bar
  11. left_edges = [interval.left for interval in df["Bin"]]
  12. right_edges = [interval.right for interval in df["Bin"]]
  13. # The bar heights will be the 'cumulativePercent' values
  14. heights = df['cumulativePercent']
  15. # Plot the bars
  16. plt.bar(left_edges, heights, width=[right-left for left, right in zip(left_edges, right_edges)], align='edge', edgecolor='black')
  17. # Label the x-axis
  18. plt.xticks((left_edges + right_edges)[:-1], labels=[str(interval) for interval in df["Bin"]], rotation=45, ha="right")
  19. plt.show()

字符串

展开查看全部

相关问题