如何找到一列Pandas过滤后的最小最大值？

piztneat 于 2023-03-06 发布在其他

关注(0)|答案(2)|浏览(159)

我有一个 Dataframe ：

import pandas as pd
df = pd.DataFrame(
    {'team': ['A', 'A', 'A', 'A', 'B', 'B', 'B', 'B'],
     'variable': [8, 9, 10, 11, 2, 3, 4, 5],
     'another_variable': [1, 1, 1, 2, 1, 1, 2, 2]}
)

我希望找到variable（向上计数），其中another_variable仍然等于1。
我可以对数据框进行分组并过滤相关行：

df.groupby(['team']).apply(lambda g: g[g['another_variable'] == 1])

# Output:
#       team    variable    another_variable
#team               
#A  0   A       8           1
#   1   A       9           1
#   2   A       10          1
#B  4   B       2           1
#   5   B       3           1

但是如果我加上.variable.min()，我只得到一个值，而不是每个组一个值（然后我可以计算最大值），我做错了什么？

pandas

来源：https://stackoverflow.com/questions/75558210/how-to-find-the-smallest-maximum-of-a-column-with-pandas-after-filtering

2条答案

按热度按时间

ee7vknir1#

首先过滤，然后groupby：

df[df['another_variable'].eq(1)].groupby('team')['variable'].max()

输出：

team
A    10
B     3
Name: variable, dtype: int64

如果某个组可能没有1，而您希望有NaN，则使用：

df['variable'].where(df['another_variable'].eq(1)).groupby(df['team']).max()

A中没有1时的示例：

team
A   NaN
B     3
Name: variable, dtype: int64

赞(0）回复(0）举报 2023-03-06

55ooxyrt2#

以下似乎是Mozway已经提出的一个变体：

import pandas as pd

df = pd.DataFrame(
    {'team': ['A', 'A', 'A', 'A', 'B', 'B', 'B', 'B'],
     'variable': [8, 9, 10, 11, 2, 3, 4, 5],
     'another_variable': [1, 1, 1, 2, 1, 1, 2, 2]}
)

s = (df.groupby(['team', 'another_variable'])['variable']
       .max()
       .reset_index(['team', 'another_variable'])
       )

print( s[s['another_variable'].eq(1)] )

team  another_variable  variable
0    A                 1        10
2    B                 1         3

赞(0）回复(0）举报 2023-03-06

我来回答

如何找到一列Pandas过滤后的最小最大值？

2条答案

相关问题

热门标签

最新问答