使用if-else创建新列时出现Pandas错误：系列的真值是模糊的[重复]

lnlaulya 于 2023-09-29 发布在其他

关注(0)|答案(5)|浏览(128)

此问题已在此处有答案：

Truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all()（14个回答）
Create new column based on values from other columns / apply a function of multiple columns, row-wise in Pandas（8个回答）
4天前关闭。
我正在使用Pandas，并试图使用Python if-else语句（又名三元条件运算符）创建一个新列，以避免被零除。
例如下面的例子，我想通过除以A/B来创建一个新的列C。我想使用if-else语句来避免除以0。

import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.randint(0, 5, size=(100, 2)), columns=list('AB'))
df.head()
#    A  B
# 0  1  3
# 1  1  2
# 2  0  0
# 3  2  1
# 4  4  2
df['C'] = (df.A / df.B) if df.B > 0.0 else 0.0

但是，我从最后一行得到一个错误：

ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().

我在StackOverflow上搜索了一下，发现了其他关于这个错误的帖子，但是没有一个涉及这种类型的if-else语句。一些职位包括：
Truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all()
The truth value of a Series is ambiguous in dataframe
Error: The truth value of a Series is ambiguous - Python pandas
如果你能帮忙的话，我将不胜感激。

pandas

来源：https://stackoverflow.com/questions/48123368/pandas-error-when-using-if-else-to-create-new-column-the-truth-value-of-a-serie

5条答案

按热度按时间

idv4meu81#

那做什么呢

>>> df['C'] = np.where(df.B>0., df.A/df.B, 0.)

其内容如下：
如果df.B为正，则返回df.A/df.B，否则返回0.

赞(0）回复(0）举报 2023-09-29

0qx6xfy62#

df.B > 0产生一个系列，例如：

0      True  # 4 > 0 => True
1      True  # 2 > 0 => True
2      True  # ...
3      True
4      True
5      True
6      True
7      True
8     False  # 0 is not > 0 => False
9     False  # 0 is not > 0 => False
...

返回多个值，这会导致歧义（一些值为True，而另一些值为False）。
一个解决方案是使用np.where：

sentinel = np.nan  # Or 0 if you must...
df = df.assign(C=np.where(df['B'] != 0, df['A'] / df['B'], sentinel))
>>> df
   A  B    C
0  2  4  0.5
1  0  2  0.0
2  1  2  0.5
3  4  4  1.0
4  1  1  1.0
5  4  4  1.0
6  2  4  0.5
7  1  2  0.5
8  4  0  NaN  # NaN is assigned in cases where the value in Column `B` is zero.
9  1  0  NaN
...

展开查看全部

赞(0）回复(0）举报 2023-09-29

w41d8nur3#

基于上面关于迭代 Dataframe 的@vaishnav提案，这里是一个工作提案：

for index, row in df.iterrows():
    if row.B > 0:
        df.loc[index, 'C'] = row.A / row.B
    else:
        df.loc[index, 'C'] = 0

输出量：

A  B         C
0  3  4  0.750000
1  0  4  0.000000
2  4  3  1.333333
3  2  1  2.000000
4  1  0  0.000000
5  0  2  0.000000

赞(0）回复(0）举报 2023-09-29

oo7oh9g94#

df['C']=df.A.div(df.B.mask(df.B.lt(0),0)).fillna(0)
df
Out[89]: 
   A  B         C
0  1  3  0.333333
1  1  2  0.500000
2  0  0  0.000000
3  2  1  2.000000
4  4  2  2.000000

使用apply lambda

df['C']=df.apply(lambda x : x['A']/x['B'] if x['B']>0 else 0,1)
df
Out[93]: 
   A  B         C
0  1  3  0.333333
1  1  2  0.500000
2  0  0  0.000000
3  2  1  2.000000
4  4  2  2.000000

展开查看全部

赞(0）回复(0）举报 2023-09-29

sxpgvts35#

或者你可以打开一个for循环。

for i,j in df['a'],df['b']:
    if j>0:
        df['c']=i/j
    else:
        df['c']=0.0

赞(0）回复(0）举报 2023-09-29

我来回答

使用if-else创建新列时出现Pandas错误：系列的真值是模糊的[重复]

5条答案

相关问题

热门标签

最新问答