pandas: Applying a method to a few selected columns in a pandas DataFrame

Posted by 7gcisfzg on 2023-02-11 in Other


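I want to apply a small method to a few columns of a DataFrame. The color_negative_red method cannot be applied to string columns, so I need to skip those. I can think of two ways to approach this, but unfortunately neither works.

* Approach 1:

I tried applying the method to each column one by one (skipping the first column) by indexing the DataFrame and starting the while loop's counter at 1. Running this gives the error that a 'Series' object has no attribute 'style', so apparently I cannot apply the method to a single column.

* Approach 2:

I tried using subset to apply the method only to the columns holding numeric values, although I am not sure I am using subset correctly. Running this gives the error that an object of type 'Styler' has no len().

Here is a simplified example: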

import pandas as pd

d = {'col1': ['a', 'b'], 'col2': [21, 22], 'col3': [3, 51]}
df = pd.DataFrame(data=d)

def color_negative_red(val):
    color = 'black'
    if val < -1 : color = 'red'
    if val > 1 :  color = 'green'
    return 'color: %s' % color    
    
i=1
while i <= len(df):
    #Approach 1
    df.iloc[:, i] = df.iloc[:, i].style.applymap(color_negative_red)
    #Approach 2
    df = df.style.applymap(color_negative_red, subset = df.iloc[:, i])
    i+=1    

df

Does anyone have a suggestion for how to solve this?

pandas

Source: https://stackoverflow.com/questions/51306491/applying-a-method-to-a-few-selected-columns-in-a-pandas-dataframe

3 Answers


0yycz8jy1#

You can select the columns you want and then run applymap on them, like this:

column_names = ['name_a','name_b']
df[column_names] = df[column_names].applymap(my_func)

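If needed, you can filter out the string columns:

import numpy as np

# keep every column whose dtype is not object (strings are stored as object here)
column_names = [name for name, col_type in df.dtypes.items() if col_type != np.dtype('O')]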
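As a possible alternative to comparing dtypes by hand, pandas can pick the numeric columns itself with DataFrame.select_dtypes. The sketch below reuses the question's example frame; the variable name numeric_cols is only illustrative. One reason this may be more robust: a column stored with a dedicated string dtype is not of dtype object, so a "not object" test would not exclude it, while a "numeric only" test would.

```python
import pandas as pd

# Example frame from the question: one string column, two integer columns.
df = pd.DataFrame({'col1': ['a', 'b'], 'col2': [21, 22], 'col3': [3, 51]})

# Ask pandas for the columns with a numeric dtype (ints and floats).
numeric_cols = df.select_dtypes(include='number').columns.tolist()

# The resulting list of labels can be used to index the frame.
numeric_part = df[numeric_cols]
print(numeric_cols)
```

The resulting list can be used wherever column_names appears above. Note that DataFrame.applymap was deprecated in pandas 2.1 in favor of DataFrame.map, so on recent versions df[numeric_cols].map(my_func) is the equivalent element-wise call.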


ujv3wf0j2#

You can use Styler.apply with a DataFrame of styles, filled in using numpy.select:

import numpy as np
import pandas as pd

d = {'col1': ['a', 'b'], 'col2': [21, 22], 'col3': [0, -51]}
df = pd.DataFrame(data=d)

def color_negative_red(x):
    # select only the numeric columns
    x1 = x.select_dtypes(np.number)
    c1 = 'color: red'
    c2 = 'color: green'
    c3 = ''
    # boolean masks
    m1 = x1 < -1
    m2 = x1 > 1
    # array of styles chosen by condition
    arr = np.select([m1, m2], [c1, c2], default=c3)
    # use the index of x itself rather than the global df
    df1 = pd.DataFrame(arr, index=x.index, columns=x1.columns)
    # add the string columns back, filled with the empty style c3
    df1 = df1.reindex(columns=x.columns, fill_value=c3)
    return df1

df.style.apply(color_negative_red, axis=None)


dgenwo3n3#

Vectorize the function:

import numpy as np

f = np.vectorize(color_negative_red)

You can then use a plain apply, filtering by column name as needed:

df.apply(lambda x: f(x) if x.name not in ['col1'] else x)
#   col1          col2          col3
# 0    a  color: green  color: green
# 1    b  color: green  color: green
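Note that this returns a new DataFrame whose cells hold the CSS strings. It neither styles the displayed table nor modifies df. A minimal sketch of the same idea follows, assuming an explicit list of target columns (target_cols and css are illustrative names) and using a col3 value of -51 so that both colors appear:

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'col1': ['a', 'b'], 'col2': [21, 22], 'col3': [3, -51]})

def color_negative_red(val):
    """Return a CSS color rule for a single numeric cell."""
    if val < -1:
        return 'color: red'
    if val > 1:
        return 'color: green'
    return 'color: black'

# np.vectorize wraps the scalar function so it accepts a whole column.
f = np.vectorize(color_negative_red)

target_cols = ['col2', 'col3']  # illustrative choice of columns

# Work on a copy so the original values stay available.
css = df.copy()
css[target_cols] = df[target_cols].apply(
    lambda s: pd.Series(f(s), index=s.index)
)
print(css)
```

Selecting the columns first avoids the name check inside the lambda. np.vectorize is a convenience wrapper around a Python-level loop, so it should not be expected to speed things up.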
