scipy 如何将NumPy数组规范化到一定范围内？

3mpgtkmj 于 2022-11-10 发布在其他

关注(0)|答案(8)|浏览(145)

在对音频或图像数组进行一些处理之后，需要在一定范围内对其进行归一化，然后才能将其写回文件。可以这样做：


# Normalize audio channels to between -1.0 and +1.0

audio[:,0] = audio[:,0]/abs(audio[:,0]).max()
audio[:,1] = audio[:,1]/abs(audio[:,1]).max()

# Normalize image to between 0 and 255

image = image/(image.max()/255.0)

有没有一种更简洁、更方便的函数方法来实现这一点？matplotlib.colors.Normalize()似乎不相关。

scipy

来源：https://stackoverflow.com/questions/1735025/how-to-normalize-a-numpy-array-to-within-a-certain-range

8条答案

按热度按时间

kgsdhlau1#


# Normalize audio channels to between -1.0 and +1.0

audio /= np.max(np.abs(audio),axis=0)

# Normalize image to between 0 and 255

image *= (255.0/image.max())

使用/=和*=可以消除中间的临时数组，从而节省一些内存。

image *= 255.0/image.max()    # Uses 1 division and image.size multiplications

略快于

image /= image.max()/255.0    # Uses 1+image.size divisions

由于我们在这里使用的是基本的numpy方法，我认为这是numpy中最有效的解决方案。
就地操作不会改变容器数组的数据类型。因为所需的标准化值是浮点数，所以audio和image数组在执行就地操作之前需要具有浮点数据类型。如果它们还不是浮点数据类型，则需要使用astype转换它们。例如，

image = image.astype('float64')

赞(0）回复(0）举报 2022-11-10

1tu0hz3e2#

如果数组同时包含正数据和负数据，我会选择：

import numpy as np

a = np.random.rand(3,2)

# Normalised [0,1]

b = (a - np.min(a))/np.ptp(a)

# Normalised [0,255] as integer: don't forget the parenthesis before astype(int)

c = (255*(a - np.min(a))/np.ptp(a)).astype(int)        

# Normalised [-1,1]

d = 2.*(a - np.min(a))/np.ptp(a)-1

如果数组包含nan，一种解决方案是将它们删除，如下所示：

def nan_ptp(a):
    return np.ptp(a[np.isfinite(a)])

b = (a - np.nanmin(a))/nan_ptp(a)

但是，根据上下文的不同，您可能希望以不同的方式处理nan。例如，对值进行插值，将中的替换为例如0，或者引发错误。
最后，值得一提的是，即使不是OP的问题，standardization：

e = (a - np.mean(a)) / np.std(a)

赞(0）回复(0）举报 2022-11-10

r3i60tvu3#

也可以使用sklearn重新调整比例。其优点是，除了对数据进行均值置中之外，还可以对标准差进行归一化调整，并且可以在任一轴上按要素或按记录执行此操作。

from sklearn.preprocessing import scale
X = scale( X, axis=0, with_mean=True, with_std=True, copy=True )

关键字参数axis，with_mean，with_std是自解释的，并且以默认状态显示。参数copy如果设置为False，则就地执行操作。文档here。

赞(0）回复(0）举报 2022-11-10

idfiyjo84#

您正在尝试将audio的值在-1和+1之间进行最小-最大缩放，并将image的值在0和255之间进行最小-最大缩放。
使用sklearn.preprocessing.minmax_scale，应该可以轻松解决您的问题。
例如：

audio_scaled = minmax_scale(audio, feature_range=(-1,1))

和

shape = image.shape
image_scaled = minmax_scale(image.ravel(), feature_range=(0,255)).reshape(shape)

注：不要与将向量的norm（长度）缩放到某个值（通常为1）的操作混淆，该操作通常也称为归一化。

赞(0）回复(0）举报 2022-11-10

2uluyalo5#

您可以使用“i”（如idiv、imul..）版本，它看起来还不错：

image /= (image.max()/255.0)

对于另一种情况，您可以编写一个函数，通过列来规范化n维数组：

def normalize_columns(arr):
    rows, cols = arr.shape
    for col in xrange(cols):
        arr[:,col] /= abs(arr[:,col]).max()

赞(0）回复(0）举报 2022-11-10

nwwlzxa76#

This answer到一个similar question为我解决了这个问题

np.interp(a, (a.min(), a.max()), (-1, +1))

赞(0）回复(0）举报 2022-11-10

nhn9ugyo7#

一个简单的解决方案是使用sklearn.preprocessing库提供的定标器。

scaler = sk.MinMaxScaler(feature_range=(0, 250))
scaler = scaler.fit(X)
X_scaled = scaler.transform(X)

# Checking reconstruction

X_rec = scaler.inverse_transform(X_scaled)

错误X_rec-X将为零。您可以根据需要调整feature_range，甚至使用标准缩放器sk.StandardScaler（）

赞(0）回复(0）举报 2022-11-10

xtfmy6hx8#

我尝试跟随this，但出现错误

TypeError: ufunc 'true_divide' output (typecode 'd') could not be coerced to provided output parameter (typecode 'l') according to the casting rule ''same_kind''

我试图规范化的numpy数组是一个integer数组。看起来他们在〉1.10的版本中反对类型转换，你必须使用numpy.true_divide()来解决这个问题。

arr = np.array(img)
arr = np.true_divide(arr,[255.0],out=None)

img是一个PIL.Image对象。

赞(0）回复(0）举报 2022-11-10

我来回答

scipy 如何将NumPy数组规范化到一定范围内？

8条答案

相关问题

热门标签

最新问答