scipy 拟合泊松直方图

tzcvj98z 于 2022-12-18 发布在其他

关注(0)|答案(2)|浏览(149)

我正在尝试在泊松分布的直方图上拟合一条曲线，如下所示
我修改了fit函数，使其类似于泊松分布，参数t作为变量。但curve_fit函数无法绘制，我不知道为什么。

def histo(bsize):
    N = bsize
    #binwidth
    bw = (dt.max()-dt.min())/(N-1.)
    bin1 = dt.min()+ bw*np.arange(N)
    #define the array to hold the occurrence count
    bincount= np.array([])
    for bin in bin1:
        count = np.where((dt>=bin)&(dt<bin+bw))[0].size
        bincount = np.append(bincount,count)
    #bin center
    binc = bin1+0.5*bw
    plt.figure()
    plt.plot(binc,bincount,drawstyle= 'steps-mid')
    plt.xlabel("Interval[ticks]")
    plt.ylabel("Frequency")
histo(30)
plt.xlim(0,.5e8)
plt.ylim(0,25000)
import numpy as np
from scipy.optimize import curve_fit
delta_t = 1.42e7
def func(x, t):
    return t * np.exp(- delta_t/t) 
popt, pcov = curve_fit(func, np.arange(0,.5e8),histo(30))
plt.plot(popt)

scipy

来源：https://stackoverflow.com/questions/25828184/fitting-to-poisson-histogram

2条答案

按热度按时间

cl25kdpy1#

你的代码的问题在于你不知道curve_fit的返回值是什么，它是拟合函数的参数和它们的协方差矩阵，而不是你可以直接绘制的东西。

分组最小二乘拟合

一般来说，你可以得到一切非常，非常容易：

import numpy as np
import matplotlib.pyplot as plt
from scipy.optimize import curve_fit
from scipy.special import factorial
from scipy.stats import poisson

# get poisson deviated random numbers
data = np.random.poisson(2, 1000)

# the bins should be of integer width, because poisson is an integer distribution
bins = np.arange(11) - 0.5
entries, bin_edges, patches = plt.hist(data, bins=bins, density=True, label='Data')

# calculate bin centers
bin_centers = 0.5 * (bin_edges[1:] + bin_edges[:-1])

def fit_function(k, lamb):
    '''poisson function, parameter lamb is the fit parameter'''
    return poisson.pmf(k, lamb)

# fit with curve_fit
parameters, cov_matrix = curve_fit(fit_function, bin_centers, entries)

# plot poisson-deviation with fitted parameter
x_plot = np.arange(0, 15)

plt.plot(
    x_plot,
    fit_function(x_plot, *parameters),
    marker='o', linestyle='',
    label='Fit result',
)
plt.legend()
plt.show()

这就是结果：

未分组最大似然拟合

更好的可能性是根本不使用直方图，而是执行最大似然拟合。
但是通过更仔细的研究，甚至这也是不必要的，因为泊松分布参数的最大似然估计量是算术平均值。
但是，如果您有其他更复杂的PDF，您可以使用以下示例：

import numpy as np
import matplotlib.pyplot as plt
from scipy.optimize import minimize
from scipy.special import factorial
from scipy import stats

def poisson(k, lamb):
    """poisson pdf, parameter lamb is the fit parameter"""
    return (lamb**k/factorial(k)) * np.exp(-lamb)

def negative_log_likelihood(params, data):
    """
    The negative log-Likelihood-Function
    """

    lnl = - np.sum(np.log(poisson(data, params[0])))
    return lnl

def negative_log_likelihood(params, data):
    ''' better alternative using scipy '''
    return -stats.poisson.logpmf(data, params[0]).sum()

# get poisson deviated random numbers
data = np.random.poisson(2, 1000)

# minimize the negative log-Likelihood

result = minimize(negative_log_likelihood,  # function to minimize
                  x0=np.ones(1),            # start value
                  args=(data,),             # additional arguments for function
                  method='Powell',          # minimization method, see docs
                  )
# result is a scipy optimize result object, the fit parameters 
# are stored in result.x
print(result)

# plot poisson-distribution with fitted parameter
x_plot = np.arange(0, 15)

plt.plot(
    x_plot,
    stats.poisson.pmf(x_plot, result.x),
    marker='o', linestyle='',
    label='Fit result',
)
plt.legend()
plt.show()

赞(0）回复(0）举报 2022-12-18

lfapxunr2#

感谢您发送编修。
您可能需要考虑以下事项：
1)为了获得更好的数值行为，计算“对数泊松”，而不是计算“泊松
2)不要使用“lamb”，而是使用对数（我称之为“log_mu”），以避免拟合“徘徊”到“mu”的负值。

log_poisson(k, log_mu): return k*log_mu - loggamma(k+1) - math.exp(log_mu)

其中“loggamma”是scipy.special.loggamma函数。
实际上，在上面的拟合中，“loggamma”项仅向被最小化的函数添加了一个常量偏移，因此，只需执行以下操作即可：

log_poisson_(k, log_mu): return k*log_mu - math.exp(log_mu)

注意：log_poisson_()与log_poisson()不同，但当以上述方式用于最小化时，将给予相同的拟合最小值（相同的mu值，直到数值问题）。被最小化的函数的值将被偏移，但通常不关心这一点。

赞(0）回复(0）举报 2022-12-18

我来回答

scipy 拟合泊松直方图

2条答案

分组最小二乘拟合

未分组最大似然拟合

相关问题

热门标签

最新问答