opencv: How to replace a checked pattern in a PNG image with transparency in Python?

Posted by 8nuwlpux on 2022-11-15 in Python


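I am trying to use a Python script to remove the checkered background (the pattern Adobe Illustrator and Photoshop use to represent transparency) from some PNGs and replace it with real transparency (an alpha channel).
First, I use *template matching*: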

import cv2
import numpy as np
from matplotlib import pyplot as plt

img_rgb = cv2.imread('testimages/fake1.png', cv2.IMREAD_UNCHANGED)
img_gray = cv2.cvtColor(img_rgb, cv2.COLOR_BGR2GRAY)
template = cv2.imread('pattern.png', 0)

w, h = template.shape[::-1]
res = cv2.matchTemplate(img_gray, template, cv2.TM_CCOEFF_NORMED)
threshold = 0.8
loc = np.where( res >= threshold)

# add an alpha channel once, before drawing, if the image has none
if img_rgb.shape[2] == 3:
    # the new alpha channel is filled with 255 (fully opaque)
    img_rgb = cv2.cvtColor(img_rgb, cv2.COLOR_BGR2BGRA)

for pt in zip(*loc[::-1]):
    # replace the matched area with a transparent rectangle
    cv2.rectangle(img_rgb, pt, (pt[0] + w, pt[1] + h), (255, 255, 255, 0), -1)

cv2.imwrite('result.png', img_rgb)

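Source image: fake1.png

Pattern template: pattern.png

Output: result.png (the gray area is actually transparent; zoomed in a little for viewing)

I know this approach has problems: in some cases the template is not matched completely, because part of the pattern is hidden by the graphics in the PNG image.

**My question is:** How can I match such a pattern perfectly using OpenCV? Via FFT filtering?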


Source: https://stackoverflow.com/questions/74134195/how-to-replace-a-checked-pattern-in-a-png-image-with-transparent-in-python

3 Answers


bogh5gae1#

Here is one way to do that in Python/OpenCV, simply by thresholding on the color range of the checks.
Input:

import cv2
import numpy as np

# read input
img = cv2.imread("fake.png")

# threshold on checks
low = (230,230,230)
high = (255,255,255)
mask = cv2.inRange(img, low, high)

# invert alpha
alpha = 255 - mask

# convert img to BGRA
result = cv2.cvtColor(img, cv2.COLOR_BGR2BGRA)
result[:,:,3] = alpha

# save output
cv2.imwrite('fake_transparent.png', result)

cv2.imshow('img', img)
cv2.imshow('mask', mask)
cv2.imshow('result', result)
cv2.waitKey(0)
cv2.destroyAllWindows()

Download the resulting image to see that it is actually transparent.
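The thresholding step can also be wrapped in a small function and checked on a synthetic image. The sketch below is a NumPy-only equivalent of the `cv2.inRange` step, so it runs without any image files. The function name `checks_to_alpha` and the tiny test image are made up for illustration; the 230 to 255 range is the one used above. Note that any foreground pixel that happens to fall inside that range would become transparent as well.

```python
import numpy as np

def checks_to_alpha(img_bgr, low=(230, 230, 230), high=(255, 255, 255)):
    """Return a BGRA copy of img_bgr whose alpha is 0 wherever all three
    channels lie inside [low, high] (hypothetical helper, not from the answer)."""
    low = np.array(low, dtype=np.uint8)
    high = np.array(high, dtype=np.uint8)
    # same test as cv2.inRange: every channel must be within its bounds
    in_range = np.all((img_bgr >= low) & (img_bgr <= high), axis=2)
    # pixels inside the range become transparent, everything else stays opaque
    alpha = np.where(in_range, 0, 255).astype(np.uint8)
    return np.dstack([img_bgr, alpha])

# synthetic 2x2 image: two light "check" pixels and two coloured pixels
demo = np.array([[[255, 255, 255], [0, 0, 200]],
                 [[40, 180, 40], [235, 235, 235]]], dtype=np.uint8)
demo_bgra = checks_to_alpha(demo)
```

The returned array has four channels and could be passed to `cv2.imwrite` with a `.png` filename, as in the code above.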

2022-11-15

fdbelqdn2#

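Here is one way to do that in Python/OpenCV/Numpy, using the DFT to process the image. We need to know the size of the checkerboard pattern (the size of one light or dark square).

- Read the input
- Separate the channels
- Apply the DFT to each channel
- Shift the origin from the top left corner to the center of each channel
- Extract the magnitude and phase images from each channel
- Define the checkerboard pattern size
- Create a black and white checkerboard image of the same size as the input
- Apply the same DFT processing to the checkerboard image
- Get the spectrum from the log of the magnitude
- Threshold the spectrum to form a mask
- Zero out the DC center point in the mask
- Option: if needed, apply a morphological dilation to thicken the white dots. That does not seem to be needed here
- Invert the mask so that the background is white and the dots are black
- Convert the mask to the range 0 to 1 and make it 2 channels
- Apply the two-channel mask to the center-shifted DFT channels
- Shift the center of each masked image back to the top left
- Do the IDFT on each channel to get back from the complex domain to the real domain
- Merge the resulting channels back into a BGR image as the final reconstructed image
- Save the results

Input: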

import numpy as np
import cv2
import math

# read input 
# note: opencv fft only works on grayscale
img = cv2.imread('fake.png')
hh, ww = img.shape[:2]

# separate channels
b,g,r = cv2.split(img)

# convert images to floats and do dft saving as complex output
dft_b = cv2.dft(np.float32(b), flags = cv2.DFT_COMPLEX_OUTPUT)
dft_g = cv2.dft(np.float32(g), flags = cv2.DFT_COMPLEX_OUTPUT)
dft_r = cv2.dft(np.float32(r), flags = cv2.DFT_COMPLEX_OUTPUT)

# apply shift of origin from upper left corner to center of image
# (shift only the two spatial axes; the last axis holds the real/imaginary planes)
dft_b_shift = np.fft.fftshift(dft_b, axes=(0, 1))
dft_g_shift = np.fft.fftshift(dft_g, axes=(0, 1))
dft_r_shift = np.fft.fftshift(dft_r, axes=(0, 1))

# extract magnitude and phase images
mag_b, phase_b = cv2.cartToPolar(dft_b_shift[:,:,0], dft_b_shift[:,:,1])
mag_g, phase_g = cv2.cartToPolar(dft_g_shift[:,:,0], dft_g_shift[:,:,1])
mag_r, phase_r = cv2.cartToPolar(dft_r_shift[:,:,0], dft_r_shift[:,:,1])

# set check size (size of either dark or light square)
check_size = 15

# create checkerboard pattern
white = np.full((check_size,check_size), 255, dtype=np.uint8)
black = np.full((check_size,check_size), 0, dtype=np.uint8)
checks1 = np.hstack([white,black])
checks2 = np.hstack([black,white])
checks3 = np.vstack([checks1,checks2])
numht = math.ceil(hh / (2*check_size))
numwd = math.ceil(ww / (2*check_size))
checks = np.tile(checks3, (numht,numwd))
checks = checks[0:hh, 0:ww]

# apply dft to checkerboard pattern
dft_c = cv2.dft(np.float32(checks), flags = cv2.DFT_COMPLEX_OUTPUT)
dft_c_shift = np.fft.fftshift(dft_c, axes=(0, 1))
mag_c, phase_c = cv2.cartToPolar(dft_c_shift[:,:,0], dft_c_shift[:,:,1])

# get spectrum from magnitude (add tiny amount to avoid taking the log of zero)
spec = np.log(mag_c + 0.00000001)

# threshold spectrum
mask = cv2.threshold(spec, 1, 255, cv2.THRESH_BINARY)[1]

# mask DC point (center spot)
centx = int(ww/2)
centy = int(hh/2)
dot = np.zeros((3,3), dtype=np.uint8)
mask[centy-1:centy+2, centx-1:centx+2] = dot

# If needed do morphology dilate by small amount. 
# But does not seem to be needed in this case

# invert mask
mask = 255 - mask

# apply mask to real and imaginary components
mask1 = (mask/255).astype(np.float32)
mask2 = cv2.merge([mask1,mask1])
complex_b = dft_b_shift*mask2
complex_g = dft_g_shift*mask2
complex_r = dft_r_shift*mask2

# shift origin from center to upper left corner (spatial axes only)
complex_ishift_b = np.fft.ifftshift(complex_b, axes=(0, 1))
complex_ishift_g = np.fft.ifftshift(complex_g, axes=(0, 1))
complex_ishift_r = np.fft.ifftshift(complex_r, axes=(0, 1))

# do idft with normalization saving as real output and crop to original size
img_notch_b = cv2.idft(complex_ishift_b, flags=cv2.DFT_SCALE+cv2.DFT_REAL_OUTPUT)
img_notch_b = img_notch_b.clip(0,255).astype(np.uint8)
img_notch_b = img_notch_b[0:hh, 0:ww]
img_notch_g = cv2.idft(complex_ishift_g, flags=cv2.DFT_SCALE+cv2.DFT_REAL_OUTPUT)
img_notch_g = img_notch_g.clip(0,255).astype(np.uint8)
img_notch_g = img_notch_g[0:hh, 0:ww]
img_notch_r = cv2.idft(complex_ishift_r, flags=cv2.DFT_SCALE+cv2.DFT_REAL_OUTPUT)
img_notch_r = img_notch_r.clip(0,255).astype(np.uint8)
img_notch_r = img_notch_r[0:hh, 0:ww]

# combine b,g,r components
img_notch = cv2.merge([img_notch_b, img_notch_g, img_notch_r])

# write result to disk
cv2.imwrite("fake_checks.png", checks)
cv2.imwrite("fake_spectrum.png", (255*spec).clip(0,255).astype(np.uint8))
cv2.imwrite("fake_mask.png", mask)
cv2.imwrite("fake_notched.png", img_notch)

# show results
cv2.imshow("ORIGINAL", img)
cv2.imshow("CHECKS", checks)
cv2.imshow("SPECTRUM", spec)
cv2.imshow("MASK", mask)
cv2.imshow("NOTCH", img_notch)
cv2.waitKey(0)
cv2.destroyAllWindows()
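The same notch-filter idea can be tried on one channel without any image files. The following is a minimal sketch under stated assumptions, not the answer's code: it uses `np.fft` instead of `cv2.dft`, works on a single channel, and the names `make_checkerboard`, `notch_filter` and `log_threshold` are hypothetical. The threshold of 1 on the log spectrum and the 3x3 DC neighborhood are taken from the code above.

```python
import numpy as np

def make_checkerboard(height, width, check_size):
    """Black/white (0/255) checkerboard whose squares are check_size pixels wide."""
    rows, cols = np.indices((height, width))
    cells = (rows // check_size + cols // check_size) % 2
    return np.where(cells == 0, 255, 0).astype(np.uint8)

def notch_filter(channel, check_size, log_threshold=1.0):
    """Suppress the frequencies at which a reference checkerboard has energy."""
    h, w = channel.shape
    checks = make_checkerboard(h, w, check_size)
    # centred log-magnitude spectrum of the reference checkerboard
    ref = np.fft.fftshift(np.fft.fft2(checks.astype(np.float64)))
    spec = np.log(np.abs(ref) + 1e-8)
    # keep only the bins where the checkerboard spectrum is below the threshold
    keep = spec <= log_threshold
    # always keep the 3x3 neighbourhood of the DC term (overall brightness)
    cy, cx = h // 2, w // 2
    keep[cy - 1:cy + 2, cx - 1:cx + 2] = True
    # filter the channel in the frequency domain and transform back
    spectrum = np.fft.fftshift(np.fft.fft2(channel.astype(np.float64)))
    restored = np.real(np.fft.ifft2(np.fft.ifftshift(spectrum * keep)))
    return np.rint(restored).clip(0, 255).astype(np.uint8)

# synthetic channel: a light checkerboard with values 200 and 250, 8-pixel checks
pattern = make_checkerboard(64, 64, 8)
demo_channel = np.where(pattern == 255, 250, 200).astype(np.uint8)
demo_filtered = notch_filter(demo_channel, check_size=8)
```

On this synthetic input the checks should flatten out to roughly their mean brightness. On a real image the filter also removes whatever foreground detail shares those frequencies, so the result is an approximation, and it only suppresses the pattern; it does not by itself produce an alpha channel.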

Checkerboard image: