我想用2个图像输入构建一个多路径CNN，每个输入都输入自己的CNN，然后在最后将特征连接起来。现在我有这样的东西：
第一组图像的特征和标签：X（具有特征）y（标签）
第二组图像的特征和标签（分段）：X_2（具有特征）y_2（具有标签）
我的增强代码是这样的：

data_augmentation = tf.keras.Sequential()
data_augmentation.add(tf.keras.layers.RandomFlip("horizontal_and_vertical"))
data_augmentation.add(tf.keras.layers.RandomRotation(0.2))
data_augmentation.add(tf.keras.layers.RandomZoom(height_factor=(.05),
                                                 width_factor=(.05)))

我想让CNN看起来像这样：CNN model
第一和第二组图像都是相同的大小并且顺序相同。唯一的区别是第二组是分段的。我想对这两个应用相同的数据增强。有人知道怎么做吗？
我试着在网上寻找一些答案，但我发现大多数关于多输入CNNS的网站都没有使用增强层，所以我被卡住了。

(See以下为原始答案）
我猜这个Keras文档与Keras中的数据增强有关。特别是，它提到了使用数据增强层的两个选项：在Model中使用它，并在tf.data管道中使用它。
下面是一个使用第二个选项的粗略（可以将其视为伪代码）示例实现。
此外，我个人倾向于使用自定义的Model来实现问题中描述的Model，以实现灵活性，但也许有一些方法可以使用更高级别的Keras API来实现。
在下面的代码片段中，设置了一个自定义的Model，然后是训练数据集管道。它缺乏确切的加载步骤，但应该给予一个如何将数据集与自定义Model连接的粗略概念。

import tensorflow as tf
class TwinCNN(tf.keras.Model):
    def __init__(self, name="TwinCNN"):
        super().__init__(name=name)
        
        self.cnn_for_color = [
            tf.keras.layers.Conv2D(128, 3, padding="same"), 
            tf.keras.layers.Conv2D(128, 3, padding="same"), 
            tf.keras.layers.Conv2D(128, 3, padding="same"), 
        ]  # Some layers
        
        self.cnn_for_segment = [
            tf.keras.layers.Conv2D(64, 3, padding="same"), 
            tf.keras.layers.Conv2D(64, 3, padding="same"), 
            tf.keras.layers.Conv2D(64, 3, padding="same"), 
        ]
        
        self.classifier = [
            tf.keras.layers.Flatten(),
            tf.keras.layers.Dense(
                units=4,  # seems there are 4 classes
                activation=tf.nn.softmax)
        ]
    
    def call(self, inputs, training):
        color_image = inputs['color']
        segment_image = inputs['segment']
        
        color_fts = color_image
        for ly in self.cnn_for_color:
            color_fts = ly(color_fts, training=training)
        
        segment_fts = segment_image 
        for ly in self.cnn_for_segment:
            segment_fts = ly(segment_fts, training=training)
        
        concat_fts = tf.concat([color_fts, segment_fts], axis=-1, name='concat_fts')
        
        net = concat_fts
        for ly in self.classifier:
            net = ly(net, training=training)
        return net  # shape Nx4
# Your data augmentation layers
data_augmentation = tf.keras.Sequential()
data_augmentation.add(tf.keras.layers.RandomFlip("horizontal_and_vertical"))
data_augmentation.add(tf.keras.layers.RandomRotation(0.2))
data_augmentation.add(tf.keras.layers.RandomZoom(height_factor=(.05),
                                                 width_factor=(.05)))
def apply_augment_to_zipped(color_img, segment_img):
    c_color = color_img.shape[-1]  # Number of channels
    concat_img = tf.concat([color_img, segment_img], axis=-1)
    # **For your test/validation dataset, don't use augmentation!**
    img_aug = data_augmentation(concat_img, training=True)
    color_aug = img_aug[..., :c_color]
    segment_aug = img_aug[..., c_color:]
    return color_aug, segment_aug
def turn_image_tuple_to_dict(color_img, segment_img):
    # Put the two images into a dict
    return dict(color=color_img, segment=segment_img)
# Assuming ds_color and ds_segment has the *same* label, so we can take either one
ds_color_images = tf.data.Dataset(...)  # Set up a dataset for color images
ds_labels = tf.data.Dataset(...)  # Set up a dataset for labels
ds_segment_images = tf.data.Dataset(...)  # Set up a dataset for segment images
ds_images = tf.data.Dataset.zip((ds_color_images, ds_segment_images))  # each entry will be a tuple of 2 images
ds_images_augmented = ds_images.map(apply_augment_to_zipped)
ds_images_augmented = ds_images_augmented.map(turn_image_tuple_to_dict)
ds_train = tf.data.Dataset.zip((ds_images_augmented, ds_labels))  # The training set 
# NOTE: for a validation dataset, we should skip the augmentation (i.e. don't call apply_augment_to_zipped)
model = TwinCNN()
# TODO: select an optimizer, 
# TODO: model.compile(optimizer)
model.fit(
    ds_train, # TODO: other training settings
)

原始答案

由于两个数据集大小相同，因此可以使用

ds = tf.data.Dataset.zip((ds1, ds2))

以创建数据集，其中每个条目是一对图像，比如(x, x2)。然后我们可以做一些事情，

ds = ds.map(lambda x, x2: tf.concat([x, x2], axis=-1))

将x和x2合并为一个更适合增强层的Tensor。
假设对于每个x（假设3个通道）和每个x2（假设1个通道），通道的数量是恒定的，则可以在之后切片扩增结果：

x_aug = input_tensor[..., :3]
x2_aug = input_tensor[..., 3:4]

顺便说一下，如果问题更详细一些就更好了。例如，提供一些代码或伪代码来解释数据在“多路径CNN”中应该如何流动以及正在尝试什么样的数据增强将是有帮助的。

展开查看全部

keras 如何在2个不同的图像输入上执行相同的数据增强？

1条答案

原始答案

相关问题

热门标签

最新问答