TensorFlow: tensor with shape=() shows many values when printed

Posted by eqqqjvef on 2022-11-16 in Other


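I'm using TensorFlow to develop a VAE, so the cost I use is the ELBO (evidence lower bound). To apply the error to the gradients, reduce_mean() must be used in the last step so that the cost function returns a scalar.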

def vae_cost(x_true, model, analytic_kl=False, kl_weight=4):
  x_true = tf.cast(x_true, tf.float32)
  z_sample, mu, sd = model.encode(x_true)
  x_recons_logits = model.decoder(z_sample)
  # compute mean squared error
  recons_error = tf.cast(
      tf.reduce_mean((x_true - x_recons_logits) ** 2, axis=[1, 2, 3]),
      tf.float32)
  # compute reverse KL divergence, either analytically 
  # or through MC approximation with one sample
  if analytic_kl:
    kl_divergence = -0.5 * tf.math.reduce_sum(
        1 + tf.math.log(tf.math.square(sd)) - tf.math.square(mu) - tf.math.square(sd),
        axis=1) # shape=(batch_size,)
  else:
    log_pz = normal_log_pdf(z_sample, 0., 1.) # shape=(batch_size,)
    logqz_x = normal_log_pdf(z_sample, mu, tf.math.square(sd))
    kl_divergence = logqz_x - log_pz

  elbo = tf.reduce_mean(-kl_weight * kl_divergence - recons_error)
  return -elbo

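(Note: this is code I took from here, with almost no modifications.)
The model trains perfectly; there is no problem in that sense. What I'm having trouble with is *printing* the error. I know very little about TensorFlow's internals, but I know you can't use Python's built-in print(), because, if I'm not mistaken, it prints the computation graph. So tf.print() seems to be the solution. But instead of a single value, the console shows: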

Then, if I use Python's print():

<tf.Tensor 'Neg:0' shape=() dtype=float32>

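If the vector has shape=(), how can tf.print() produce so many values? Am I misunderstanding how this function works? In that case, how do I actually print the error? I'd also appreciate an explanation of what "Neg:0" means. Thanks in advance.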

tensorflow

Source: https://stackoverflow.com/questions/74086877/shape-vector-containing-many-values-when-printed

1 answer


ruyhziif1#

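tf.print() prints the values of the tensor it receives, one per element. A tensor with shape=() is a scalar, not a vector, and holds exactly one value. Seeing many values therefore suggests either that the printed tensor is not the reduced loss (for example, one of the per-example terms of shape (batch_size,)) or that the print runs once per training step.
Here is the modified code, which also returns kl_divergence and recons_error so that each term can be inspected separately: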
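To illustrate the difference between the two print functions, here is a minimal sketch, assuming TensorFlow 2.x. The function name neg_mean and its input are made up for the example and are not part of the original code. Inside a tf.function, Python's print() runs only while the function is being traced and shows a symbolic tensor, whereas tf.print() runs every time the graph executes and shows the actual value. In a name such as "Neg:0", "Neg" is the name of the graph operation (the unary minus, as in -elbo) and ":0" is the index of that operation's output.

```python
import tensorflow as tf


@tf.function
def neg_mean(x):
    # Hypothetical stand-in for a cost function that ends in a negation.
    loss = -tf.reduce_mean(x)  # scalar, shape=()
    # Runs once, during tracing: shows a symbolic tensor, not a number.
    print("traced:", loss)
    # Runs on every call of the compiled graph: shows the numeric value.
    tf.print("value:", loss)
    return loss


result = neg_mean(tf.constant([1.0, 2.0, 3.0]))

# Outside the tf.function the result is an eager tensor, so it can be
# converted to a plain Python number.
loss_value = float(result)
```

If the function is called once per batch, tf.print() emits one line per call, which can look like a long list of values even though each one is a scalar.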

def vae_cost(x_true, model, analytic_kl=False, kl_weight=4):
  x_true = tf.cast(x_true, tf.float32)
  z_sample, mu, sd = model.encode(x_true)
  x_recons_logits = model.decoder(z_sample)
  # compute mean squared error
  recons_error = tf.cast(
      tf.reduce_mean((x_true - x_recons_logits) ** 2, axis=[1, 2, 3]),
      tf.float32)
  # compute reverse KL divergence, either analytically 
  # or through MC approximation with one sample
  if analytic_kl:
    kl_divergence = -0.5 * tf.math.reduce_sum(
        1 + tf.math.log(tf.math.square(sd)) - tf.math.square(mu) - tf.math.square(sd),
        axis=1) # shape=(batch_size,)
  else:
    log_pz = normal_log_pdf(z_sample, 0., 1.) # shape=(batch_size,)
    logqz_x = normal_log_pdf(z_sample, mu, tf.math.square(sd))
    kl_divergence = logqz_x - log_pz

  elbo = tf.reduce_mean(-kl_weight * kl_divergence - recons_error)
  return -elbo, kl_divergence, recons_error
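As a rough sketch of how the returned terms might be printed, the snippet below uses made-up stand-in tensors in place of a real model, assuming TensorFlow 2.x. The per-example terms have shape (batch_size,), so printing them directly shows one value per example; reducing them first gives a single number.

```python
import tensorflow as tf

# Stand-ins for the per-example terms returned by vae_cost.
# The numbers are invented for illustration only.
kl_divergence = tf.constant([0.5, 1.5, 1.0, 2.0])  # shape=(batch_size,)
recons_error = tf.constant([0.2, 0.4, 0.1, 0.3])   # shape=(batch_size,)
kl_weight = 4.0

# Same reduction as in vae_cost: a scalar with shape=().
loss = -tf.reduce_mean(-kl_weight * kl_divergence - recons_error)

# Scalars print as a single value.
tf.print("loss:", loss)
tf.print("mean KL:", tf.reduce_mean(kl_divergence),
         "mean reconstruction error:", tf.reduce_mean(recons_error))

# A per-example tensor prints one value per element;
# summarize=-1 asks tf.print to show every element.
tf.print("per-example KL:", kl_divergence, summarize=-1)
```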

2022-11-16
