Chinese-CLIP 论文无法复现

eulz3vhy  于 9个月前  发布在  其他
关注(0)|答案(1)|浏览(237)

在Flickr30k-CN数据的fintune实验上,无法复现论文,特别是valid上表现,低于10%
训练配置:

  1. ## Number of GPUs per GPU worker
  2. GPUS_PER_NODE=8
  3. ## Number of GPU workers, for single-worker training, please set to 1
  4. WORKER_CNT=1
  5. ## The ip address of the rank-0 worker, for single-worker training, please set to localhost
  6. export MASTER_ADDR=127.0.0.1
  7. ## The port for communication
  8. export MASTER_PORT=8514
  9. ## The rank of this worker, should be in {0, ..., WORKER_CNT-1}, for single-worker training, please set to 0
  10. export RANK=0
  11. context_length=52
  12. warmup=100
  13. batch_size=512
  14. valid_batch_size=512
  15. accum_freq=2
  16. lr=5e-4
  17. wd=0.001
  18. max_epochs=30 # or you can alternatively specify --max-steps
  19. valid_step_interval=150
  20. valid_epoch_interval=1
  21. vision_model=ViT-B-16
  22. text_model=RoBERTa-wwm-ext-base-chinese
  23. ## mask_ratio=0.5 # use flip: set mask ratio
  24. use_augment="--use-augment"
  25. use_augment=""
falq053o

falq053o1#

\n\n您好,您可以参考我们在技术报告中给出的超参数配置。大多数finetune实验都是在32卡上进行的。根据您的情况,您的finetune实验应该是在单机8卡上进行的。您可以尝试使用经过单机8卡验证的COCO-CN脚本默认参数来进行COCO-CN finetune实验。

相关问题