c++ TensorRT TRT::Tensor 用法学习笔记

x33g5p2x  于2022-08-17 转载在 其他  
字(0.7k)|赞(0)|评价(0)|浏览(708)
  1. TRT::Tensor roi_align_inputs(TRT::DataType::Float);
  2. roi_align_inputs.resize(infer_batch_size * MAX_IMAGE_BBOX * 6);
  3. roi_align_inputs.to_cpu(false);
  4. output_array_device.to_cpu(true);

to_cpu(true),转cpu,同时拷贝一份。

测试执行时间:

  1. const int ntest =1;
  2. auto begin_timer = iLogger::timestamp_now_float();
  3. for (int i = 0; i < ntest; ++i)
  4. boxes_array = engine->commits(images);
  5. // wait all result
  6. boxes_array.back().get();
  7. float inference_average_time = (iLogger::timestamp_now_float() - begin_timer) / ntest / images.size();
  8. auto type_name = FasterRCNN::type_name(type);
  9. auto mode_name = TRT::mode_string(mode);
  10. INFO("%s[%s] average: %.2f ms / image, FPS: %.2f", engine_file.c_str(), type_name, inference_average_time, 1000 / inference_average_time);

后面持续更新。

相关文章