c++ 使用MPI合并来自多个秩的数组

3phpmpom  于 2022-12-15  发布在  其他
关注(0)|答案(1)|浏览(228)

我的代码要求3个进程生成如下所示的2D向量(在第二列上递增0.05),属于其他进程的行的值为零(未显示):

  1. Proc 1
  2. 100,0,1,1,1
  3. 100,0.05,4,1,5
  4. 100,0.10,10,5,11
  5. 100,...,...,...,...
  6. 100,0.30,12,11,15
  7. Proc 2
  8. 100,0.35,40,35,55
  9. 100,0.40,45,39,55
  10. 100,...,...,...,...
  11. 100,0.65,180,90,215
  12. Proc 3
  13. 100,0.70,130,67,145
  14. 100,...,...,...,...
  15. 100,1.0,1,1,1

我尝试使用MPI_Allreduce生成一个相同大小的2D向量:

  1. 100,0,1,1,1
  2. 100,0.05,2,2,3
  3. 100,0.1,3,2,4
  4. 100,0.15,3,2,6
  5. 100,0.2,4,2,6
  6. 100,0.25,14,10,20
  7. 100,0.30,15,11,21
  8. 100,0.35,10,6,19
  9. 100,0.4,13,9,21
  10. 100,0.45,16,12,25
  11. 100,0.5,33,17,55
  12. 100,0.55,70,33,155
  13. 100,0.6,80,30,190
  14. 100,0.65,110,45,200
  15. 100,0.7,145,134,161
  16. 100,0.75,131,127,138
  17. 100,0.8,123,120,129
  18. 100,0.85,117,114,122
  19. 100,0.9,111,111,113
  20. 100,0.95,110,108,112
  21. 100,1.0,1,1,1

到目前为止,我已经尝试了以下方法:

  1. // Function that calculates the average number of steps, incrementing over arraysize and probability
  2. // Output vector has 21 rows and columns represent [Arraysize, Probability, Average, Min Value, Max Value]
  3. std::vector<std::vector<double>> forest_fire_average_steps(int arraySize, int numberOfRuns, int rank, int numProcs)
  4. {
  5. int count = 21 / numProcs;
  6. int start = rank * count;
  7. int end = start + count;
  8. // init probability
  9. double p;
  10. // Create empty vector to store results
  11. std::vector<std::vector<double>> stepsResults(21, std::vector<double>(5, 0));
  12. if (rank == 0) p = 0;
  13. if (rank == 1) p = 0.35;
  14. if (rank == 2) p = 0.7;
  15. if (rank < 3)
  16. {
  17. // Iterates over a range of probability values, from 0 to 1 in 0.05 increments.
  18. for (int i = start; i < end; ++i)
  19. {
  20. // Adds the array size and probability value to the first two columns of the row.
  21. stepsResults[i][0] = arraySize;
  22. stepsResults[i][1] = p;
  23. // Runs the forest fire model 'numberOfRuns' times with a defined array size and p
  24. // and stores the results.
  25. std::vector<double> runSteps;
  26. for (int j = 0; j < numberOfRuns; ++j)
  27. {
  28. runSteps.push_back(forest_fire(arraySize, p).stepCount);
  29. }
  30. // Sums the runSteps vector.
  31. // Could have used std::reduce which is more efficient but the HPC G++ compiler was out of date.
  32. int sum = 0;
  33. for (auto& n : runSteps)
  34. {
  35. sum += n;
  36. }
  37. // Calculates and stores the average of the results.
  38. double averageSteps = sum / runSteps.size();
  39. stepsResults[i][2] = averageSteps;
  40. stepsResults[i][3] = *min_element(runSteps.begin(), runSteps.end());
  41. stepsResults[i][4] = *max_element(runSteps.begin(), runSteps.end());
  42. // Increments probability.
  43. p += 0.05;
  44. }
  45. }
  46. std::vector<std::vector<double>> finalResults(21, std::vector<double>(5, 0));
  47. for (unsigned int i=0;i<21;++i){
  48. int ierr = MPI_Allreduce(stepsResults[i].data(), finalResults[i].data(), 5, MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);
  49. }
  50. MPI_Finalize();
  51. if (rank == 3)
  52. {
  53. for (int i = 0; i < 21; i++)
  54. {
  55. for (int j = 0; j < finalResults[i].size(); j++)
  56. {
  57. if (j < 4)
  58. {
  59. std::cout << finalResults[i][j] << ",";
  60. } else
  61. {
  62. std::cout << finalResults[i][j];
  63. }
  64. }
  65. std::cout << std::endl;
  66. }
  67. }
  68. return finalResults;
  69. }

我不完全确定为什么,但输出看起来像这样,其中它明显跳过了一些行(0.6、0.65等),并添加了额外的零行:

  1. 100,0,1,1,1
  2. 100,0.05,2,2,3
  3. 100,0.1,3,2,4
  4. 100,0.15,3,2,6
  5. 100,0.2,4,2,6
  6. 100,0.35,10,6,19
  7. 100,0.4,13,9,21
  8. 100,0.45,16,12,25
  9. 100,0.5,33,17,55
  10. 100,0.55,70,33,155
  11. 100,0.7,145,134,161
  12. 100,0.75,131,127,138
  13. 100,0.8,123,120,129
  14. 100,0.85,117,114,122
  15. 100,0.9,111,111,113
  16. 0,0,0,0,0
  17. 0,0,0,0,0
  18. 0,0,0,0,0
  19. 0,0,0,0,0
  20. 0,0,0,0,0
  21. 0,0,0,0,0

我的实现是否存在导致此问题的错误?

tcbh2hod

tcbh2hod1#

我很惊讶这个程序没有因为segfault而中止。你的问题在于你把矩阵定义为vector<vector<double>>。MPI需要连续的缓冲区,而向量的向量是一堆小数组,随机分散在内存中。为你的对象写一个类,让它存储一个vector<double>,在其中你用i+j*N左右的索引。

相关问题