MARL for 3D RBC | Evaluation of trained agent at Ra = 750