* [MAQAO] Info: Detected 1 Lprof instances in ip-172-31-35-140.ec2.internal.
If this is incorrect, rerun with number-processes-per-node=X
OMP: pid 872571 tid 872571 thread 0 bound to OS proc set {0}
OMP: pid 872571 tid 872684 thread 15 bound to OS proc set {15}
OMP: pid 872571 tid 872672 thread 3 bound to OS proc set {3}
OMP: pid 872571 tid 872681 thread 12 bound to OS proc set {12}
OMP: pid 872571 tid 872683 thread 14 bound to OS proc set {14}
OMP: pid 872571 tid 872670 thread 1 bound to OS proc set {1}
OMP: pid 872571 tid 872671 thread 2 bound to OS proc set {2}
OMP: pid 872571 tid 872700 thread 31 bound to OS proc set {31}
OMP: pid 872571 tid 872701 thread 32 bound to OS proc set {32}
OMP: pid 872571 tid 872680 thread 11 bound to OS proc set {11}
OMP: pid 872571 tid 872677 thread 8 bound to OS proc set {8}
OMP: pid 872571 tid 872682 thread 13 bound to OS proc set {13}
OMP: pid 872571 tid 872703 thread 34 bound to OS proc set {34}
OMP: pid 872571 tid 872688 thread 19 bound to OS proc set {19}
OMP: pid 872571 tid 872697 thread 28 bound to OS proc set {28}
OMP: pid 872571 tid 872685 thread 16 bound to OS proc set {16}
OMP: pid 872571 tid 872676 thread 7 bound to OS proc set {7}
OMP: pid 872571 tid 872673 thread 4 bound to OS proc set {4}
OMP: pid 872571 tid 872679 thread 10 bound to OS proc set {10}
OMP: pid 872571 tid 872687 thread 18 bound to OS proc set {18}
OMP: pid 872571 tid 872699 thread 30 bound to OS proc set {30}
OMP: pid 872571 tid 872678 thread 9 bound to OS proc set {9}
OMP: pid 872571 tid 872696 thread 27 bound to OS proc set {27}
OMP: pid 872571 tid 872675 thread 6 bound to OS proc set {6}
OMP: pid 872571 tid 872686 thread 17 bound to OS proc set {17}
OMP: pid 872571 tid 872702 thread 33 bound to OS proc set {33}
OMP: pid 872571 tid 872693 thread 24 bound to OS proc set {24}
OMP: pid 872571 tid 872692 thread 23 bound to OS proc set {23}
OMP: pid 872571 tid 872698 thread 29 bound to OS proc set {29}
OMP: pid 872571 tid 872674 thread 5 bound to OS proc set {5}
OMP: pid 872571 tid 872689 thread 20 bound to OS proc set {20}
OMP: pid 872571 tid 872691 thread 22 bound to OS proc set {22}
OMP: pid 872571 tid 872713 thread 44 bound to OS proc set {44}
OMP: pid 872571 tid 872695 thread 26 bound to OS proc set {26}
OMP: pid 872571 tid 872717 thread 48 bound to OS proc set {48}
OMP: pid 872571 tid 872732 thread 63 bound to OS proc set {63}
OMP: pid 872571 tid 872694 thread 25 bound to OS proc set {25}
OMP: pid 872571 tid 872720 thread 51 bound to OS proc set {51}
OMP: pid 872571 tid 872733 thread 64 bound to OS proc set {64}
OMP: pid 872571 tid 872690 thread 21 bound to OS proc set {21}
OMP: pid 872571 tid 872729 thread 60 bound to OS proc set {60}
OMP: pid 872571 tid 872725 thread 56 bound to OS proc set {56}
OMP: pid 872571 tid 872704 thread 35 bound to OS proc set {35}
OMP: pid 872571 tid 872719 thread 50 bound to OS proc set {50}
OMP: pid 872571 tid 872712 thread 43 bound to OS proc set {43}
OMP: pid 872571 tid 872716 thread 47 bound to OS proc set {47}
OMP: pid 872571 tid 872728 thread 59 bound to OS proc set {59}
OMP: pid 872571 tid 872730 thread 61 bound to OS proc set {61}
OMP: pid 872571 tid 872736 thread 67 bound to OS proc set {67}
OMP: pid 872571 tid 872731 thread 62 bound to OS proc set {62}
OMP: pid 872571 tid 872718 thread 49 bound to OS proc set {49}
OMP: pid 872571 tid 872745 thread 76 bound to OS proc set {76}
OMP: pid 872571 tid 872709 thread 40 bound to OS proc set {40}
OMP: pid 872571 tid 872715 thread 46 bound to OS proc set {46}
OMP: pid 872571 tid 872714 thread 45 bound to OS proc set {45}
OMP: pid 872571 tid 872748 thread 79 bound to OS proc set {79}
OMP: pid 872571 tid 872710 thread 41 bound to OS proc set {41}
OMP: pid 872571 tid 872727 thread 58 bound to OS proc set {58}
OMP: pid 872571 tid 872705 thread 36 bound to OS proc set {36}
OMP: pid 872571 tid 872711 thread 42 bound to OS proc set {42}
OMP: pid 872571 tid 872724 thread 55 bound to OS proc set {55}
OMP: pid 872571 tid 872741 thread 72 bound to OS proc set {72}
OMP: pid 872571 tid 872747 thread 78 bound to OS proc set {78}
OMP: pid 872571 tid 872744 thread 75 bound to OS proc set {75}
OMP: pid 872571 tid 872707 thread 38 bound to OS proc set {38}
OMP: pid 872571 tid 872708 thread 39 bound to OS proc set {39}
OMP: pid 872571 tid 872737 thread 68 bound to OS proc set {68}
OMP: pid 872571 tid 872723 thread 54 bound to OS proc set {54}
OMP: pid 872571 tid 872735 thread 66 bound to OS proc set {66}
OMP: pid 872571 tid 872706 thread 37 bound to OS proc set {37}
OMP: pid 872571 tid 872734 thread 65 bound to OS proc set {65}
OMP: pid 872571 tid 872743 thread 74 bound to OS proc set {74}
OMP: pid 872571 tid 872722 thread 53 bound to OS proc set {53}
OMP: pid 872571 tid 872742 thread 73 bound to OS proc set {73}
OMP: pid 872571 tid 872746 thread 77 bound to OS proc set {77}
OMP: pid 872571 tid 872761 thread 92 bound to OS proc set {92}
OMP: pid 872571 tid 872740 thread 71 bound to OS proc set {71}
OMP: pid 872571 tid 872757 thread 88 bound to OS proc set {88}
OMP: pid 872571 tid 872762 thread 93 bound to OS proc set {93}
OMP: pid 872571 tid 872721 thread 52 bound to OS proc set {52}
OMP: pid 872571 tid 872760 thread 91 bound to OS proc set {91}
OMP: pid 872571 tid 872764 thread 95 bound to OS proc set {95}
OMP: pid 872571 tid 872739 thread 70 bound to OS proc set {70}
OMP: pid 872571 tid 872752 thread 83 bound to OS proc set {83}
OMP: pid 872571 tid 872756 thread 87 bound to OS proc set {87}
OMP: pid 872571 tid 872738 thread 69 bound to OS proc set {69}
OMP: pid 872571 tid 872750 thread 81 bound to OS proc set {81}
OMP: pid 872571 tid 872751 thread 82 bound to OS proc set {82}
OMP: pid 872571 tid 872753 thread 84 bound to OS proc set {84}
OMP: pid 872571 tid 872755 thread 86 bound to OS proc set {86}
OMP: pid 872571 tid 872758 thread 89 bound to OS proc set {89}
OMP: pid 872571 tid 872759 thread 90 bound to OS proc set {90}
OMP: pid 872571 tid 872763 thread 94 bound to OS proc set {94}
OMP: pid 872571 tid 872749 thread 80 bound to OS proc set {80}
OMP: pid 872571 tid 872726 thread 57 bound to OS proc set {57}
OMP: pid 872571 tid 872754 thread 85 bound to OS proc set {85}
{"n_kv_max": 16384, "n_batch": 2048, "n_ubatch": 512, "flash_attn": -1, "is_pp_shared": 0, "n_gpu_layers": -1, "n_threads": 96, "n_threads_batch": 96, "pp": 128, "tg": 0, "pl": 4, "n_kv": 512, "t_pp": 0.517619, "speed_pp": 989.144470, "t_tg": 0.000000, "speed_tg": nan, "t": 0.517619, "speed": 989.144470}
Your experiment path is /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-35-140.ec2.internal/176-399-7732/llama.cpp/run/oneview_runs/defaults/orig/oneview_results_1763997797/tools/lprof_npsu_run_0
To display your profiling results:
####################################################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
####################################################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-35-140.ec2.internal/176-399-7732/llama.cpp/run/oneview_runs/defaults/orig/oneview_results_1763997797/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-35-140.ec2.internal/176-399-7732/llama.cpp/run/oneview_runs/defaults/orig/oneview_results_1763997797/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-35-140.ec2.internal/176-399-7732/llama.cpp/run/oneview_runs/defaults/orig/oneview_results_1763997797/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-35-140.ec2.internal/176-399-7732/llama.cpp/run/oneview_runs/defaults/orig/oneview_results_1763997797/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-35-140.ec2.internal/176-399-7732/llama.cpp/run/oneview_runs/defaults/orig/oneview_results_1763997797/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-35-140.ec2.internal/176-399-7732/llama.cpp/run/oneview_runs/defaults/orig/oneview_results_1763997797/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-35-140.ec2.internal/176-399-7732/llama.cpp/run/oneview_runs/defaults/orig/oneview_results_1763997797/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-35-140.ec2.internal/176-399-7732/llama.cpp/run/oneview_runs/defaults/orig/oneview_results_1763997797/tools/lprof_npsu_run_0 #
####################################################################################################################################################################################################################################