Diff of the two buildlogs: -- --- b1/build.log 2023-04-11 23:48:11.118997174 +0000 +++ b2/build.log 2023-04-11 23:52:58.949403137 +0000 @@ -1,6 +1,6 @@ I: pbuilder: network access will be disabled during build -I: Current time: Mon May 13 18:06:40 -12 2024 -I: pbuilder-time-stamp: 1715666800 +I: Current time: Wed Apr 12 13:48:14 +14 2023 +I: pbuilder-time-stamp: 1681256894 I: Building the build Environment I: extracting base tarball [/var/cache/pbuilder/bookworm-reproducible-base.tgz] I: copying local configuration @@ -16,7 +16,7 @@ I: copying [./fenicsx-performance-tests_0.5.0~git20220731.821823b.orig.tar.gz] I: copying [./fenicsx-performance-tests_0.5.0~git20220731.821823b-1.debian.tar.xz] I: Extracting source -gpgv: Signature made Thu Sep 22 00:05:12 2022 -12 +gpgv: Signature made Fri Sep 23 02:05:12 2022 +14 gpgv: using RSA key 23C9A93E585819E9126D0A36573EF1E4BD5A01FA gpgv: Can't check signature: No public key dpkg-source: warning: cannot verify inline signature for ./fenicsx-performance-tests_0.5.0~git20220731.821823b-1.dsc: no acceptable signature found @@ -27,11 +27,20 @@ dpkg-source: info: applying enable_complex_build.patch I: Not using root during the build. I: Installing the build-deps -I: user script /srv/workspace/pbuilder/29246/tmp/hooks/D02_print_environment starting +I: user script /srv/workspace/pbuilder/25777/tmp/hooks/D01_modify_environment starting +debug: Running on codethink14-arm64. +I: Changing host+domainname to test build reproducibility +I: Adding a custom variable just for the fun of it... +I: Changing /bin/sh to bash +lrwxrwxrwx 1 root root 4 Jan 6 03:20 /bin/sh -> dash +I: Setting pbuilder2's login shell to /bin/bash +I: Setting pbuilder2's GECOS to second user,second room,second work-phone,second home-phone,second other +I: user script /srv/workspace/pbuilder/25777/tmp/hooks/D01_modify_environment finished +I: user script /srv/workspace/pbuilder/25777/tmp/hooks/D02_print_environment starting I: set BUILDDIR='/build' - BUILDUSERGECOS='first user,first room,first work-phone,first home-phone,first other' - BUILDUSERNAME='pbuilder1' + BUILDUSERGECOS='second user,second room,second work-phone,second home-phone,second other' + BUILDUSERNAME='pbuilder2' BUILD_ARCH='arm64' DEBIAN_FRONTEND='noninteractive' DEB_BUILD_OPTIONS='buildinfo=+all reproducible=+all parallel=8' @@ -41,38 +50,38 @@ IFS=' ' LANG='C' - LANGUAGE='en_US:en' + LANGUAGE='nl_BE:nl' LC_ALL='C' MAIL='/var/mail/root' OPTIND='1' - PATH='/usr/sbin:/usr/bin:/sbin:/bin:/usr/games' + PATH='/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/i/capture/the/path' PBCURRENTCOMMANDLINEOPERATION='build' PBUILDER_OPERATION='build' PBUILDER_PKGDATADIR='/usr/share/pbuilder' PBUILDER_PKGLIBDIR='/usr/lib/pbuilder' PBUILDER_SYSCONFDIR='/etc' - PPID='29246' + PPID='25777' PS1='# ' PS2='> ' PS4='+ ' PWD='/' SHELL='/bin/bash' SHLVL='2' - SUDO_COMMAND='/usr/bin/timeout -k 18.1h 18h /usr/bin/ionice -c 3 /usr/bin/nice /usr/sbin/pbuilder --build --configfile /srv/reproducible-results/rbuild-debian/r-b-build.rveULWfD/pbuilderrc_zgRi --distribution bookworm --hookdir /etc/pbuilder/first-build-hooks --debbuildopts -b --basetgz /var/cache/pbuilder/bookworm-reproducible-base.tgz --buildresult /srv/reproducible-results/rbuild-debian/r-b-build.rveULWfD/b1 --logfile b1/build.log fenicsx-performance-tests_0.5.0~git20220731.821823b-1.dsc' + SUDO_COMMAND='/usr/bin/timeout -k 24.1h 24h /usr/bin/ionice -c 3 /usr/bin/nice -n 11 /usr/bin/unshare --uts -- /usr/sbin/pbuilder --build --configfile /srv/reproducible-results/rbuild-debian/r-b-build.rveULWfD/pbuilderrc_fpjU --distribution bookworm --hookdir /etc/pbuilder/rebuild-hooks --debbuildopts -b --basetgz /var/cache/pbuilder/bookworm-reproducible-base.tgz --buildresult /srv/reproducible-results/rbuild-debian/r-b-build.rveULWfD/b2 --logfile b2/build.log --extrapackages usrmerge fenicsx-performance-tests_0.5.0~git20220731.821823b-1.dsc' SUDO_GID='117' SUDO_UID='110' SUDO_USER='jenkins' TERM='unknown' - TZ='/usr/share/zoneinfo/Etc/GMT+12' + TZ='/usr/share/zoneinfo/Etc/GMT-14' USER='root' USERNAME='root' _='/usr/bin/systemd-run' http_proxy='http://192.168.101.16:3128' I: uname -a - Linux codethink11-arm64 4.15.0-208-generic #220-Ubuntu SMP Mon Mar 20 14:28:12 UTC 2023 aarch64 GNU/Linux + Linux i-capture-the-hostname 4.15.0-208-generic #220-Ubuntu SMP Mon Mar 20 14:28:12 UTC 2023 aarch64 GNU/Linux I: ls -l /bin - lrwxrwxrwx 1 root root 7 May 12 04:50 /bin -> usr/bin -I: user script /srv/workspace/pbuilder/29246/tmp/hooks/D02_print_environment finished + lrwxrwxrwx 1 root root 7 Apr 11 00:24 /bin -> usr/bin +I: user script /srv/workspace/pbuilder/25777/tmp/hooks/D02_print_environment finished -> Attempting to satisfy build-dependencies -> Creating pbuilder-satisfydepends-dummy package Package: pbuilder-satisfydepends-dummy @@ -411,7 +420,7 @@ Get: 289 http://deb.debian.org/debian bookworm/main arm64 libxsimd-dev arm64 8.1.0-7 [88.8 kB] Get: 290 http://deb.debian.org/debian bookworm/main arm64 libdolfinx-dev arm64 1:0.5.2-2+b1 [155 kB] Get: 291 http://deb.debian.org/debian bookworm/main arm64 xtensor-dev all 0.24.3-1 [5372 B] -Fetched 179 MB in 6s (32.4 MB/s) +Fetched 179 MB in 3s (58.2 MB/s) debconf: delaying package configuration, since apt-utils is not installed Selecting previously unselected package libpython3.11-minimal:arm64. (Reading database ... (Reading database ... 5% (Reading database ... 10% (Reading database ... 15% (Reading database ... 20% (Reading database ... 25% (Reading database ... 30% (Reading database ... 35% (Reading database ... 40% (Reading database ... 45% (Reading database ... 50% (Reading database ... 55% (Reading database ... 60% (Reading database ... 65% (Reading database ... 70% (Reading database ... 75% (Reading database ... 80% (Reading database ... 85% (Reading database ... 90% (Reading database ... 95% (Reading database ... 100% (Reading database ... 19616 files and directories currently installed.) @@ -1614,8 +1623,17 @@ Writing extended state information... Building tag database... -> Finished parsing the build-deps +Reading package lists... +Building dependency tree... +Reading state information... +usrmerge is already the newest version (35). +0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded. I: Building the package -I: Running cd /build/fenicsx-performance-tests-0.5.0~git20220731.821823b/ && env PATH="/usr/sbin:/usr/bin:/sbin:/bin:/usr/games" HOME="/nonexistent/first-build" dpkg-buildpackage -us -uc -b && env PATH="/usr/sbin:/usr/bin:/sbin:/bin:/usr/games" HOME="/nonexistent/first-build" dpkg-genchanges -S > ../fenicsx-performance-tests_0.5.0~git20220731.821823b-1_source.changes +I: user script /srv/workspace/pbuilder/25777/tmp/hooks/A99_set_merged_usr starting +Re-configuring usrmerge... +I: user script /srv/workspace/pbuilder/25777/tmp/hooks/A99_set_merged_usr finished +hostname: Temporary failure in name resolution +I: Running cd /build/fenicsx-performance-tests-0.5.0~git20220731.821823b/ && env PATH="/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/i/capture/the/path" HOME="/nonexistent/second-build" dpkg-buildpackage -us -uc -b && env PATH="/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/i/capture/the/path" HOME="/nonexistent/second-build" dpkg-genchanges -S > ../fenicsx-performance-tests_0.5.0~git20220731.821823b-1_source.changes dpkg-buildpackage: info: source package fenicsx-performance-tests dpkg-buildpackage: info: source version 0.5.0~git20220731.821823b-1 dpkg-buildpackage: info: source distribution unstable @@ -1719,28 +1737,28 @@ make[2]: Entering directory '/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu' make -f CMakeFiles/dolfinx-scaling-test.dir/build.make CMakeFiles/dolfinx-scaling-test.dir/depend make[3]: Entering directory '/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu' -[ 20%] Generating Poisson.c -[ 20%] Generating Elasticity.c +[ 10%] Generating Poisson.c ffcx /build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src/Poisson.py +[ 20%] Generating Elasticity.c ffcx /build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src/Elasticity.py cd "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src" "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src" "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/CMakeFiles/dolfinx-scaling-test.dir/DependInfo.cmake" --color= make[3]: Leaving directory '/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu' make -f CMakeFiles/dolfinx-scaling-test.dir/build.make CMakeFiles/dolfinx-scaling-test.dir/build make[3]: Entering directory '/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu' +[ 40%] Building CXX object CMakeFiles/dolfinx-scaling-test.dir/mesh.cpp.o +[ 40%] Building CXX object CMakeFiles/dolfinx-scaling-test.dir/main.cpp.o [ 50%] Building CXX object CMakeFiles/dolfinx-scaling-test.dir/elasticity_problem.cpp.o -[ 50%] Building CXX object CMakeFiles/dolfinx-scaling-test.dir/mesh.cpp.o -[ 50%] Building CXX object CMakeFiles/dolfinx-scaling-test.dir/main.cpp.o /usr/bin/c++ -DBOOST_ALL_NO_LIB -DBOOST_CHRONO_DYN_LINK -DBOOST_PROGRAM_OPTIONS_DYN_LINK -DBOOST_TIMER_DYN_LINK -DDOLFINX_VERSION=\"0.5.2\" -DHAS_PTSCOTCH -DHAS_SLEPC -DXTENSOR_USE_XSIMD -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src" -isystem /usr/lib/python3/dist-packages/ffcx/codegeneration -isystem /usr/lib/aarch64-linux-gnu/openmpi/include -isystem /usr/lib/aarch64-linux-gnu/openmpi/include/openmpi -isystem /usr/lib/petscdir/petsc3.18/aarch64-linux-gnu-real/include -isystem /usr/include/hdf5/openmpi -isystem /usr/lib/slepcdir/slepc3.18/aarch64-linux-gnu-real/include -g -O2 -ffile-prefix-map=/build/fenicsx-performance-tests-0.5.0~git20220731.821823b=. -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG -Wdate-time -D_FORTIFY_SOURCE=2 -std=c++20 -MD -MT CMakeFiles/dolfinx-scaling-test.dir/main.cpp.o -MF CMakeFiles/dolfinx-scaling-test.dir/main.cpp.o.d -o CMakeFiles/dolfinx-scaling-test.dir/main.cpp.o -c "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src/main.cpp" /usr/bin/c++ -DBOOST_ALL_NO_LIB -DBOOST_CHRONO_DYN_LINK -DBOOST_PROGRAM_OPTIONS_DYN_LINK -DBOOST_TIMER_DYN_LINK -DDOLFINX_VERSION=\"0.5.2\" -DHAS_PTSCOTCH -DHAS_SLEPC -DXTENSOR_USE_XSIMD -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src" -isystem /usr/lib/python3/dist-packages/ffcx/codegeneration -isystem /usr/lib/aarch64-linux-gnu/openmpi/include -isystem /usr/lib/aarch64-linux-gnu/openmpi/include/openmpi -isystem /usr/lib/petscdir/petsc3.18/aarch64-linux-gnu-real/include -isystem /usr/include/hdf5/openmpi -isystem /usr/lib/slepcdir/slepc3.18/aarch64-linux-gnu-real/include -g -O2 -ffile-prefix-map=/build/fenicsx-performance-tests-0.5.0~git20220731.821823b=. -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG -Wdate-time -D_FORTIFY_SOURCE=2 -std=c++20 -MD -MT CMakeFiles/dolfinx-scaling-test.dir/mesh.cpp.o -MF CMakeFiles/dolfinx-scaling-test.dir/mesh.cpp.o.d -o CMakeFiles/dolfinx-scaling-test.dir/mesh.cpp.o -c "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src/mesh.cpp" /usr/bin/c++ -DBOOST_ALL_NO_LIB -DBOOST_CHRONO_DYN_LINK -DBOOST_PROGRAM_OPTIONS_DYN_LINK -DBOOST_TIMER_DYN_LINK -DDOLFINX_VERSION=\"0.5.2\" -DHAS_PTSCOTCH -DHAS_SLEPC -DXTENSOR_USE_XSIMD -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src" -isystem /usr/lib/python3/dist-packages/ffcx/codegeneration -isystem /usr/lib/aarch64-linux-gnu/openmpi/include -isystem /usr/lib/aarch64-linux-gnu/openmpi/include/openmpi -isystem /usr/lib/petscdir/petsc3.18/aarch64-linux-gnu-real/include -isystem /usr/include/hdf5/openmpi -isystem /usr/lib/slepcdir/slepc3.18/aarch64-linux-gnu-real/include -g -O2 -ffile-prefix-map=/build/fenicsx-performance-tests-0.5.0~git20220731.821823b=. -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG -Wdate-time -D_FORTIFY_SOURCE=2 -std=c++20 -MD -MT CMakeFiles/dolfinx-scaling-test.dir/elasticity_problem.cpp.o -MF CMakeFiles/dolfinx-scaling-test.dir/elasticity_problem.cpp.o.d -o CMakeFiles/dolfinx-scaling-test.dir/elasticity_problem.cpp.o -c "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src/elasticity_problem.cpp" -[ 80%] Building CXX object CMakeFiles/dolfinx-scaling-test.dir/poisson_problem.cpp.o -[ 80%] Building CXX object CMakeFiles/dolfinx-scaling-test.dir/mem.cpp.o -[ 80%] Building C object CMakeFiles/dolfinx-scaling-test.dir/Poisson.c.o +[ 60%] Building CXX object CMakeFiles/dolfinx-scaling-test.dir/mem.cpp.o +[ 70%] Building CXX object CMakeFiles/dolfinx-scaling-test.dir/poisson_problem.cpp.o /usr/bin/c++ -DBOOST_ALL_NO_LIB -DBOOST_CHRONO_DYN_LINK -DBOOST_PROGRAM_OPTIONS_DYN_LINK -DBOOST_TIMER_DYN_LINK -DDOLFINX_VERSION=\"0.5.2\" -DHAS_PTSCOTCH -DHAS_SLEPC -DXTENSOR_USE_XSIMD -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src" -isystem /usr/lib/python3/dist-packages/ffcx/codegeneration -isystem /usr/lib/aarch64-linux-gnu/openmpi/include -isystem /usr/lib/aarch64-linux-gnu/openmpi/include/openmpi -isystem /usr/lib/petscdir/petsc3.18/aarch64-linux-gnu-real/include -isystem /usr/include/hdf5/openmpi -isystem /usr/lib/slepcdir/slepc3.18/aarch64-linux-gnu-real/include -g -O2 -ffile-prefix-map=/build/fenicsx-performance-tests-0.5.0~git20220731.821823b=. -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG -Wdate-time -D_FORTIFY_SOURCE=2 -std=c++20 -MD -MT CMakeFiles/dolfinx-scaling-test.dir/poisson_problem.cpp.o -MF CMakeFiles/dolfinx-scaling-test.dir/poisson_problem.cpp.o.d -o CMakeFiles/dolfinx-scaling-test.dir/poisson_problem.cpp.o -c "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src/poisson_problem.cpp" +[ 80%] Building C object CMakeFiles/dolfinx-scaling-test.dir/Elasticity.c.o /usr/bin/c++ -DBOOST_ALL_NO_LIB -DBOOST_CHRONO_DYN_LINK -DBOOST_PROGRAM_OPTIONS_DYN_LINK -DBOOST_TIMER_DYN_LINK -DDOLFINX_VERSION=\"0.5.2\" -DHAS_PTSCOTCH -DHAS_SLEPC -DXTENSOR_USE_XSIMD -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src" -isystem /usr/lib/python3/dist-packages/ffcx/codegeneration -isystem /usr/lib/aarch64-linux-gnu/openmpi/include -isystem /usr/lib/aarch64-linux-gnu/openmpi/include/openmpi -isystem /usr/lib/petscdir/petsc3.18/aarch64-linux-gnu-real/include -isystem /usr/include/hdf5/openmpi -isystem /usr/lib/slepcdir/slepc3.18/aarch64-linux-gnu-real/include -g -O2 -ffile-prefix-map=/build/fenicsx-performance-tests-0.5.0~git20220731.821823b=. -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG -Wdate-time -D_FORTIFY_SOURCE=2 -std=c++20 -MD -MT CMakeFiles/dolfinx-scaling-test.dir/mem.cpp.o -MF CMakeFiles/dolfinx-scaling-test.dir/mem.cpp.o.d -o CMakeFiles/dolfinx-scaling-test.dir/mem.cpp.o -c "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src/mem.cpp" -/usr/bin/cc -DBOOST_ALL_NO_LIB -DBOOST_CHRONO_DYN_LINK -DBOOST_PROGRAM_OPTIONS_DYN_LINK -DBOOST_TIMER_DYN_LINK -DDOLFINX_VERSION=\"0.5.2\" -DHAS_PTSCOTCH -DHAS_SLEPC -DXTENSOR_USE_XSIMD -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src" -isystem /usr/lib/python3/dist-packages/ffcx/codegeneration -isystem /usr/lib/aarch64-linux-gnu/openmpi/include -isystem /usr/lib/aarch64-linux-gnu/openmpi/include/openmpi -isystem /usr/lib/petscdir/petsc3.18/aarch64-linux-gnu-real/include -isystem /usr/include/hdf5/openmpi -isystem /usr/lib/slepcdir/slepc3.18/aarch64-linux-gnu-real/include -g -O2 -ffile-prefix-map=/build/fenicsx-performance-tests-0.5.0~git20220731.821823b=. -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG -Wdate-time -D_FORTIFY_SOURCE=2 -MD -MT CMakeFiles/dolfinx-scaling-test.dir/Poisson.c.o -MF CMakeFiles/dolfinx-scaling-test.dir/Poisson.c.o.d -o CMakeFiles/dolfinx-scaling-test.dir/Poisson.c.o -c "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/Poisson.c" -[ 90%] Building C object CMakeFiles/dolfinx-scaling-test.dir/Elasticity.c.o /usr/bin/cc -DBOOST_ALL_NO_LIB -DBOOST_CHRONO_DYN_LINK -DBOOST_PROGRAM_OPTIONS_DYN_LINK -DBOOST_TIMER_DYN_LINK -DDOLFINX_VERSION=\"0.5.2\" -DHAS_PTSCOTCH -DHAS_SLEPC -DXTENSOR_USE_XSIMD -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src" -isystem /usr/lib/python3/dist-packages/ffcx/codegeneration -isystem /usr/lib/aarch64-linux-gnu/openmpi/include -isystem /usr/lib/aarch64-linux-gnu/openmpi/include/openmpi -isystem /usr/lib/petscdir/petsc3.18/aarch64-linux-gnu-real/include -isystem /usr/include/hdf5/openmpi -isystem /usr/lib/slepcdir/slepc3.18/aarch64-linux-gnu-real/include -g -O2 -ffile-prefix-map=/build/fenicsx-performance-tests-0.5.0~git20220731.821823b=. -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG -Wdate-time -D_FORTIFY_SOURCE=2 -MD -MT CMakeFiles/dolfinx-scaling-test.dir/Elasticity.c.o -MF CMakeFiles/dolfinx-scaling-test.dir/Elasticity.c.o.d -o CMakeFiles/dolfinx-scaling-test.dir/Elasticity.c.o -c "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/Elasticity.c" +[ 90%] Building C object CMakeFiles/dolfinx-scaling-test.dir/Poisson.c.o +/usr/bin/cc -DBOOST_ALL_NO_LIB -DBOOST_CHRONO_DYN_LINK -DBOOST_PROGRAM_OPTIONS_DYN_LINK -DBOOST_TIMER_DYN_LINK -DDOLFINX_VERSION=\"0.5.2\" -DHAS_PTSCOTCH -DHAS_SLEPC -DXTENSOR_USE_XSIMD -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu" -I"/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/src" -isystem /usr/lib/python3/dist-packages/ffcx/codegeneration -isystem /usr/lib/aarch64-linux-gnu/openmpi/include -isystem /usr/lib/aarch64-linux-gnu/openmpi/include/openmpi -isystem /usr/lib/petscdir/petsc3.18/aarch64-linux-gnu-real/include -isystem /usr/include/hdf5/openmpi -isystem /usr/lib/slepcdir/slepc3.18/aarch64-linux-gnu-real/include -g -O2 -ffile-prefix-map=/build/fenicsx-performance-tests-0.5.0~git20220731.821823b=. -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG -Wdate-time -D_FORTIFY_SOURCE=2 -MD -MT CMakeFiles/dolfinx-scaling-test.dir/Poisson.c.o -MF CMakeFiles/dolfinx-scaling-test.dir/Poisson.c.o.d -o CMakeFiles/dolfinx-scaling-test.dir/Poisson.c.o -c "/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/Poisson.c" [100%] Linking CXX executable dolfinx-scaling-test /usr/bin/cmake -E cmake_link_script CMakeFiles/dolfinx-scaling-test.dir/link.txt --verbose=1 /usr/bin/c++ -g -O2 -ffile-prefix-map=/build/fenicsx-performance-tests-0.5.0~git20220731.821823b=. -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG -Wl,-z,relro "CMakeFiles/dolfinx-scaling-test.dir/main.cpp.o" "CMakeFiles/dolfinx-scaling-test.dir/mesh.cpp.o" "CMakeFiles/dolfinx-scaling-test.dir/elasticity_problem.cpp.o" "CMakeFiles/dolfinx-scaling-test.dir/poisson_problem.cpp.o" "CMakeFiles/dolfinx-scaling-test.dir/mem.cpp.o" "CMakeFiles/dolfinx-scaling-test.dir/Elasticity.c.o" "CMakeFiles/dolfinx-scaling-test.dir/Poisson.c.o" -o dolfinx-scaling-test /usr/lib/aarch64-linux-gnu/libdolfinx_real.so.0.5.2 /usr/lib/aarch64-linux-gnu/libboost_program_options.so.1.74.0 /usr/lib/aarch64-linux-gnu/libpugixml.so.1.13 /usr/lib/aarch64-linux-gnu/libbasix.so.0.5.1 /usr/lib/aarch64-linux-gnu/libboost_timer.so.1.74.0 /usr/lib/aarch64-linux-gnu/libboost_chrono.so.1.74.0 /usr/lib/aarch64-linux-gnu/openmpi/lib/libmpi_cxx.so /usr/lib/aarch64-linux-gnu/openmpi/lib/libmpi.so /usr/lib/aarch64-linux-gnu/hdf5/openmpi/libhdf5.so /usr/lib/slepcdir/slepc3.18/aarch64-linux-gnu-real/lib/libslepc_real.so /usr/lib/petscdir/petsc3.18/aarch64-linux-gnu-real/lib/libpetsc_real.so @@ -1753,61 +1771,61 @@ make[1]: Entering directory '/build/fenicsx-performance-tests-0.5.0~git20220731.821823b' debian/tests/run-dolfinx-scaling-test -p /build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu ==== elasticity weak scaling (3 processes with ndofs=10000) ==== -2024-05-13 18:09:56.094 ( 0.061s) [RANK: 0 ] utils.cpp:612 INFO| Compute partition of cells across ranks -2024-05-13 18:09:56.094 ( 0.061s) [RANK: 0 ] graphbuild.cpp:533 INFO| Building mesh dual graph -2024-05-13 18:09:56.094 ( 0.061s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph -2024-05-13 18:09:56.128 ( 0.096s) [RANK: 0 ] graphbuild.cpp:89 INFO| Build nonlocal part of mesh dual graph -2024-05-13 18:09:56.128 ( 0.096s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:09:56.129 ( 0.096s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:09:56.129 ( 0.096s) [RANK: 0 ] graphbuild.cpp:203 INFO| Number of destination and source ranks in non-local dual graph construction, and ratio to total number of ranks: 2, 2, 0.666667, 0.666667 -2024-05-13 18:09:56.132 ( 0.099s) [RANK: 0 ] graphbuild.cpp:545 INFO| Graph edges (local: 66332, non-local: 962) -2024-05-13 18:09:56.132 ( 0.099s) [RANK: 0 ] partitioners.cpp:316 INFO| Compute graph partition using PT-SCOTCH -2024-05-13 18:10:04.235 ( 8.202s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 -2024-05-13 18:10:04.235 ( 8.203s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 -2024-05-13 18:10:04.238 ( 8.206s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph -2024-05-13 18:10:04.272 ( 8.239s) [RANK: 0 ] ordering.cpp:202 INFO| GPS pseudo-diameter:(120) 16696-1198 - -2024-05-13 18:10:04.277 ( 8.244s) [RANK: 0 ] Topology.cpp:923 INFO| Create topology -2024-05-13 18:10:04.279 ( 8.247s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 -2024-05-13 18:10:04.279 ( 8.247s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 -2024-05-13 18:10:04.293 ( 8.260s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:04.293 ( 8.261s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:04.294 ( 8.261s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:04.294 ( 8.261s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:04.297 ( 8.264s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:04.297 ( 8.264s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:04.297 ( 8.264s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:04.297 ( 8.264s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:04.297 ( 8.265s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:04.297 ( 8.265s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:04.300 ( 8.268s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:04.300 ( 8.268s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:04.301 ( 8.268s) [RANK: 0 ] MPI.h:368 INFO| Number of neighbourhood source ranks in distribute_to_postoffice: 0 -2024-05-13 18:10:04.301 ( 8.268s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:04.301 ( 8.268s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:04.301 ( 8.268s) [RANK: 0 ] MPI.h:500 INFO| Neighbourhood destination ranks from post office in distribute_data (rank, num dests, num dests/mpi_size): 0, 2, 0.666667 +2023-04-12 13:51:43.756 ( 0.113s) [RANK: 0 ] utils.cpp:612 INFO| Compute partition of cells across ranks +2023-04-12 13:51:43.756 ( 0.113s) [RANK: 0 ] graphbuild.cpp:533 INFO| Building mesh dual graph +2023-04-12 13:51:43.756 ( 0.113s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph +2023-04-12 13:51:43.792 ( 0.150s) [RANK: 0 ] graphbuild.cpp:89 INFO| Build nonlocal part of mesh dual graph +2023-04-12 13:51:43.793 ( 0.150s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:43.793 ( 0.150s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:43.793 ( 0.150s) [RANK: 0 ] graphbuild.cpp:203 INFO| Number of destination and source ranks in non-local dual graph construction, and ratio to total number of ranks: 2, 2, 0.666667, 0.666667 +2023-04-12 13:51:43.797 ( 0.154s) [RANK: 0 ] graphbuild.cpp:545 INFO| Graph edges (local: 66332, non-local: 962) +2023-04-12 13:51:43.797 ( 0.154s) [RANK: 0 ] partitioners.cpp:316 INFO| Compute graph partition using PT-SCOTCH +2023-04-12 13:51:52.461 ( 8.818s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 +2023-04-12 13:51:52.461 ( 8.818s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 +2023-04-12 13:51:52.464 ( 8.822s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph +2023-04-12 13:51:52.497 ( 8.854s) [RANK: 0 ] ordering.cpp:202 INFO| GPS pseudo-diameter:(121) 16044-636 + +2023-04-12 13:51:52.502 ( 8.859s) [RANK: 0 ] Topology.cpp:923 INFO| Create topology +2023-04-12 13:51:52.508 ( 8.866s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 +2023-04-12 13:51:52.509 ( 8.866s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 +2023-04-12 13:51:52.522 ( 8.880s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:52.523 ( 8.881s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:52.523 ( 8.881s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:52.524 ( 8.881s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:52.527 ( 8.884s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:52.527 ( 8.884s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:52.527 ( 8.884s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:51:52.527 ( 8.884s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:51:52.527 ( 8.885s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:52.528 ( 8.885s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:52.531 ( 8.888s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:51:52.531 ( 8.888s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:51:52.531 ( 8.888s) [RANK: 0 ] MPI.h:368 INFO| Number of neighbourhood source ranks in distribute_to_postoffice: 0 +2023-04-12 13:51:52.531 ( 8.888s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:52.531 ( 8.889s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:52.531 ( 8.889s) [RANK: 0 ] MPI.h:500 INFO| Neighbourhood destination ranks from post office in distribute_data (rank, num dests, num dests/mpi_size): 0, 2, 0.666667 UnitCube (24x19x19) to be refined 0 times -2024-05-13 18:10:04.303 ( 8.271s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 2 -2024-05-13 18:10:04.316 ( 8.283s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:04.316 ( 8.283s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:04.369 ( 8.336s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:04.369 ( 8.337s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:04.371 ( 8.338s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 -2024-05-13 18:10:04.371 ( 8.339s) [RANK: 0 ]topologycomputation.cpp:650 INFO| Computing mesh connectivity 2 - 3 from transpose. -2024-05-13 18:10:04.374 ( 8.341s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 0 -2024-05-13 18:10:04.377 ( 8.344s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:04.377 ( 8.344s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:04.377 ( 8.345s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:04.377 ( 8.345s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:04.377 ( 8.345s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:04.377 ( 8.345s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:04.378 ( 8.345s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:04.378 ( 8.345s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:04.379 ( 8.346s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 -2024-05-13 18:10:04.380 ( 8.347s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 -2024-05-13 18:10:04.380 ( 8.347s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 0 -2024-05-13 18:10:04.380 ( 8.348s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 -2024-05-13 18:10:04.413 ( 8.381s) [RANK: 0 ] SparsityPattern.cpp:389 INFO| Column ghost size increased from 220 to 547 +2023-04-12 13:51:52.534 ( 8.891s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 2 +2023-04-12 13:51:52.547 ( 8.904s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:52.547 ( 8.904s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:52.601 ( 8.959s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:52.601 ( 8.959s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:52.603 ( 8.961s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 +2023-04-12 13:51:52.604 ( 8.961s) [RANK: 0 ]topologycomputation.cpp:650 INFO| Computing mesh connectivity 2 - 3 from transpose. +2023-04-12 13:51:52.606 ( 8.964s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 0 +2023-04-12 13:51:52.609 ( 8.966s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:52.609 ( 8.967s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:52.610 ( 8.967s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:52.610 ( 8.967s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:52.610 ( 8.967s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:51:52.610 ( 8.967s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:51:52.610 ( 8.968s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:52.611 ( 8.968s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:52.612 ( 8.969s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 +2023-04-12 13:51:52.612 ( 8.970s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 +2023-04-12 13:51:52.612 ( 8.970s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 0 +2023-04-12 13:51:52.613 ( 8.970s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 +2023-04-12 13:51:52.646 ( 9.003s) [RANK: 0 ] SparsityPattern.cpp:389 INFO| Column ghost size increased from 219 to 545 ---------------------------------------------------------------- Test problem summary @@ -1822,7 +1840,7 @@ Total degrees of freedom: 30000 (30 thousand) Average degrees of freedom per process: 10000 ---------------------------------------------------------------- -2024-05-13 18:10:04.661 ( 8.629s) [RANK: 0 ] petsc.cpp:677 INFO| PETSc Krylov solver starting to solve system. +2023-04-12 13:51:52.902 ( 9.259s) [RANK: 0 ] petsc.cpp:677 INFO| PETSc Krylov solver starting to solve system. KSP Object: 3 MPI processes type: cg maximum iterations=10000, initial guess is zero @@ -1840,7 +1858,7 @@ AGG specific options Number of levels to square graph 1 Number smoothing steps 1 - Complexity: grid = 1.067 operator = 1.27312 + Complexity: grid = 1.0708 operator = 1.29225 Coarse grid solver -- level 0 ------------------------------- KSP Object: (mg_coarse_) 3 MPI processes type: preonly @@ -1865,33 +1883,33 @@ tolerance for zero pivot 2.22045e-14 using diagonal shift on blocks to prevent zero pivot [INBLOCKS] matrix ordering: nd - factor fill ratio given 5., needed 1.04049 + factor fill ratio given 5., needed 1.06422 Factored matrix follows: Mat Object: (mg_coarse_sub_) 1 MPI process type: seqaij - rows=240, cols=240, bs=6 + rows=228, cols=228, bs=6 package used to perform factorization: petsc - total: nonzeros=55512, allocated nonzeros=55512 - using I-node routines: found 59 nodes, limit used is 5 + total: nonzeros=50112, allocated nonzeros=50112 + using I-node routines: found 56 nodes, limit used is 5 linear system matrix = precond matrix: Mat Object: (mg_coarse_sub_) 1 MPI process type: seqaij - rows=240, cols=240, bs=6 - total: nonzeros=53352, allocated nonzeros=53352 + rows=228, cols=228, bs=6 + total: nonzeros=47088, allocated nonzeros=47088 total number of mallocs used during MatSetValues calls=0 - using I-node routines: found 71 nodes, limit used is 5 + using I-node routines: found 66 nodes, limit used is 5 linear system matrix = precond matrix: Mat Object: 3 MPI processes type: mpiaij - rows=240, cols=240, bs=6 - total: nonzeros=53352, allocated nonzeros=53352 + rows=228, cols=228, bs=6 + total: nonzeros=47088, allocated nonzeros=47088 total number of mallocs used during MatSetValues calls=0 - using I-node (on process 0) routines: found 71 nodes, limit used is 5 + using I-node (on process 0) routines: found 66 nodes, limit used is 5 Down solver (pre-smoother) on level 1 ------------------------------- KSP Object: (mg_levels_1_) 3 MPI processes type: chebyshev - eigenvalue targets used: min 0.282894, max 3.11183 - eigenvalues provided (min 2.46813e-06, max 2.82894) with transform: [0. 0.1; 0. 1.1] + eigenvalue targets used: min 0.232991, max 2.5629 + eigenvalues provided (min 1.23066e-06, max 2.32991) with transform: [0. 0.1; 0. 1.1] maximum iterations=2, nonzero initial guess tolerances: relative=1e-05, absolute=1e-50, divergence=10000. left preconditioning @@ -1902,17 +1920,17 @@ linear system matrix = precond matrix: Mat Object: 3 MPI processes type: mpiaij - rows=1770, cols=1770, bs=6 - total: nonzeros=288468, allocated nonzeros=288468 + rows=1896, cols=1896, bs=6 + total: nonzeros=318672, allocated nonzeros=318672 total number of mallocs used during MatSetValues calls=0 using scalable MatPtAP() implementation - using I-node (on process 0) routines: found 184 nodes, limit used is 5 + using I-node (on process 0) routines: found 172 nodes, limit used is 5 Up solver (post-smoother) same as down solver (pre-smoother) Down solver (pre-smoother) on level 2 ------------------------------- KSP Object: (mg_levels_2_) 3 MPI processes type: chebyshev - eigenvalue targets used: min 0.258389, max 2.84228 - eigenvalues provided (min 0.0725603, max 2.58389) with transform: [0. 0.1; 0. 1.1] + eigenvalue targets used: min 0.258374, max 2.84212 + eigenvalues provided (min 0.0713585, max 2.58374) with transform: [0. 0.1; 0. 1.1] maximum iterations=2, nonzero initial guess tolerances: relative=1e-05, absolute=1e-50, divergence=10000. left preconditioning @@ -1927,7 +1945,7 @@ total: nonzeros=1251522, allocated nonzeros=1251522 total number of mallocs used during MatSetValues calls=0 has attached near null space - using I-node (on process 0) routines: found 3393 nodes, limit used is 5 + using I-node (on process 0) routines: found 3384 nodes, limit used is 5 Up solver (post-smoother) same as down solver (pre-smoother) linear system matrix = precond matrix: Mat Object: 3 MPI processes @@ -1936,50 +1954,50 @@ total: nonzeros=1251522, allocated nonzeros=1251522 total number of mallocs used during MatSetValues calls=0 has attached near null space - using I-node (on process 0) routines: found 3393 nodes, limit used is 5 + using I-node (on process 0) routines: found 3384 nodes, limit used is 5 [MPI_MAX] Summary of timings | reps wall avg wall tot ------------------------------------------------------------------------------------------------------- -Build BoxMesh | 1 8.211101 8.211101 -Build dofmap data | 2 0.004336 0.008673 -Build sparsity | 1 0.005406 0.005406 -Compute connectivity 2-3 | 1 0.001118 0.001118 -Compute dof reordering map | 2 0.000385 0.000771 -Compute entities of dim = 2 | 1 0.067955 0.067955 -Compute graph partition (SCOTCH) | 1 8.101201 8.101201 -Compute local part of mesh dual graph | 2 0.032983 0.065965 -Compute local-to-local map | 1 0.000531 0.000531 -Compute non-local part of mesh dual graph | 1 0.006642 0.006642 -Compute-local-to-global links for global/local adjacency list | 1 0.000589 0.000589 -Distribute AdjacencyList nodes to destination ranks | 1 0.005434 0.005434 -Distribute row-wise data (scalable) | 1 0.001163 0.001163 -Extract partition boundaries from SCOTCH graph | 1 0.001167 0.001167 -GPS: create_level_structure | 2 0.000753 0.003475 -Get SCOTCH graph data | 1 0.000015 0.000015 -Gibbs-Poole-Stockmeyer ordering | 1 0.006756 0.006756 -Init MPI | 1 0.057330 0.057330 -Init PETSc | 1 0.001739 0.001739 -Init dofmap from element dofmap | 2 0.002255 0.004510 -Init logging | 1 0.000052 0.000052 -PETSc Krylov solver | 1 1.096678 1.096678 -SCOTCH: call SCOTCH_dgraphBuild | 1 0.000279 0.000279 -SCOTCH: call SCOTCH_dgraphHalo | 1 0.031785 0.031785 -SCOTCH: call SCOTCH_dgraphPart | 1 8.073785 8.073785 -SparsityPattern::assemble | 1 0.013285 0.013285 -Topology: create | 1 0.019020 0.019020 -Topology: determine shared index ownership | 1 0.002458 0.002458 -Topology: determine vertex ownership groups (owned, undetermined, unowned) | 1 0.002596 0.002596 -ZZZ Assemble matrix | 1 0.224202 0.224202 -ZZZ Assemble vector | 1 0.008552 0.008552 -ZZZ Create Mesh | 1 8.211336 8.211336 -ZZZ Create RHS function | 1 0.015527 0.015527 -ZZZ Create boundary conditions | 1 0.002219 0.002219 -ZZZ Create facets and facet->cell connectivity | 1 0.069145 0.069145 -ZZZ Create forms | 1 0.000084 0.000084 +Build BoxMesh | 1 8.779362 8.779362 +Build dofmap data | 2 0.004380 0.008760 +Build sparsity | 1 0.005663 0.005663 +Compute connectivity 2-3 | 1 0.001189 0.001189 +Compute dof reordering map | 2 0.000391 0.000783 +Compute entities of dim = 2 | 1 0.070031 0.070031 +Compute graph partition (SCOTCH) | 1 8.661923 8.661923 +Compute local part of mesh dual graph | 2 0.035517 0.071034 +Compute local-to-local map | 1 0.000577 0.000577 +Compute non-local part of mesh dual graph | 1 0.006123 0.006123 +Compute-local-to-global links for global/local adjacency list | 1 0.000661 0.000661 +Distribute AdjacencyList nodes to destination ranks | 1 0.009296 0.009296 +Distribute row-wise data (scalable) | 1 0.001261 0.001261 +Extract partition boundaries from SCOTCH graph | 1 0.001096 0.001096 +GPS: create_level_structure | 2 0.000855 0.002564 +Get SCOTCH graph data | 1 0.000016 0.000016 +Gibbs-Poole-Stockmeyer ordering | 1 0.006519 0.006519 +Init MPI | 1 0.091022 0.091022 +Init PETSc | 1 0.019970 0.019970 +Init dofmap from element dofmap | 2 0.002319 0.004637 +Init logging | 1 0.000062 0.000062 +PETSc Krylov solver | 1 1.164353 1.164353 +SCOTCH: call SCOTCH_dgraphBuild | 1 0.000283 0.000283 +SCOTCH: call SCOTCH_dgraphHalo | 1 0.051679 0.051679 +SCOTCH: call SCOTCH_dgraphPart | 1 8.622577 8.622577 +SparsityPattern::assemble | 1 0.012430 0.012430 +Topology: create | 1 0.022225 0.022225 +Topology: determine shared index ownership | 1 0.004795 0.004795 +Topology: determine vertex ownership groups (owned, undetermined, unowned) | 1 0.002645 0.002645 +ZZZ Assemble matrix | 1 0.229238 0.229238 +ZZZ Assemble vector | 1 0.010236 0.010236 +ZZZ Create Mesh | 1 8.779620 8.779620 +ZZZ Create RHS function | 1 0.015991 0.015991 +ZZZ Create boundary conditions | 1 0.002202 0.002202 +ZZZ Create facets and facet->cell connectivity | 1 0.071300 0.071300 +ZZZ Create forms | 1 0.000085 0.000085 ZZZ Create near-nullspace | 1 0.008401 0.008401 -ZZZ FunctionSpace | 1 0.006406 0.006406 -ZZZ Solve | 1 1.100684 1.100684 -*** Number of Krylov iterations: 21 +ZZZ FunctionSpace | 1 0.006421 0.006421 +ZZZ Solve | 1 1.169507 1.169507 +*** Number of Krylov iterations: 20 *** Solution norm: 3.6795e-05 **************************************************************************************************************************************************************** *** WIDEN YOUR WINDOW TO 160 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** @@ -1987,17 +2005,17 @@ ------------------------------------------------------------------ PETSc Performance Summary: ------------------------------------------------------------------ -/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/dolfinx-scaling-test on a named codethink11-arm64 with 3 processors, by pbuilder1 Mon May 13 18:10:05 2024 +/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/dolfinx-scaling-test on a named i-capture-the-hostname with 3 processors, by pbuilder2 Wed Apr 12 13:51:54 2023 Using Petsc Release Version 3.18.5, Feb 27, 2023 Max Max/Min Avg Total -Time (sec): 9.670e+00 1.000 9.670e+00 +Time (sec): 1.032e+01 1.000 1.032e+01 Objects: 4.630e+02 1.004 4.617e+02 -Flops: 2.197e+08 1.071 2.122e+08 6.366e+08 -Flops/sec: 2.272e+07 1.071 2.194e+07 6.583e+07 -MPI Msg Count: 8.210e+02 1.041 7.993e+02 2.398e+03 -MPI Msg Len (bytes): 6.966e+06 1.462 7.103e+03 1.703e+07 -MPI Reductions: 6.370e+02 1.000 +Flops: 2.179e+08 1.091 2.097e+08 6.291e+08 +Flops/sec: 2.112e+07 1.091 2.033e+07 6.098e+07 +MPI Msg Count: 7.950e+02 1.041 7.740e+02 2.322e+03 +MPI Msg Len (bytes): 6.992e+06 1.537 7.255e+03 1.685e+07 +MPI Reductions: 6.330e+02 1.000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops @@ -2005,7 +2023,7 @@ Summary of Stages: ----- Time ------ ----- Flop ------ --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total Count %Total Avg %Total Count %Total - 0: Main Stage: 9.6700e+00 100.0% 6.3662e+08 100.0% 2.398e+03 100.0% 7.103e+03 100.0% 6.190e+02 97.2% + 0: Main Stage: 1.0316e+01 100.0% 6.2911e+08 100.0% 2.322e+03 100.0% 7.255e+03 100.0% 6.150e+02 97.2% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. @@ -2029,79 +2047,79 @@ --- Event Stage 0: Main Stage -BuildTwoSided 103 1.0 7.6468e-02 5.2 0.00e+00 0.0 1.4e+02 4.0e+00 1.0e+02 0 0 6 0 16 0 0 6 0 17 0 -BuildTwoSidedF 86 1.0 7.6239e-02 5.3 0.00e+00 0.0 1.2e+02 3.6e+04 8.6e+01 0 0 5 26 14 0 0 5 26 14 0 -MatMult 217 1.0 2.3669e-01 1.0 1.22e+08 1.1 1.3e+03 5.3e+03 2.0e+00 2 55 55 42 0 2 55 55 42 0 1481 -MatMultAdd 44 1.0 7.1042e-02 1.4 1.03e+07 1.1 1.8e+02 1.4e+03 0.0e+00 1 5 7 1 0 1 5 7 1 0 416 -MatMultTranspose 44 1.0 3.8571e-02 1.1 1.03e+07 1.1 1.9e+02 1.3e+03 2.0e+00 0 5 8 1 0 0 5 8 1 0 767 -MatSolve 22 0.0 2.5019e-03 0.0 2.44e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 974 -MatLUFactorSym 1 1.0 3.8870e-03 133.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatLUFactorNum 1 1.0 1.2513e-02 2910.1 8.33e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 666 -MatConvert 1 1.0 5.1718e-04 1.1 0.00e+00 0.0 1.2e+01 7.0e+02 1.0e+00 0 0 1 0 0 0 0 1 0 0 0 -MatScale 4 1.0 1.3813e-03 1.0 4.66e+05 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 973 -MatResidual 44 1.0 4.3921e-02 1.0 2.37e+07 1.1 2.6e+02 5.1e+03 0.0e+00 0 11 11 8 0 0 11 11 8 0 1543 -MatAssemblyBegin 59 1.0 9.7988e-02 2.7 0.00e+00 0.0 1.2e+02 3.6e+04 3.0e+01 1 0 5 26 5 1 0 5 26 5 0 -MatAssemblyEnd 59 1.0 5.4209e-02 1.1 8.59e+04 1.8 0.0e+00 0.0e+00 9.6e+01 1 0 0 0 15 1 0 0 0 16 4 -MatGetRowIJ 1 0.0 1.0168e-04 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatCreateSubMat 2 1.0 1.3713e-02 1.0 0.00e+00 0.0 1.0e+01 4.4e+04 2.8e+01 0 0 0 3 4 0 0 0 3 5 0 -MatGetOrdering 1 0.0 2.3332e-04 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatCoarsen 2 1.0 1.9207e-02 1.0 1.39e+05 1.0 2.4e+02 7.2e+02 9.7e+01 0 0 10 1 15 0 0 10 1 16 21 -MatZeroEntries 3 1.0 3.4394e-04 2.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatView 6 1.5 1.2966e-03 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 1 0 0 0 0 1 0 -MatAXPY 4 1.0 2.1333e-02 1.0 1.15e+05 1.1 0.0e+00 0.0e+00 1.2e+01 0 0 0 0 2 0 0 0 0 2 16 -MatTranspose 10 1.0 1.3045e-02 1.1 0.00e+00 0.0 9.0e+01 2.1e+03 3.2e+01 0 0 4 1 5 0 0 4 1 5 0 -MatMatMultSym 9 1.0 9.1250e-02 1.1 0.00e+00 0.0 5.1e+01 1.1e+04 2.8e+01 1 0 2 3 4 1 0 2 3 5 0 -MatMatMultNum 3 1.0 2.7163e-02 1.0 6.46e+06 1.1 1.5e+01 2.5e+04 3.0e+00 0 3 1 2 0 0 3 1 2 0 681 -MatPtAPSymbolic 3 1.0 2.2247e-01 1.0 0.00e+00 0.0 9.9e+01 3.5e+04 2.1e+01 2 0 4 20 3 2 0 4 20 3 0 -MatPtAPNumeric 3 1.0 3.2998e-01 1.0 5.73e+07 1.2 3.0e+01 9.4e+04 1.5e+01 3 25 1 17 2 3 25 1 17 2 478 -MatGetLocalMat 9 1.0 4.5703e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatGetBrAoCol 9 1.0 7.3491e-03 1.1 0.00e+00 0.0 1.2e+02 3.1e+04 1.0e+00 0 0 5 22 0 0 0 5 22 0 0 -VecMDot 20 1.0 3.0289e-03 1.4 1.19e+06 1.1 0.0e+00 0.0e+00 2.0e+01 0 1 0 0 3 0 1 0 0 3 1154 -VecTDot 42 1.0 3.5238e-03 1.9 8.55e+05 1.1 0.0e+00 0.0e+00 4.2e+01 0 0 0 0 7 0 0 0 0 7 715 -VecNorm 44 1.0 2.8868e-03 1.6 6.85e+05 1.1 0.0e+00 0.0e+00 4.4e+01 0 0 0 0 7 0 0 0 0 7 699 -VecScale 22 1.0 1.5614e-04 1.1 1.19e+05 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2238 -VecCopy 136 1.0 8.3198e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -VecSet 203 1.0 4.4330e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -VecAXPY 44 1.0 1.4343e-03 1.1 8.76e+05 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1801 -VecAYPX 284 1.0 5.3976e-03 1.1 2.31e+06 1.1 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 1258 -VecAXPBYCZ 88 1.0 2.5683e-03 1.2 2.38e+06 1.1 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 2721 -VecMAXPY 22 1.0 1.4954e-03 1.1 1.41e+06 1.1 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 2762 -VecAssemblyBegin 57 1.0 2.1246e-03 2.2 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01 0 0 0 0 9 0 0 0 0 9 0 -VecAssemblyEnd 57 1.0 2.0324e-04 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -VecPointwiseMult 198 1.0 4.2336e-03 1.3 1.07e+06 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 743 -VecScatterBegin 363 1.0 7.5861e-03 1.0 0.00e+00 0.0 1.9e+03 4.1e+03 8.0e+00 0 0 79 45 1 0 0 79 45 1 0 -VecScatterEnd 363 1.0 4.9170e-02 1.7 1.29e+04 1.6 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1 -VecNormalize 22 1.0 1.0803e-03 1.4 3.58e+05 1.1 0.0e+00 0.0e+00 2.2e+01 0 0 0 0 3 0 0 0 0 4 970 -SFSetGraph 35 1.0 2.9650e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFSetUp 17 1.0 1.5977e-03 1.1 0.00e+00 0.0 1.6e+02 4.8e+02 1.7e+01 0 0 7 0 3 0 0 7 0 3 0 -SFBcastBegin 15 1.0 1.7604e-04 1.1 0.00e+00 0.0 9.0e+01 6.2e+02 0.0e+00 0 0 4 0 0 0 0 4 0 0 0 -SFBcastEnd 15 1.0 1.7904e-04 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFReduceBegin 4 1.0 1.2618e-04 1.4 0.00e+00 0.0 1.8e+01 5.3e+02 0.0e+00 0 0 1 0 0 0 0 1 0 0 0 -SFReduceEnd 4 1.0 5.9220e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFPack 382 1.0 2.4636e-03 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFUnpack 382 1.0 5.8996e-04 1.2 1.29e+04 1.6 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 51 -KSPSetUp 7 1.0 3.3690e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 1 0 0 0 0 1 0 -KSPSolve 1 1.0 3.4545e-01 1.0 1.43e+08 1.1 1.6e+03 4.5e+03 7.0e+01 4 65 65 41 11 4 65 65 41 11 1196 -KSPGMRESOrthog 20 1.0 4.4265e-03 1.3 2.38e+06 1.1 0.0e+00 0.0e+00 2.0e+01 0 1 0 0 3 0 1 0 0 3 1579 -PCSetUp_GAMG+ 1 1.0 7.4847e-01 1.0 7.81e+07 1.2 8.0e+02 1.1e+04 5.0e+02 8 34 33 52 79 8 34 33 52 81 291 - PCGAMGCreateG 2 1.0 3.5079e-02 1.0 5.02e+04 1.0 6.0e+01 2.9e+03 4.4e+01 0 0 3 1 7 0 0 3 1 7 4 - GAMG Coarsen 2 1.0 2.0588e-02 1.0 1.39e+05 1.0 2.4e+02 7.2e+02 9.7e+01 0 0 10 1 15 0 0 10 1 16 20 - GAMG MIS/Agg 2 1.0 1.9246e-02 1.0 1.39e+05 1.0 2.4e+02 7.2e+02 9.7e+01 0 0 10 1 15 0 0 10 1 16 21 - PCGAMGProl 2 1.0 2.5510e-02 1.0 0.00e+00 0.0 1.9e+02 1.7e+03 1.9e+02 0 0 8 2 30 0 0 8 2 30 0 - GAMG Prol-col 2 1.0 1.2606e-02 1.0 0.00e+00 0.0 1.8e+02 6.3e+02 1.7e+02 0 0 8 1 27 0 0 8 1 27 0 - GAMG Prol-lift 2 1.0 1.2482e-02 1.0 0.00e+00 0.0 1.2e+01 1.7e+04 1.0e+01 0 0 1 1 2 0 0 1 1 2 0 - PCGAMGOptProl 2 1.0 1.0828e-01 1.0 2.07e+07 1.1 1.9e+02 8.3e+03 8.2e+01 1 9 8 9 13 1 9 8 9 13 550 - GAMG smooth 2 1.0 7.6994e-02 1.0 6.98e+06 1.1 4.8e+01 2.0e+04 2.6e+01 1 3 2 6 4 1 3 2 6 4 260 - PCGAMGCreateL 2 1.0 5.5737e-01 1.0 5.72e+07 1.2 1.2e+02 5.5e+04 7.7e+01 6 25 5 39 12 6 25 5 39 12 282 - GAMG PtAP 2 1.0 5.4250e-01 1.0 5.72e+07 1.2 9.6e+01 6.5e+04 2.4e+01 6 25 4 36 4 6 25 4 36 4 290 - GAMG Reduce 1 1.0 1.5221e-02 1.0 0.00e+00 0.0 2.5e+01 1.9e+04 5.3e+01 0 0 1 3 8 0 0 1 3 9 0 -PCGAMG Gal l00 1 1.0 3.7521e-01 1.0 3.49e+07 1.0 4.8e+01 8.3e+04 1.2e+01 4 16 2 23 2 4 16 2 23 2 275 -PCGAMG Opt l00 1 1.0 5.1880e-02 1.0 5.08e+06 1.0 2.4e+01 3.2e+04 1.0e+01 1 2 1 5 2 1 2 1 5 2 289 -PCGAMG Gal l01 1 1.0 1.6728e-01 1.0 2.23e+07 1.6 4.8e+01 4.6e+04 1.2e+01 2 8 2 13 2 2 8 2 13 2 323 -PCGAMG Opt l01 1 1.0 1.1111e-02 1.0 1.37e+06 1.4 2.4e+01 7.3e+03 1.0e+01 0 1 1 1 2 0 1 1 1 2 312 -PCSetUp 2 1.0 7.6523e-01 1.0 7.81e+07 1.1 8.0e+02 1.1e+04 5.0e+02 8 35 33 52 79 8 35 33 52 81 295 -PCSetUpOnBlocks 22 1.0 1.6862e-02 60.2 8.33e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 494 -PCApply 22 1.0 3.0533e-01 1.0 1.23e+08 1.1 1.4e+03 4.1e+03 6.0e+00 3 56 59 34 1 3 56 59 34 1 1158 +BuildTwoSided 103 1.0 1.4327e-01 10.8 0.00e+00 0.0 1.4e+02 4.0e+00 1.0e+02 1 0 6 0 16 1 0 6 0 17 0 +BuildTwoSidedF 86 1.0 1.4295e-01 11.2 0.00e+00 0.0 1.2e+02 3.6e+04 8.6e+01 1 0 5 26 14 1 0 5 26 14 0 +MatMult 208 1.0 2.4265e-01 1.0 1.19e+08 1.1 1.3e+03 5.5e+03 2.0e+00 2 54 55 41 0 2 54 55 41 0 1408 +MatMultAdd 42 1.0 6.9671e-02 1.3 1.00e+07 1.1 1.7e+02 1.4e+03 0.0e+00 1 5 7 1 0 1 5 7 1 0 416 +MatMultTranspose 42 1.0 3.6710e-02 1.1 1.00e+07 1.1 1.8e+02 1.3e+03 2.0e+00 0 5 8 1 0 0 5 8 1 0 790 +MatSolve 21 0.0 2.2414e-03 0.0 2.10e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 937 +MatLUFactorSym 1 1.0 3.3656e-03 108.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatLUFactorNum 1 1.0 1.2278e-02 2803.2 6.97e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 567 +MatConvert 1 1.0 5.7175e-04 1.1 0.00e+00 0.0 1.2e+01 7.1e+02 1.0e+00 0 0 1 0 0 0 0 1 0 0 0 +MatScale 4 1.0 1.5704e-03 1.1 4.77e+05 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 879 +MatResidual 42 1.0 4.5933e-02 1.0 2.31e+07 1.1 2.5e+02 5.2e+03 0.0e+00 0 10 11 8 0 0 10 11 8 0 1436 +MatAssemblyBegin 59 1.0 1.6479e-01 4.5 0.00e+00 0.0 1.2e+02 3.6e+04 3.0e+01 1 0 5 26 5 1 0 5 26 5 0 +MatAssemblyEnd 59 1.0 5.8654e-02 1.1 8.74e+04 2.1 0.0e+00 0.0e+00 9.6e+01 1 0 0 0 15 1 0 0 0 16 3 +MatGetRowIJ 1 0.0 9.7282e-05 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatCreateSubMat 2 1.0 1.3469e-02 1.0 0.00e+00 0.0 1.0e+01 4.3e+04 2.8e+01 0 0 0 3 4 0 0 0 3 5 0 +MatGetOrdering 1 0.0 2.3862e-04 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatCoarsen 2 1.0 2.0086e-02 1.0 1.37e+05 1.0 2.3e+02 7.5e+02 9.6e+01 0 0 10 1 15 0 0 10 1 16 20 +MatZeroEntries 3 1.0 9.2143e-05 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatView 6 1.5 1.4738e-03 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 1 0 0 0 0 1 0 +MatAXPY 4 1.0 2.2462e-02 1.0 1.15e+05 1.0 0.0e+00 0.0e+00 1.2e+01 0 0 0 0 2 0 0 0 0 2 15 +MatTranspose 10 1.0 1.3993e-02 1.1 0.00e+00 0.0 9.0e+01 2.1e+03 3.2e+01 0 0 4 1 5 0 0 4 1 5 0 +MatMatMultSym 9 1.0 9.6974e-02 1.2 0.00e+00 0.0 5.1e+01 1.2e+04 2.8e+01 1 0 2 4 4 1 0 2 4 5 0 +MatMatMultNum 3 1.0 2.9455e-02 1.0 6.61e+06 1.1 1.5e+01 2.5e+04 3.0e+00 0 3 1 2 0 0 3 1 2 0 640 +MatPtAPSymbolic 3 1.0 2.3689e-01 1.0 0.00e+00 0.0 9.9e+01 3.5e+04 2.1e+01 2 0 4 20 3 2 0 4 20 3 0 +MatPtAPNumeric 3 1.0 3.7086e-01 1.0 5.93e+07 1.3 3.0e+01 9.0e+04 1.5e+01 4 26 1 16 2 4 26 1 16 2 439 +MatGetLocalMat 9 1.0 4.8428e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatGetBrAoCol 9 1.0 7.5846e-03 1.1 0.00e+00 0.0 1.2e+02 3.1e+04 1.0e+00 0 0 5 22 0 0 0 5 22 0 0 +VecMDot 20 1.0 3.5426e-03 1.5 1.18e+06 1.0 0.0e+00 0.0e+00 2.0e+01 0 1 0 0 3 0 1 0 0 3 990 +VecTDot 40 1.0 3.0254e-03 1.6 8.12e+05 1.0 0.0e+00 0.0e+00 4.0e+01 0 0 0 0 6 0 0 0 0 7 793 +VecNorm 43 1.0 2.6494e-03 1.7 6.61e+05 1.0 0.0e+00 0.0e+00 4.3e+01 0 0 0 0 7 0 0 0 0 7 740 +VecScale 22 1.0 1.6084e-04 1.0 1.18e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2181 +VecCopy 130 1.0 9.2867e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +VecSet 194 1.0 4.6031e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +VecAXPY 42 1.0 1.5549e-03 1.3 8.33e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1584 +VecAYPX 271 1.0 6.2491e-03 1.3 2.19e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 1040 +VecAXPBYCZ 84 1.0 2.6302e-03 1.1 2.26e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 2547 +VecMAXPY 22 1.0 1.5247e-03 1.0 1.40e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 2720 +VecAssemblyBegin 57 1.0 1.2663e-03 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01 0 0 0 0 9 0 0 0 0 9 0 +VecAssemblyEnd 57 1.0 1.6199e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +VecPointwiseMult 190 1.0 3.8557e-03 1.2 1.02e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 786 +VecScatterBegin 350 1.0 8.1265e-03 1.1 0.00e+00 0.0 1.8e+03 4.2e+03 8.0e+00 0 0 79 45 1 0 0 79 45 1 0 +VecScatterEnd 350 1.0 4.4267e-02 1.4 1.26e+04 1.6 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1 +VecNormalize 22 1.0 1.0111e-03 1.2 3.55e+05 1.0 0.0e+00 0.0e+00 2.2e+01 0 0 0 0 3 0 0 0 0 4 1041 +SFSetGraph 35 1.0 3.0163e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFSetUp 17 1.0 1.8015e-03 1.2 0.00e+00 0.0 1.6e+02 5.0e+02 1.7e+01 0 0 7 0 3 0 0 7 0 3 0 +SFBcastBegin 14 1.0 1.7816e-04 1.1 0.00e+00 0.0 8.4e+01 6.6e+02 0.0e+00 0 0 4 0 0 0 0 4 0 0 0 +SFBcastEnd 14 1.0 1.6712e-04 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFReduceBegin 4 1.0 1.1556e-04 1.2 0.00e+00 0.0 1.8e+01 5.5e+02 0.0e+00 0 0 1 0 0 0 0 1 0 0 0 +SFReduceEnd 4 1.0 5.7000e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFPack 368 1.0 2.7213e-03 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFUnpack 368 1.0 5.5617e-04 1.2 1.26e+04 1.6 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 54 +KSPSetUp 7 1.0 3.2838e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 1 0 0 0 0 1 0 +KSPSolve 1 1.0 3.4775e-01 1.0 1.35e+08 1.0 1.5e+03 4.6e+03 6.7e+01 3 64 64 40 11 3 64 64 40 11 1149 +KSPGMRESOrthog 20 1.0 4.9460e-03 1.3 2.37e+06 1.0 0.0e+00 0.0e+00 2.0e+01 0 1 0 0 3 0 1 0 0 3 1419 +PCSetUp_GAMG+ 1 1.0 8.1346e-01 1.0 8.05e+07 1.2 8.0e+02 1.1e+04 5.0e+02 8 36 34 53 79 8 36 34 53 81 275 + PCGAMGCreateG 2 1.0 3.6358e-02 1.0 5.01e+04 1.0 6.0e+01 2.9e+03 4.4e+01 0 0 3 1 7 0 0 3 1 7 4 + GAMG Coarsen 2 1.0 2.1448e-02 1.0 1.37e+05 1.0 2.3e+02 7.5e+02 9.6e+01 0 0 10 1 15 0 0 10 1 16 19 + GAMG MIS/Agg 2 1.0 2.0128e-02 1.0 1.37e+05 1.0 2.3e+02 7.5e+02 9.6e+01 0 0 10 1 15 0 0 10 1 16 20 + PCGAMGProl 2 1.0 2.5731e-02 1.0 0.00e+00 0.0 1.9e+02 2.0e+03 1.9e+02 0 0 8 2 30 0 0 8 2 31 0 + GAMG Prol-col 2 1.0 1.1308e-02 1.0 0.00e+00 0.0 1.8e+02 7.3e+02 1.7e+02 0 0 8 1 27 0 0 8 1 28 0 + GAMG Prol-lift 2 1.0 1.4024e-02 1.0 0.00e+00 0.0 1.2e+01 2.0e+04 1.0e+01 0 0 1 1 2 0 0 1 1 2 0 + PCGAMGOptProl 2 1.0 1.1708e-01 1.0 2.11e+07 1.1 1.9e+02 8.5e+03 8.2e+01 1 10 8 10 13 1 10 8 10 13 518 + GAMG smooth 2 1.0 8.2887e-02 1.0 7.14e+06 1.1 4.8e+01 2.0e+04 2.6e+01 1 3 2 6 4 1 3 2 6 4 246 + PCGAMGCreateL 2 1.0 6.1113e-01 1.0 5.92e+07 1.3 1.2e+02 5.4e+04 7.7e+01 6 26 5 39 12 6 26 5 39 13 266 + GAMG PtAP 2 1.0 5.9645e-01 1.0 5.92e+07 1.3 9.6e+01 6.3e+04 2.4e+01 6 26 4 36 4 6 26 4 36 4 272 + GAMG Reduce 1 1.0 1.4871e-02 1.0 0.00e+00 0.0 2.5e+01 1.8e+04 5.3e+01 0 0 1 3 8 0 0 1 3 9 0 +PCGAMG Gal l00 1 1.0 4.2670e-01 1.0 3.76e+07 1.1 4.8e+01 8.5e+04 1.2e+01 4 17 2 24 2 4 17 2 24 2 253 +PCGAMG Opt l00 1 1.0 5.5185e-02 1.0 5.03e+06 1.0 2.4e+01 3.3e+04 1.0e+01 1 2 1 5 2 1 2 1 5 2 272 +PCGAMG Gal l01 1 1.0 1.6975e-01 1.0 2.25e+07 2.0 4.8e+01 4.1e+04 1.2e+01 2 9 2 12 2 2 9 2 12 2 320 +PCGAMG Opt l01 1 1.0 1.2477e-02 1.0 1.60e+06 1.8 2.4e+01 7.7e+03 1.0e+01 0 1 1 1 2 0 1 1 1 2 306 +PCSetUp 2 1.0 8.2947e-01 1.0 8.05e+07 1.1 8.0e+02 1.1e+04 5.0e+02 8 37 34 53 79 8 37 34 53 81 278 +PCSetUpOnBlocks 21 1.0 1.6117e-02 56.8 6.97e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 432 +PCApply 21 1.0 3.0842e-01 1.0 1.17e+08 1.0 1.4e+03 4.2e+03 6.0e+00 3 55 59 34 1 3 55 59 34 1 1112 --- Event Stage 1: Unknown @@ -2130,9 +2148,9 @@ --- Event Stage 1: Unknown ======================================================================================================================== -Average time to get PetscTime(): 8e-08 -Average time for MPI_Barrier(): 3.168e-06 -Average time for zero size MPI_Send(): 2.26e-06 +Average time to get PetscTime(): 7.81e-08 +Average time for MPI_Barrier(): 3.272e-06 +Average time for zero size MPI_Send(): 2.08e-06 #PETSc Option Table entries: --ndofs 10000 --problem_type elasticity @@ -2195,63 +2213,63 @@ Option left: name:--problem_type value: elasticity Option left: name:--scaling_type value: weak Option left: name:-mg_levels_esteig_ksp_type value: cg -2024-05-13 18:10:05.825 ( 9.792s) [RANK: 0 ] loguru.cpp:526 INFO| atexit +2023-04-12 13:51:54.135 ( 10.492s) [RANK: 0 ] loguru.cpp:526 INFO| atexit ==== elasticity strong scaling (3 processes with ndofs=30000) ==== -2024-05-13 18:10:06.112 ( 0.062s) [RANK: 0 ] utils.cpp:612 INFO| Compute partition of cells across ranks -2024-05-13 18:10:06.112 ( 0.063s) [RANK: 0 ] graphbuild.cpp:533 INFO| Building mesh dual graph -2024-05-13 18:10:06.112 ( 0.063s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph -2024-05-13 18:10:06.146 ( 0.097s) [RANK: 0 ] graphbuild.cpp:89 INFO| Build nonlocal part of mesh dual graph -2024-05-13 18:10:06.147 ( 0.098s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:06.147 ( 0.098s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:06.147 ( 0.098s) [RANK: 0 ] graphbuild.cpp:203 INFO| Number of destination and source ranks in non-local dual graph construction, and ratio to total number of ranks: 2, 2, 0.666667, 0.666667 -2024-05-13 18:10:06.150 ( 0.101s) [RANK: 0 ] graphbuild.cpp:545 INFO| Graph edges (local: 66332, non-local: 962) -2024-05-13 18:10:06.150 ( 0.101s) [RANK: 0 ] partitioners.cpp:316 INFO| Compute graph partition using PT-SCOTCH -2024-05-13 18:10:16.215 ( 10.166s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 -2024-05-13 18:10:16.215 ( 10.166s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 -2024-05-13 18:10:16.218 ( 10.169s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph -2024-05-13 18:10:16.247 ( 10.198s) [RANK: 0 ] ordering.cpp:202 INFO| GPS pseudo-diameter:(120) 1173-16590 - -2024-05-13 18:10:16.252 ( 10.202s) [RANK: 0 ] Topology.cpp:923 INFO| Create topology -2024-05-13 18:10:16.262 ( 10.213s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 -2024-05-13 18:10:16.262 ( 10.213s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 -2024-05-13 18:10:16.275 ( 10.226s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:16.276 ( 10.227s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:16.276 ( 10.227s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:16.276 ( 10.227s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:16.279 ( 10.230s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:16.279 ( 10.230s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:16.280 ( 10.230s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:16.280 ( 10.230s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:16.280 ( 10.231s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:16.280 ( 10.231s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:16.283 ( 10.234s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:16.283 ( 10.234s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:16.283 ( 10.234s) [RANK: 0 ] MPI.h:368 INFO| Number of neighbourhood source ranks in distribute_to_postoffice: 0 -2024-05-13 18:10:16.284 ( 10.235s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:16.284 ( 10.235s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:16.284 ( 10.235s) [RANK: 0 ] MPI.h:500 INFO| Neighbourhood destination ranks from post office in distribute_data (rank, num dests, num dests/mpi_size): 0, 2, 0.666667 +2023-04-12 13:51:54.424 ( 0.061s) [RANK: 0 ] utils.cpp:612 INFO| Compute partition of cells across ranks +2023-04-12 13:51:54.424 ( 0.062s) [RANK: 0 ] graphbuild.cpp:533 INFO| Building mesh dual graph +2023-04-12 13:51:54.424 ( 0.062s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph +2023-04-12 13:51:54.458 ( 0.096s) [RANK: 0 ] graphbuild.cpp:89 INFO| Build nonlocal part of mesh dual graph +2023-04-12 13:51:54.458 ( 0.096s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:51:54.458 ( 0.096s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:51:54.458 ( 0.096s) [RANK: 0 ] graphbuild.cpp:203 INFO| Number of destination and source ranks in non-local dual graph construction, and ratio to total number of ranks: 2, 2, 0.666667, 0.666667 +2023-04-12 13:51:54.462 ( 0.100s) [RANK: 0 ] graphbuild.cpp:545 INFO| Graph edges (local: 66332, non-local: 962) +2023-04-12 13:51:54.462 ( 0.100s) [RANK: 0 ] partitioners.cpp:316 INFO| Compute graph partition using PT-SCOTCH +2023-04-12 13:52:03.078 ( 8.716s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 +2023-04-12 13:52:03.080 ( 8.718s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 +2023-04-12 13:52:03.084 ( 8.721s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph +2023-04-12 13:52:03.113 ( 8.751s) [RANK: 0 ] ordering.cpp:202 INFO| GPS pseudo-diameter:(122) 16794-1230 + +2023-04-12 13:52:03.117 ( 8.755s) [RANK: 0 ] Topology.cpp:923 INFO| Create topology +2023-04-12 13:52:03.128 ( 8.766s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 +2023-04-12 13:52:03.128 ( 8.766s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 +2023-04-12 13:52:03.142 ( 8.780s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:03.142 ( 8.780s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:03.143 ( 8.780s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:03.143 ( 8.780s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:03.146 ( 8.783s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:03.146 ( 8.784s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:03.146 ( 8.784s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:52:03.146 ( 8.784s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:52:03.146 ( 8.784s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:03.146 ( 8.784s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:03.150 ( 8.787s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:52:03.150 ( 8.787s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:52:03.150 ( 8.788s) [RANK: 0 ] MPI.h:368 INFO| Number of neighbourhood source ranks in distribute_to_postoffice: 0 +2023-04-12 13:52:03.150 ( 8.788s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:03.150 ( 8.788s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:03.150 ( 8.788s) [RANK: 0 ] MPI.h:500 INFO| Neighbourhood destination ranks from post office in distribute_data (rank, num dests, num dests/mpi_size): 0, 2, 0.666667 UnitCube (24x19x19) to be refined 0 times -2024-05-13 18:10:16.286 ( 10.237s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 2 -2024-05-13 18:10:16.299 ( 10.249s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:16.299 ( 10.250s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:16.346 ( 10.297s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:16.346 ( 10.297s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:16.348 ( 10.299s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 -2024-05-13 18:10:16.348 ( 10.299s) [RANK: 0 ]topologycomputation.cpp:650 INFO| Computing mesh connectivity 2 - 3 from transpose. -2024-05-13 18:10:16.351 ( 10.302s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 0 -2024-05-13 18:10:16.354 ( 10.304s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:16.354 ( 10.305s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:16.354 ( 10.305s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:16.354 ( 10.305s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:16.354 ( 10.305s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:16.354 ( 10.305s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:16.355 ( 10.306s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:16.355 ( 10.306s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:16.356 ( 10.307s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 -2024-05-13 18:10:16.356 ( 10.307s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 -2024-05-13 18:10:16.357 ( 10.307s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 0 -2024-05-13 18:10:16.357 ( 10.308s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 -2024-05-13 18:10:16.390 ( 10.341s) [RANK: 0 ] SparsityPattern.cpp:389 INFO| Column ghost size increased from 228 to 554 +2023-04-12 13:52:03.152 ( 8.790s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 2 +2023-04-12 13:52:03.165 ( 8.803s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:03.165 ( 8.803s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:03.217 ( 8.855s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:03.218 ( 8.855s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:03.219 ( 8.857s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 +2023-04-12 13:52:03.219 ( 8.857s) [RANK: 0 ]topologycomputation.cpp:650 INFO| Computing mesh connectivity 2 - 3 from transpose. +2023-04-12 13:52:03.222 ( 8.860s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 0 +2023-04-12 13:52:03.225 ( 8.863s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:03.225 ( 8.863s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:03.225 ( 8.863s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:03.225 ( 8.863s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:03.225 ( 8.863s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:52:03.225 ( 8.863s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:52:03.226 ( 8.864s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:03.226 ( 8.864s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:03.227 ( 8.865s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 +2023-04-12 13:52:03.228 ( 8.865s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 +2023-04-12 13:52:03.228 ( 8.866s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 0 +2023-04-12 13:52:03.228 ( 8.866s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 +2023-04-12 13:52:03.261 ( 8.899s) [RANK: 0 ] SparsityPattern.cpp:389 INFO| Column ghost size increased from 213 to 559 ---------------------------------------------------------------- Test problem summary @@ -2266,7 +2284,7 @@ Total degrees of freedom: 30000 (30 thousand) Average degrees of freedom per process: 10000 ---------------------------------------------------------------- -2024-05-13 18:10:16.624 ( 10.575s) [RANK: 0 ] petsc.cpp:677 INFO| PETSc Krylov solver starting to solve system. +2023-04-12 13:52:03.505 ( 9.143s) [RANK: 0 ] petsc.cpp:677 INFO| PETSc Krylov solver starting to solve system. KSP Object: 3 MPI processes type: cg maximum iterations=10000, initial guess is zero @@ -2284,7 +2302,7 @@ AGG specific options Number of levels to square graph 1 Number smoothing steps 1 - Complexity: grid = 1.0708 operator = 1.29697 + Complexity: grid = 1.0654 operator = 1.26202 Coarse grid solver -- level 0 ------------------------------- KSP Object: (mg_coarse_) 3 MPI processes type: preonly @@ -2309,33 +2327,33 @@ tolerance for zero pivot 2.22045e-14 using diagonal shift on blocks to prevent zero pivot [INBLOCKS] matrix ordering: nd - factor fill ratio given 5., needed 1.04467 + factor fill ratio given 5., needed 1.11617 Factored matrix follows: Mat Object: (mg_coarse_sub_) 1 MPI process type: seqaij - rows=246, cols=246, bs=6 + rows=228, cols=228, bs=6 package used to perform factorization: petsc - total: nonzeros=58932, allocated nonzeros=58932 - using I-node routines: found 57 nodes, limit used is 5 + total: nonzeros=51192, allocated nonzeros=51192 + using I-node routines: found 49 nodes, limit used is 5 linear system matrix = precond matrix: Mat Object: (mg_coarse_sub_) 1 MPI process type: seqaij - rows=246, cols=246, bs=6 - total: nonzeros=56412, allocated nonzeros=56412 + rows=228, cols=228, bs=6 + total: nonzeros=45864, allocated nonzeros=45864 total number of mallocs used during MatSetValues calls=0 - using I-node routines: found 75 nodes, limit used is 5 + using I-node routines: found 66 nodes, limit used is 5 linear system matrix = precond matrix: Mat Object: 3 MPI processes type: mpiaij - rows=246, cols=246, bs=6 - total: nonzeros=56412, allocated nonzeros=56412 + rows=228, cols=228, bs=6 + total: nonzeros=45864, allocated nonzeros=45864 total number of mallocs used during MatSetValues calls=0 - using I-node (on process 0) routines: found 75 nodes, limit used is 5 + using I-node (on process 0) routines: found 66 nodes, limit used is 5 Down solver (pre-smoother) on level 1 ------------------------------- KSP Object: (mg_levels_1_) 3 MPI processes type: chebyshev - eigenvalue targets used: min 0.297588, max 3.27346 - eigenvalues provided (min 1.27865e-06, max 2.97588) with transform: [0. 0.1; 0. 1.1] + eigenvalue targets used: min 0.276448, max 3.04093 + eigenvalues provided (min 1.41432e-06, max 2.76448) with transform: [0. 0.1; 0. 1.1] maximum iterations=2, nonzero initial guess tolerances: relative=1e-05, absolute=1e-50, divergence=10000. left preconditioning @@ -2346,17 +2364,17 @@ linear system matrix = precond matrix: Mat Object: 3 MPI processes type: mpiaij - rows=1878, cols=1878, bs=6 - total: nonzeros=315252, allocated nonzeros=315252 + rows=1734, cols=1734, bs=6 + total: nonzeros=282060, allocated nonzeros=282060 total number of mallocs used during MatSetValues calls=0 using scalable MatPtAP() implementation - using I-node (on process 0) routines: found 196 nodes, limit used is 5 + using I-node (on process 0) routines: found 174 nodes, limit used is 5 Up solver (post-smoother) same as down solver (pre-smoother) Down solver (pre-smoother) on level 2 ------------------------------- KSP Object: (mg_levels_2_) 3 MPI processes type: chebyshev - eigenvalue targets used: min 0.258174, max 2.83992 - eigenvalues provided (min 0.0708281, max 2.58174) with transform: [0. 0.1; 0. 1.1] + eigenvalue targets used: min 0.258313, max 2.84145 + eigenvalues provided (min 0.0745455, max 2.58313) with transform: [0. 0.1; 0. 1.1] maximum iterations=2, nonzero initial guess tolerances: relative=1e-05, absolute=1e-50, divergence=10000. left preconditioning @@ -2371,7 +2389,7 @@ total: nonzeros=1251522, allocated nonzeros=1251522 total number of mallocs used during MatSetValues calls=0 has attached near null space - using I-node (on process 0) routines: found 3342 nodes, limit used is 5 + using I-node (on process 0) routines: found 3398 nodes, limit used is 5 Up solver (post-smoother) same as down solver (pre-smoother) linear system matrix = precond matrix: Mat Object: 3 MPI processes @@ -2380,50 +2398,50 @@ total: nonzeros=1251522, allocated nonzeros=1251522 total number of mallocs used during MatSetValues calls=0 has attached near null space - using I-node (on process 0) routines: found 3342 nodes, limit used is 5 + using I-node (on process 0) routines: found 3398 nodes, limit used is 5 -[MPI_MAX] Summary of timings | reps wall avg wall tot ---------------------------------------------------------------------------------------------------------- -Build BoxMesh | 1 10.175915 10.175915 -Build dofmap data | 2 0.004197 0.008395 -Build sparsity | 1 0.005509 0.005509 -Compute connectivity 2-3 | 1 0.001123 0.001123 -Compute dof reordering map | 2 0.000384 0.000768 -Compute entities of dim = 2 | 1 0.062313 0.062313 -Compute graph partition (SCOTCH) | 1 10.062759 10.062759 -Compute local part of mesh dual graph | 2 0.032568 0.065136 -Compute local-to-local map | 1 0.000511 0.000511 -Compute non-local part of mesh dual graph | 1 0.006632 0.006632 -Compute-local-to-global links for global/local adjacency list | 1 0.000605 0.000605 -Distribute AdjacencyList nodes to destination ranks | 1 0.009596 0.009596 -Distribute row-wise data (scalable) | 1 0.001162 0.001162 -Extract partition boundaries from SCOTCH graph | 1 0.001185 0.001185 -GPS: create_level_structure | 2 0.000767 0.004601 -Get SCOTCH graph data | 1 0.000014 0.000014 -Gibbs-Poole-Stockmeyer ordering | 1 0.007822 0.007822 -Init MPI | 1 0.059040 0.059040 -Init PETSc | 1 0.001569 0.001569 -Init dofmap from element dofmap | 2 0.002268 0.004535 -Init logging | 1 0.000052 0.000052 -PETSc Krylov solver | 1 1.163372 1.163372 -SCOTCH: call SCOTCH_dgraphBuild | 1 0.000315 0.000315 -SCOTCH: call SCOTCH_dgraphHalo | 1 0.051666 0.051666 -SCOTCH: call SCOTCH_dgraphPart | 1 10.023305 10.023305 -SparsityPattern::assemble | 1 0.012501 0.012501 -Topology: create | 1 0.025035 0.025035 -Topology: determine shared index ownership | 1 0.008398 0.008398 -Topology: determine vertex ownership groups (owned, undetermined, unowned) | 1 0.002533 0.002533 -ZZZ Assemble matrix | 1 0.211307 0.211307 -ZZZ Assemble vector | 1 0.008720 0.008720 -ZZZ Create Mesh | 1 10.176152 10.176152 -ZZZ Create RHS function | 1 0.015866 0.015866 -ZZZ Create boundary conditions | 1 0.002185 0.002185 -ZZZ Create facets and facet->cell connectivity | 1 0.063513 0.063513 -ZZZ Create forms | 1 0.000080 0.000080 -ZZZ Create near-nullspace | 1 0.008010 0.008010 -ZZZ FunctionSpace | 1 0.005984 0.005984 -ZZZ Solve | 1 1.167266 1.167266 -*** Number of Krylov iterations: 22 +[MPI_MAX] Summary of timings | reps wall avg wall tot +------------------------------------------------------------------------------------------------------- +Build BoxMesh | 1 8.730099 8.730099 +Build dofmap data | 2 0.004186 0.008373 +Build sparsity | 1 0.005382 0.005382 +Compute connectivity 2-3 | 1 0.001168 0.001168 +Compute dof reordering map | 2 0.000385 0.000770 +Compute entities of dim = 2 | 1 0.067417 0.067417 +Compute graph partition (SCOTCH) | 1 8.608924 8.608924 +Compute local part of mesh dual graph | 2 0.032853 0.065706 +Compute local-to-local map | 1 0.000511 0.000511 +Compute non-local part of mesh dual graph | 1 0.007390 0.007390 +Compute-local-to-global links for global/local adjacency list | 1 0.000599 0.000599 +Distribute AdjacencyList nodes to destination ranks | 1 0.020607 0.020607 +Distribute row-wise data (scalable) | 1 0.001136 0.001136 +Extract partition boundaries from SCOTCH graph | 1 0.001058 0.001058 +GPS: create_level_structure | 2 0.000840 0.003950 +Get SCOTCH graph data | 1 0.000015 0.000015 +Gibbs-Poole-Stockmeyer ordering | 1 0.007753 0.007753 +Init MPI | 1 0.058042 0.058042 +Init PETSc | 1 0.001569 0.001569 +Init dofmap from element dofmap | 2 0.002270 0.004541 +Init logging | 1 0.000056 0.000056 +PETSc Krylov solver | 1 1.124449 1.124449 +SCOTCH: call SCOTCH_dgraphBuild | 1 0.000293 0.000293 +SCOTCH: call SCOTCH_dgraphHalo | 1 0.035750 0.035750 +SCOTCH: call SCOTCH_dgraphPart | 1 8.581797 8.581797 +SparsityPattern::assemble | 1 0.012392 0.012392 +Topology: create | 1 0.025592 0.025592 +Topology: determine shared index ownership | 1 0.008817 0.008817 +Topology: determine vertex ownership groups (owned, undetermined, unowned) | 1 0.002565 0.002565 +ZZZ Assemble matrix | 1 0.220492 0.220492 +ZZZ Assemble vector | 1 0.009333 0.009333 +ZZZ Create Mesh | 1 8.730315 8.730315 +ZZZ Create RHS function | 1 0.015330 0.015330 +ZZZ Create boundary conditions | 1 0.002169 0.002169 +ZZZ Create facets and facet->cell connectivity | 1 0.068648 0.068648 +ZZZ Create forms | 1 0.000078 0.000078 +ZZZ Create near-nullspace | 1 0.008210 0.008210 +ZZZ FunctionSpace | 1 0.005953 0.005953 +ZZZ Solve | 1 1.128409 1.128409 +*** Number of Krylov iterations: 21 *** Solution norm: 3.6795e-05 **************************************************************************************************************************************************************** *** WIDEN YOUR WINDOW TO 160 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** @@ -2431,17 +2449,17 @@ ------------------------------------------------------------------ PETSc Performance Summary: ------------------------------------------------------------------ -/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/dolfinx-scaling-test on a named codethink11-arm64 with 3 processors, by pbuilder1 Mon May 13 18:10:17 2024 +/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/dolfinx-scaling-test on a named i-capture-the-hostname with 3 processors, by pbuilder2 Wed Apr 12 13:52:04 2023 Using Petsc Release Version 3.18.5, Feb 27, 2023 Max Max/Min Avg Total -Time (sec): 1.168e+01 1.000 1.168e+01 +Time (sec): 1.021e+01 1.000 1.021e+01 Objects: 4.630e+02 1.004 4.617e+02 -Flops: 2.320e+08 1.055 2.252e+08 6.757e+08 -Flops/sec: 1.986e+07 1.055 1.928e+07 5.785e+07 -MPI Msg Count: 8.210e+02 1.043 8.067e+02 2.420e+03 -MPI Msg Len (bytes): 6.751e+06 1.356 7.428e+03 1.798e+07 -MPI Reductions: 6.400e+02 1.000 +Flops: 2.187e+08 1.090 2.090e+08 6.271e+08 +Flops/sec: 2.141e+07 1.090 2.047e+07 6.141e+07 +MPI Msg Count: 8.170e+02 1.044 7.947e+02 2.384e+03 +MPI Msg Len (bytes): 7.105e+06 1.481 7.234e+03 1.725e+07 +MPI Reductions: 6.360e+02 1.000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops @@ -2449,7 +2467,7 @@ Summary of Stages: ----- Time ------ ----- Flop ------ --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total Count %Total Avg %Total Count %Total - 0: Main Stage: 1.1682e+01 100.0% 6.7575e+08 100.0% 2.420e+03 100.0% 7.428e+03 100.0% 6.220e+02 97.2% + 0: Main Stage: 1.0212e+01 100.0% 6.2713e+08 100.0% 2.384e+03 100.0% 7.234e+03 100.0% 6.180e+02 97.2% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. @@ -2473,79 +2491,79 @@ --- Event Stage 0: Main Stage -BuildTwoSided 103 1.0 8.6089e-02 9.2 0.00e+00 0.0 1.3e+02 4.0e+00 1.0e+02 0 0 6 0 16 0 0 6 0 17 0 -BuildTwoSidedF 86 1.0 8.5930e-02 9.3 0.00e+00 0.0 1.1e+02 4.0e+04 8.6e+01 0 0 5 26 13 0 0 5 26 14 0 -MatMult 226 1.0 2.5232e-01 1.0 1.27e+08 1.1 1.4e+03 5.4e+03 2.0e+00 2 55 57 42 0 2 55 57 42 0 1470 -MatMultAdd 46 1.0 7.3609e-02 1.4 1.09e+07 1.1 1.8e+02 1.5e+03 0.0e+00 1 5 8 2 0 1 5 8 2 0 435 -MatMultTranspose 46 1.0 3.8739e-02 1.1 1.09e+07 1.1 2.0e+02 1.4e+03 2.0e+00 0 5 8 2 0 0 5 8 2 0 827 -MatSolve 23 0.0 2.7638e-03 0.0 2.71e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 979 -MatLUFactorSym 1 1.0 3.4517e-03 116.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatLUFactorNum 1 1.0 1.3735e-02 3350.1 8.94e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 651 -MatConvert 1 1.0 4.8114e-04 1.1 0.00e+00 0.0 1.2e+01 7.0e+02 1.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatScale 4 1.0 1.4316e-03 1.0 4.75e+05 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 972 -MatResidual 46 1.0 4.7683e-02 1.0 2.49e+07 1.1 2.8e+02 5.2e+03 0.0e+00 0 11 11 8 0 0 11 11 8 0 1511 -MatAssemblyBegin 59 1.0 1.0887e-01 2.9 0.00e+00 0.0 1.1e+02 4.0e+04 3.0e+01 1 0 5 26 5 1 0 5 26 5 0 -MatAssemblyEnd 59 1.0 5.7803e-02 1.0 1.02e+05 1.9 0.0e+00 0.0e+00 9.6e+01 0 0 0 0 15 0 0 0 0 15 4 -MatGetRowIJ 1 0.0 1.0954e-04 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatCreateSubMat 2 1.0 1.4860e-02 1.0 0.00e+00 0.0 1.0e+01 4.7e+04 2.8e+01 0 0 0 3 4 0 0 0 3 5 0 -MatGetOrdering 1 0.0 2.4342e-04 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatCoarsen 2 1.0 1.8558e-02 1.0 1.39e+05 1.0 2.3e+02 7.5e+02 9.7e+01 0 0 10 1 15 0 0 10 1 16 22 -MatZeroEntries 3 1.0 1.8834e-04 2.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatView 6 1.5 1.1554e-03 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 1 0 0 0 0 1 0 -MatAXPY 4 1.0 2.1031e-02 1.0 1.14e+05 1.0 0.0e+00 0.0e+00 1.2e+01 0 0 0 0 2 0 0 0 0 2 16 -MatTranspose 10 1.0 1.2816e-02 1.0 0.00e+00 0.0 8.5e+01 2.2e+03 3.2e+01 0 0 4 1 5 0 0 4 1 5 0 -MatMatMultSym 9 1.0 9.2218e-02 1.1 0.00e+00 0.0 5.1e+01 1.2e+04 2.8e+01 1 0 2 3 4 1 0 2 3 5 0 -MatMatMultNum 3 1.0 2.6771e-02 1.0 6.51e+06 1.1 1.5e+01 2.5e+04 3.0e+00 0 3 1 2 0 0 3 1 2 0 703 -MatPtAPSymbolic 3 1.0 2.3766e-01 1.0 0.00e+00 0.0 9.9e+01 3.7e+04 2.1e+01 2 0 4 20 3 2 0 4 20 3 0 -MatPtAPNumeric 3 1.0 3.6584e-01 1.0 6.26e+07 1.2 3.0e+01 1.0e+05 1.5e+01 3 25 1 17 2 3 25 1 17 2 464 -MatGetLocalMat 9 1.0 4.8075e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatGetBrAoCol 9 1.0 6.8531e-03 1.0 0.00e+00 0.0 1.2e+02 3.2e+04 1.0e+00 0 0 5 21 0 0 0 5 21 0 0 -VecMDot 20 1.0 2.9753e-03 1.5 1.18e+06 1.0 0.0e+00 0.0e+00 2.0e+01 0 1 0 0 3 0 1 0 0 3 1178 -VecTDot 44 1.0 3.2497e-03 1.8 8.94e+05 1.0 0.0e+00 0.0e+00 4.4e+01 0 0 0 0 7 0 0 0 0 7 812 -VecNorm 45 1.0 1.9565e-03 1.1 7.04e+05 1.0 0.0e+00 0.0e+00 4.5e+01 0 0 0 0 7 0 0 0 0 7 1064 -VecScale 22 1.0 1.5282e-04 1.0 1.18e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2295 -VecCopy 142 1.0 8.7670e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -VecSet 212 1.0 4.4972e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -VecAXPY 46 1.0 1.4798e-03 1.1 9.16e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1827 -VecAYPX 297 1.0 5.5676e-03 1.0 2.40e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 1280 -VecAXPBYCZ 92 1.0 3.7021e-03 1.4 2.47e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 1980 -VecMAXPY 22 1.0 1.4884e-03 1.1 1.40e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 2784 -VecAssemblyBegin 57 1.0 1.8464e-03 2.3 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01 0 0 0 0 9 0 0 0 0 9 0 -VecAssemblyEnd 57 1.0 1.4732e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -VecPointwiseMult 206 1.0 3.7935e-03 1.1 1.11e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 866 -VecScatterBegin 376 1.0 8.0418e-03 1.1 0.00e+00 0.0 1.9e+03 4.2e+03 8.0e+00 0 0 80 46 1 0 0 80 46 1 0 -VecScatterEnd 376 1.0 4.7632e-02 1.5 1.57e+04 2.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1 -VecNormalize 22 1.0 9.1270e-04 1.1 3.54e+05 1.0 0.0e+00 0.0e+00 2.2e+01 0 0 0 0 3 0 0 0 0 4 1153 -SFSetGraph 35 1.0 2.7698e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFSetUp 17 1.0 1.4827e-03 1.1 0.00e+00 0.0 1.5e+02 5.0e+02 1.7e+01 0 0 6 0 3 0 0 6 0 3 0 -SFBcastBegin 15 1.0 1.6220e-04 1.1 0.00e+00 0.0 9.0e+01 6.3e+02 0.0e+00 0 0 4 0 0 0 0 4 0 0 0 -SFBcastEnd 15 1.0 1.1594e-04 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFReduceBegin 4 1.0 1.0408e-04 1.0 0.00e+00 0.0 1.7e+01 5.7e+02 0.0e+00 0 0 1 0 0 0 0 1 0 0 0 -SFReduceEnd 4 1.0 5.6240e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFPack 395 1.0 2.2109e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFUnpack 395 1.0 5.8516e-04 1.2 1.57e+04 2.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 58 -KSPSetUp 7 1.0 3.1164e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 1 0 0 0 0 1 0 -KSPSolve 1 1.0 3.6379e-01 1.0 1.50e+08 1.1 1.6e+03 4.6e+03 7.3e+01 3 65 67 41 11 3 65 67 41 12 1208 -KSPGMRESOrthog 20 1.0 4.3382e-03 1.3 2.36e+06 1.0 0.0e+00 0.0e+00 2.0e+01 0 1 0 0 3 0 1 0 0 3 1617 -PCSetUp_GAMG+ 1 1.0 7.9700e-01 1.0 8.35e+07 1.2 7.5e+02 1.2e+04 5.0e+02 7 34 31 52 78 7 34 31 52 81 289 - PCGAMGCreateG 2 1.0 3.4631e-02 1.0 4.98e+04 1.0 6.0e+01 2.9e+03 4.4e+01 0 0 2 1 7 0 0 2 1 7 4 - GAMG Coarsen 2 1.0 1.9980e-02 1.0 1.39e+05 1.0 2.3e+02 7.5e+02 9.7e+01 0 0 10 1 15 0 0 10 1 16 21 - GAMG MIS/Agg 2 1.0 1.8597e-02 1.0 1.39e+05 1.0 2.3e+02 7.5e+02 9.7e+01 0 0 10 1 15 0 0 10 1 16 22 - PCGAMGProl 2 1.0 2.3728e-02 1.0 0.00e+00 0.0 1.5e+02 2.1e+03 1.9e+02 0 0 6 2 29 0 0 6 2 30 0 - GAMG Prol-col 2 1.0 1.1108e-02 1.0 0.00e+00 0.0 1.4e+02 7.8e+02 1.7e+02 0 0 6 1 27 0 0 6 1 27 0 - GAMG Prol-lift 2 1.0 1.2248e-02 1.0 0.00e+00 0.0 1.0e+01 2.1e+04 1.0e+01 0 0 0 1 2 0 0 0 1 2 0 - PCGAMGOptProl 2 1.0 1.0767e-01 1.0 2.08e+07 1.1 1.9e+02 8.5e+03 8.2e+01 1 9 8 9 13 1 9 8 9 13 562 - GAMG smooth 2 1.0 7.6569e-02 1.0 7.04e+06 1.1 4.8e+01 2.0e+04 2.6e+01 1 3 2 5 4 1 3 2 5 4 266 - PCGAMGCreateL 2 1.0 6.0939e-01 1.0 6.25e+07 1.2 1.2e+02 5.9e+04 7.7e+01 5 25 5 40 12 5 25 5 40 12 278 - GAMG PtAP 2 1.0 5.9340e-01 1.0 6.25e+07 1.2 9.6e+01 6.9e+04 2.4e+01 5 25 4 37 4 5 25 4 37 4 285 - GAMG Reduce 1 1.0 1.6366e-02 1.0 0.00e+00 0.0 2.5e+01 2.0e+04 5.3e+01 0 0 1 3 8 0 0 1 3 9 0 -PCGAMG Gal l00 1 1.0 3.9535e-01 1.0 3.67e+07 1.0 4.8e+01 8.8e+04 1.2e+01 3 16 2 24 2 3 16 2 24 2 274 -PCGAMG Opt l00 1 1.0 5.0808e-02 1.0 5.10e+06 1.0 2.4e+01 3.2e+04 1.0e+01 0 2 1 4 2 0 2 1 4 2 296 -PCGAMG Gal l01 1 1.0 1.9804e-01 1.0 2.58e+07 1.5 4.8e+01 5.0e+04 1.2e+01 2 9 2 13 2 2 9 2 13 2 307 -PCGAMG Opt l01 1 1.0 1.1746e-02 1.0 1.55e+06 1.4 2.4e+01 8.0e+03 1.0e+01 0 1 1 1 2 0 1 1 1 2 322 -PCSetUp 2 1.0 8.1453e-01 1.0 8.35e+07 1.1 7.5e+02 1.2e+04 5.0e+02 7 35 31 52 78 7 35 31 52 81 294 -PCSetUpOnBlocks 23 1.0 1.7658e-02 63.4 8.94e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 506 -PCApply 23 1.0 3.2257e-01 1.0 1.30e+08 1.1 1.5e+03 4.2e+03 6.0e+00 3 56 61 35 1 3 56 61 35 1 1169 +BuildTwoSided 103 1.0 9.4586e-02 15.7 0.00e+00 0.0 1.4e+02 4.0e+00 1.0e+02 1 0 6 0 16 1 0 6 0 17 0 +BuildTwoSidedF 86 1.0 9.4585e-02 16.3 0.00e+00 0.0 1.1e+02 3.8e+04 8.6e+01 1 0 5 25 14 1 0 5 25 14 0 +MatMult 217 1.0 2.5678e-01 1.0 1.22e+08 1.1 1.3e+03 5.5e+03 2.0e+00 2 56 56 43 0 2 56 56 43 0 1361 +MatMultAdd 44 1.0 6.9459e-02 1.3 1.00e+07 1.0 1.8e+02 1.5e+03 0.0e+00 1 5 7 1 0 1 5 7 1 0 423 +MatMultTranspose 44 1.0 3.8526e-02 1.1 1.01e+07 1.0 1.9e+02 1.4e+03 2.0e+00 0 5 8 2 0 0 5 8 2 0 764 +MatSolve 22 0.0 2.3024e-03 0.0 2.25e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 976 +MatLUFactorSym 1 1.0 3.1958e-03 105.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatLUFactorNum 1 1.0 1.0315e-02 2578.8 7.07e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 685 +MatConvert 1 1.0 5.0029e-04 1.1 0.00e+00 0.0 1.2e+01 7.2e+02 1.0e+00 0 0 1 0 0 0 0 1 0 0 0 +MatScale 4 1.0 1.4469e-03 1.0 4.57e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 924 +MatResidual 44 1.0 4.7626e-02 1.0 2.38e+07 1.1 2.6e+02 5.3e+03 0.0e+00 0 11 11 8 0 0 11 11 8 0 1417 +MatAssemblyBegin 59 1.0 1.1741e-01 3.2 0.00e+00 0.0 1.1e+02 3.8e+04 3.0e+01 1 0 5 25 5 1 0 5 25 5 0 +MatAssemblyEnd 59 1.0 5.7465e-02 1.1 1.02e+05 2.2 0.0e+00 0.0e+00 9.6e+01 1 0 0 0 15 1 0 0 0 16 4 +MatGetRowIJ 1 0.0 9.0281e-05 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatCreateSubMat 2 1.0 1.2408e-02 1.0 0.00e+00 0.0 1.0e+01 4.4e+04 2.8e+01 0 0 0 3 4 0 0 0 3 5 0 +MatGetOrdering 1 0.0 2.1468e-04 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatCoarsen 2 1.0 1.8999e-02 1.0 1.37e+05 1.0 2.3e+02 7.7e+02 9.6e+01 0 0 10 1 15 0 0 10 1 16 22 +MatZeroEntries 3 1.0 2.4682e-04 3.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatView 6 1.5 1.2617e-03 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 1 0 0 0 0 1 0 +MatAXPY 4 1.0 2.1294e-02 1.0 1.13e+05 1.0 0.0e+00 0.0e+00 1.2e+01 0 0 0 0 2 0 0 0 0 2 16 +MatTranspose 10 1.0 1.3103e-02 1.0 0.00e+00 0.0 9.0e+01 2.1e+03 3.2e+01 0 0 4 1 5 0 0 4 1 5 0 +MatMatMultSym 9 1.0 8.9442e-02 1.1 0.00e+00 0.0 5.1e+01 1.2e+04 2.8e+01 1 0 2 4 4 1 0 2 4 5 0 +MatMatMultNum 3 1.0 2.7448e-02 1.0 6.50e+06 1.1 1.5e+01 2.5e+04 3.0e+00 0 3 1 2 0 0 3 1 2 0 671 +MatPtAPSymbolic 3 1.0 2.1991e-01 1.0 0.00e+00 0.0 9.9e+01 3.4e+04 2.1e+01 2 0 4 20 3 2 0 4 20 3 0 +MatPtAPNumeric 3 1.0 3.4555e-01 1.0 5.66e+07 1.2 3.0e+01 9.0e+04 1.5e+01 3 24 1 16 2 3 24 1 16 2 438 +MatGetLocalMat 9 1.0 4.5240e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatGetBrAoCol 9 1.0 6.6932e-03 1.0 0.00e+00 0.0 1.2e+02 3.1e+04 1.0e+00 0 0 5 21 0 0 0 5 21 0 0 +VecMDot 20 1.0 3.1317e-03 1.5 1.18e+06 1.0 0.0e+00 0.0e+00 2.0e+01 0 1 0 0 3 0 1 0 0 3 1115 +VecTDot 42 1.0 3.2246e-03 1.7 8.56e+05 1.0 0.0e+00 0.0e+00 4.2e+01 0 0 0 0 7 0 0 0 0 7 781 +VecNorm 44 1.0 2.7396e-03 2.0 6.84e+05 1.0 0.0e+00 0.0e+00 4.4e+01 0 0 0 0 7 0 0 0 0 7 737 +VecScale 22 1.0 1.6128e-04 1.0 1.18e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2164 +VecCopy 136 1.0 9.9570e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +VecSet 203 1.0 4.8014e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +VecAXPY 44 1.0 1.4395e-03 1.0 8.78e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1795 +VecAYPX 284 1.0 5.5229e-03 1.0 2.29e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 1229 +VecAXPBYCZ 88 1.0 2.8077e-03 1.0 2.36e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 2487 +VecMAXPY 22 1.0 1.4794e-03 1.0 1.39e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 2788 +VecAssemblyBegin 57 1.0 1.4902e-03 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01 0 0 0 0 9 0 0 0 0 9 0 +VecAssemblyEnd 57 1.0 1.6900e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +VecPointwiseMult 198 1.0 3.8698e-03 1.2 1.06e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 812 +VecScatterBegin 363 1.0 8.4231e-03 1.1 0.00e+00 0.0 1.9e+03 4.2e+03 8.0e+00 0 0 80 46 1 0 0 80 46 1 0 +VecScatterEnd 363 1.0 4.8644e-02 1.5 1.45e+04 2.2 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1 +VecNormalize 22 1.0 9.5487e-04 1.1 3.54e+05 1.0 0.0e+00 0.0e+00 2.2e+01 0 0 0 0 3 0 0 0 0 4 1097 +SFSetGraph 35 1.0 2.9922e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFSetUp 17 1.0 1.5684e-03 1.2 0.00e+00 0.0 1.6e+02 5.0e+02 1.7e+01 0 0 7 0 3 0 0 7 0 3 0 +SFBcastBegin 14 1.0 1.5814e-04 1.0 0.00e+00 0.0 8.4e+01 6.8e+02 0.0e+00 0 0 4 0 0 0 0 4 0 0 0 +SFBcastEnd 14 1.0 9.9620e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFReduceBegin 4 1.0 1.0574e-04 1.1 0.00e+00 0.0 1.8e+01 5.5e+02 0.0e+00 0 0 1 0 0 0 0 1 0 0 0 +SFReduceEnd 4 1.0 5.0900e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFPack 381 1.0 2.5537e-03 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFUnpack 381 1.0 5.5219e-04 1.2 1.45e+04 2.2 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 58 +KSPSetUp 7 1.0 3.1538e-04 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 1 0 0 0 0 1 0 +KSPSolve 1 1.0 3.6278e-01 1.0 1.40e+08 1.1 1.6e+03 4.6e+03 7.0e+01 4 65 65 42 11 4 65 65 42 11 1131 +KSPGMRESOrthog 20 1.0 4.5023e-03 1.3 2.36e+06 1.0 0.0e+00 0.0e+00 2.0e+01 0 1 0 0 3 0 1 0 0 3 1551 +PCSetUp_GAMG+ 1 1.0 7.5894e-01 1.0 7.74e+07 1.2 8.0e+02 1.1e+04 5.0e+02 7 34 33 51 79 7 34 33 51 81 278 + PCGAMGCreateG 2 1.0 3.5038e-02 1.0 4.97e+04 1.0 6.0e+01 3.0e+03 4.4e+01 0 0 3 1 7 0 0 3 1 7 4 + GAMG Coarsen 2 1.0 2.0373e-02 1.0 1.37e+05 1.0 2.3e+02 7.7e+02 9.6e+01 0 0 10 1 15 0 0 10 1 16 20 + GAMG MIS/Agg 2 1.0 1.9036e-02 1.0 1.37e+05 1.0 2.3e+02 7.7e+02 9.6e+01 0 0 10 1 15 0 0 10 1 16 22 + PCGAMGProl 2 1.0 2.4853e-02 1.0 0.00e+00 0.0 1.9e+02 1.8e+03 1.9e+02 0 0 8 2 30 0 0 8 2 30 0 + GAMG Prol-col 2 1.0 1.1233e-02 1.0 0.00e+00 0.0 1.8e+02 6.9e+02 1.7e+02 0 0 8 1 27 0 0 8 1 28 0 + GAMG Prol-lift 2 1.0 1.3222e-02 1.0 0.00e+00 0.0 1.2e+01 1.9e+04 1.0e+01 0 0 1 1 2 0 0 1 1 2 0 + PCGAMGOptProl 2 1.0 1.0915e-01 1.0 2.07e+07 1.1 1.9e+02 8.6e+03 8.2e+01 1 9 8 10 13 1 9 8 10 13 544 + GAMG smooth 2 1.0 7.7196e-02 1.0 7.01e+06 1.1 4.8e+01 2.0e+04 2.6e+01 1 3 2 6 4 1 3 2 6 4 258 + PCGAMGCreateL 2 1.0 5.6790e-01 1.0 5.65e+07 1.2 1.2e+02 5.4e+04 7.7e+01 6 24 5 38 12 6 24 5 38 12 266 + GAMG PtAP 2 1.0 5.5434e-01 1.0 5.65e+07 1.2 9.6e+01 6.3e+04 2.4e+01 5 24 4 35 4 5 24 4 35 4 273 + GAMG Reduce 1 1.0 1.3737e-02 1.0 0.00e+00 0.0 2.5e+01 1.8e+04 5.3e+01 0 0 1 3 8 0 0 1 3 9 0 +PCGAMG Gal l00 1 1.0 3.8815e-01 1.0 3.54e+07 1.0 4.8e+01 8.6e+04 1.2e+01 4 17 2 24 2 4 17 2 24 2 267 +PCGAMG Opt l00 1 1.0 5.1752e-02 1.0 5.04e+06 1.0 2.4e+01 3.3e+04 1.0e+01 1 2 1 5 2 1 2 1 5 2 290 +PCGAMG Gal l01 1 1.0 1.6620e-01 1.0 2.12e+07 1.9 4.8e+01 4.0e+04 1.2e+01 2 8 2 11 2 2 8 2 11 2 285 +PCGAMG Opt l01 1 1.0 1.1367e-02 1.0 1.49e+06 1.6 2.4e+01 7.6e+03 1.0e+01 0 1 1 1 2 0 1 1 1 2 298 +PCSetUp 2 1.0 7.7278e-01 1.0 7.74e+07 1.1 8.0e+02 1.1e+04 5.0e+02 7 35 33 51 79 7 35 33 51 81 282 +PCSetUpOnBlocks 22 1.0 1.3962e-02 49.0 7.07e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 506 +PCApply 22 1.0 3.2048e-01 1.0 1.21e+08 1.1 1.4e+03 4.3e+03 6.0e+00 3 56 60 35 1 3 56 60 35 1 1094 --- Event Stage 1: Unknown @@ -2574,9 +2592,9 @@ --- Event Stage 1: Unknown ======================================================================================================================== -Average time to get PetscTime(): 7.6e-08 -Average time for MPI_Barrier(): 3.08e-06 -Average time for zero size MPI_Send(): 1.58e-06 +Average time to get PetscTime(): 8.2e-08 +Average time for MPI_Barrier(): 3.208e-06 +Average time for zero size MPI_Send(): 1.59333e-06 #PETSc Option Table entries: --ndofs 30000 --problem_type elasticity @@ -2639,67 +2657,67 @@ Option left: name:--problem_type value: elasticity Option left: name:--scaling_type value: strong Option left: name:-mg_levels_esteig_ksp_type value: cg -2024-05-13 18:10:17.852 ( 11.803s) [RANK: 0 ] loguru.cpp:526 INFO| atexit +2023-04-12 13:52:04.694 ( 10.332s) [RANK: 0 ] loguru.cpp:526 INFO| atexit ==== poisson weak scaling (3 processes with ndofs=10000) ==== -2024-05-13 18:10:18.140 ( 0.069s) [RANK: 0 ] utils.cpp:612 INFO| Compute partition of cells across ranks -2024-05-13 18:10:18.140 ( 0.069s) [RANK: 0 ] graphbuild.cpp:533 INFO| Building mesh dual graph -2024-05-13 18:10:18.140 ( 0.069s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph -2024-05-13 18:10:18.252 ( 0.180s) [RANK: 0 ] graphbuild.cpp:89 INFO| Build nonlocal part of mesh dual graph -2024-05-13 18:10:18.255 ( 0.183s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:18.255 ( 0.183s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:18.255 ( 0.183s) [RANK: 0 ] graphbuild.cpp:203 INFO| Number of destination and source ranks in non-local dual graph construction, and ratio to total number of ranks: 2, 2, 0.666667, 0.666667 -2024-05-13 18:10:18.261 ( 0.189s) [RANK: 0 ] graphbuild.cpp:545 INFO| Graph edges (local: 210000, non-local: 1800) -2024-05-13 18:10:18.261 ( 0.189s) [RANK: 0 ] partitioners.cpp:316 INFO| Compute graph partition using PT-SCOTCH -2024-05-13 18:10:33.506 ( 15.434s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 -2024-05-13 18:10:33.507 ( 15.436s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 -2024-05-13 18:10:33.518 ( 15.447s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph -2024-05-13 18:10:33.621 ( 15.550s) [RANK: 0 ] ordering.cpp:202 INFO| GPS pseudo-diameter:(168) 52610-476 - -2024-05-13 18:10:33.638 ( 15.567s) [RANK: 0 ] Topology.cpp:923 INFO| Create topology -2024-05-13 18:10:33.657 ( 15.586s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 -2024-05-13 18:10:33.657 ( 15.586s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 -2024-05-13 18:10:33.706 ( 15.634s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.706 ( 15.634s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:33.706 ( 15.634s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.706 ( 15.635s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:33.715 ( 15.644s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.715 ( 15.644s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:33.716 ( 15.644s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:33.716 ( 15.644s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:33.716 ( 15.645s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.716 ( 15.645s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:33.726 ( 15.655s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:33.726 ( 15.655s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:33.726 ( 15.655s) [RANK: 0 ] MPI.h:368 INFO| Number of neighbourhood source ranks in distribute_to_postoffice: 0 -2024-05-13 18:10:33.727 ( 15.656s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.727 ( 15.656s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:33.727 ( 15.656s) [RANK: 0 ] MPI.h:500 INFO| Neighbourhood destination ranks from post office in distribute_data (rank, num dests, num dests/mpi_size): 0, 2, 0.666667 +2023-04-12 13:52:04.990 ( 0.069s) [RANK: 0 ] utils.cpp:612 INFO| Compute partition of cells across ranks +2023-04-12 13:52:04.990 ( 0.069s) [RANK: 0 ] graphbuild.cpp:533 INFO| Building mesh dual graph +2023-04-12 13:52:04.990 ( 0.069s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph +2023-04-12 13:52:05.102 ( 0.180s) [RANK: 0 ] graphbuild.cpp:89 INFO| Build nonlocal part of mesh dual graph +2023-04-12 13:52:05.110 ( 0.189s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:05.110 ( 0.189s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:05.110 ( 0.189s) [RANK: 0 ] graphbuild.cpp:203 INFO| Number of destination and source ranks in non-local dual graph construction, and ratio to total number of ranks: 2, 2, 0.666667, 0.666667 +2023-04-12 13:52:05.117 ( 0.195s) [RANK: 0 ] graphbuild.cpp:545 INFO| Graph edges (local: 210000, non-local: 1800) +2023-04-12 13:52:05.117 ( 0.195s) [RANK: 0 ] partitioners.cpp:316 INFO| Compute graph partition using PT-SCOTCH +2023-04-12 13:52:21.987 ( 17.066s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 +2023-04-12 13:52:21.987 ( 17.066s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 +2023-04-12 13:52:21.998 ( 17.077s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph +2023-04-12 13:52:22.108 ( 17.187s) [RANK: 0 ] ordering.cpp:202 INFO| GPS pseudo-diameter:(171) 52882-613 + +2023-04-12 13:52:22.128 ( 17.207s) [RANK: 0 ] Topology.cpp:923 INFO| Create topology +2023-04-12 13:52:22.159 ( 17.238s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 +2023-04-12 13:52:22.159 ( 17.238s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 +2023-04-12 13:52:22.207 ( 17.286s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.207 ( 17.286s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.208 ( 17.286s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.208 ( 17.286s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.217 ( 17.295s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.217 ( 17.296s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.217 ( 17.296s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:52:22.217 ( 17.296s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:52:22.218 ( 17.296s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.218 ( 17.297s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.227 ( 17.306s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:52:22.228 ( 17.306s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:52:22.228 ( 17.306s) [RANK: 0 ] MPI.h:368 INFO| Number of neighbourhood source ranks in distribute_to_postoffice: 0 +2023-04-12 13:52:22.228 ( 17.307s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.228 ( 17.307s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.228 ( 17.307s) [RANK: 0 ] MPI.h:500 INFO| Neighbourhood destination ranks from post office in distribute_data (rank, num dests, num dests/mpi_size): 0, 2, 0.666667 UnitCube (30x30x30) to be refined 0 times -2024-05-13 18:10:33.734 ( 15.662s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 2 -2024-05-13 18:10:33.778 ( 15.706s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.780 ( 15.708s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:33.873 ( 15.802s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.873 ( 15.802s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:33.880 ( 15.808s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 -2024-05-13 18:10:33.880 ( 15.808s) [RANK: 0 ]topologycomputation.cpp:650 INFO| Computing mesh connectivity 2 - 3 from transpose. -2024-05-13 18:10:33.885 ( 15.813s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 0 -2024-05-13 18:10:33.893 ( 15.822s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.893 ( 15.822s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:33.894 ( 15.822s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.894 ( 15.822s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:33.894 ( 15.822s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:33.894 ( 15.822s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:33.895 ( 15.823s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.895 ( 15.823s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:33.896 ( 15.825s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 -2024-05-13 18:10:33.898 ( 15.827s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 -2024-05-13 18:10:33.898 ( 15.827s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 0 -2024-05-13 18:10:33.900 ( 15.828s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 -2024-05-13 18:10:33.960 ( 15.888s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 -2024-05-13 18:10:33.960 ( 15.888s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 2 -2024-05-13 18:10:33.960 ( 15.889s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:33.960 ( 15.889s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:34.012 ( 15.940s) [RANK: 0 ] SparsityPattern.cpp:389 INFO| Column ghost size increased from 454 to 1211 +2023-04-12 13:52:22.235 ( 17.313s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 2 +2023-04-12 13:52:22.279 ( 17.358s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.281 ( 17.360s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.378 ( 17.456s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.378 ( 17.457s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.385 ( 17.463s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 +2023-04-12 13:52:22.385 ( 17.463s) [RANK: 0 ]topologycomputation.cpp:650 INFO| Computing mesh connectivity 2 - 3 from transpose. +2023-04-12 13:52:22.390 ( 17.469s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 0 +2023-04-12 13:52:22.398 ( 17.477s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.399 ( 17.478s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.399 ( 17.478s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.399 ( 17.478s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.399 ( 17.478s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:52:22.399 ( 17.478s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:52:22.400 ( 17.479s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.400 ( 17.479s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.402 ( 17.481s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 +2023-04-12 13:52:22.404 ( 17.483s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 +2023-04-12 13:52:22.404 ( 17.483s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 0 +2023-04-12 13:52:22.405 ( 17.484s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 +2023-04-12 13:52:22.464 ( 17.543s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 +2023-04-12 13:52:22.464 ( 17.543s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 2 +2023-04-12 13:52:22.464 ( 17.543s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:22.465 ( 17.544s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:22.519 ( 17.598s) [RANK: 0 ] SparsityPattern.cpp:389 INFO| Column ghost size increased from 434 to 1223 ---------------------------------------------------------------- Test problem summary @@ -2714,7 +2732,7 @@ Total degrees of freedom: 29791 (29.8 thousand) Average degrees of freedom per process: 9930 ---------------------------------------------------------------- -2024-05-13 18:10:34.155 ( 16.084s) [RANK: 0 ] petsc.cpp:677 INFO| PETSc Krylov solver starting to solve system. +2023-04-12 13:52:22.661 ( 17.740s) [RANK: 0 ] petsc.cpp:677 INFO| PETSc Krylov solver starting to solve system. KSP Object: 3 MPI processes type: cg maximum iterations=10000, initial guess is zero @@ -2758,44 +2776,44 @@ [MPI_MAX] Summary of timings | reps wall avg wall tot --------------------------------------------------------------------------------------------------------- -Build BoxMesh | 1 15.597352 15.597352 -Build dofmap data | 2 0.011254 0.022509 -Build sparsity | 1 0.017150 0.017150 -Compute connectivity 2-3 | 1 0.004254 0.004254 -Compute dof reordering map | 2 0.001256 0.002512 -Compute entities of dim = 2 | 1 0.146118 0.146118 -Compute graph partition (SCOTCH) | 1 15.239311 15.239311 -Compute local part of mesh dual graph | 2 0.106450 0.212901 -Compute local-to-local map | 1 0.001950 0.001950 -Compute non-local part of mesh dual graph | 1 0.008779 0.008779 -Compute-local-to-global links for global/local adjacency list | 1 0.002120 0.002120 -Distribute AdjacencyList nodes to destination ranks | 1 0.024840 0.024840 -Distribute row-wise data (scalable) | 1 0.002651 0.002651 -Extract partition boundaries from SCOTCH graph | 1 0.002949 0.002949 -GPS: create_level_structure | 2 0.002917 0.010828 -Get SCOTCH graph data | 1 0.000015 0.000015 -Gibbs-Poole-Stockmeyer ordering | 1 0.023927 0.023927 -Init MPI | 1 0.062835 0.062835 -Init PETSc | 1 0.001573 0.001573 -Init dofmap from element dofmap | 2 0.007245 0.014489 -Init logging | 1 0.000053 0.000053 -PETSc Krylov solver | 1 0.664794 0.664794 -SCOTCH: call SCOTCH_dgraphBuild | 1 0.000457 0.000457 -SCOTCH: call SCOTCH_dgraphHalo | 1 0.038896 0.038896 -SCOTCH: call SCOTCH_dgraphPart | 1 15.200208 15.200208 -SparsityPattern::assemble | 1 0.038730 0.038730 -Topology: create | 1 0.068399 0.068399 -Topology: determine shared index ownership | 1 0.011511 0.011511 -Topology: determine vertex ownership groups (owned, undetermined, unowned) | 1 0.008109 0.008109 -ZZZ Assemble | 1 0.255568 0.255568 -ZZZ Assemble matrix | 1 0.108920 0.108920 -ZZZ Assemble vector | 1 0.022428 0.022428 -ZZZ Create Mesh | 1 15.597586 15.597586 -ZZZ Create RHS function | 1 0.058678 0.058678 -ZZZ Create boundary conditions | 1 0.005106 0.005106 -ZZZ Create facets and facet->cell connectivity | 1 0.150476 0.150476 -ZZZ FunctionSpace | 1 0.012263 0.012263 -ZZZ Solve | 1 0.668528 0.668528 +Build BoxMesh | 1 17.248828 17.248828 +Build dofmap data | 2 0.011195 0.022391 +Build sparsity | 1 0.017948 0.017948 +Compute connectivity 2-3 | 1 0.004719 0.004719 +Compute dof reordering map | 2 0.001287 0.002574 +Compute entities of dim = 2 | 1 0.150210 0.150210 +Compute graph partition (SCOTCH) | 1 16.862439 16.862439 +Compute local part of mesh dual graph | 2 0.118137 0.236274 +Compute local-to-local map | 1 0.002066 0.002066 +Compute non-local part of mesh dual graph | 1 0.014692 0.014692 +Compute-local-to-global links for global/local adjacency list | 1 0.002185 0.002185 +Distribute AdjacencyList nodes to destination ranks | 1 0.026645 0.026645 +Distribute row-wise data (scalable) | 1 0.002118 0.002118 +Extract partition boundaries from SCOTCH graph | 1 0.003067 0.003067 +GPS: create_level_structure | 2 0.003973 0.015894 +Get SCOTCH graph data | 1 0.000016 0.000016 +Gibbs-Poole-Stockmeyer ordering | 1 0.029584 0.029584 +Init MPI | 1 0.062292 0.062292 +Init PETSc | 1 0.001785 0.001785 +Init dofmap from element dofmap | 2 0.007301 0.014602 +Init logging | 1 0.000058 0.000058 +PETSc Krylov solver | 1 0.708954 0.708954 +SCOTCH: call SCOTCH_dgraphBuild | 1 0.000432 0.000432 +SCOTCH: call SCOTCH_dgraphHalo | 1 0.047604 0.047604 +SCOTCH: call SCOTCH_dgraphPart | 1 16.817549 16.817549 +SparsityPattern::assemble | 1 0.038732 0.038732 +Topology: create | 1 0.079550 0.079550 +Topology: determine shared index ownership | 1 0.023052 0.023052 +Topology: determine vertex ownership groups (owned, undetermined, unowned) | 1 0.008280 0.008280 +ZZZ Assemble | 1 0.255789 0.255789 +ZZZ Assemble matrix | 1 0.109533 0.109533 +ZZZ Assemble vector | 1 0.022642 0.022642 +ZZZ Create Mesh | 1 17.249123 17.249123 +ZZZ Create RHS function | 1 0.058645 0.058645 +ZZZ Create boundary conditions | 1 0.005175 0.005175 +ZZZ Create facets and facet->cell connectivity | 1 0.154974 0.154974 +ZZZ FunctionSpace | 1 0.012218 0.012218 +ZZZ Solve | 1 0.712766 0.712766 *** Number of Krylov iterations: 7 *** Solution norm: 36.6791 **************************************************************************************************************************************************************** @@ -2804,16 +2822,16 @@ ------------------------------------------------------------------ PETSc Performance Summary: ------------------------------------------------------------------ -/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/dolfinx-scaling-test on a named codethink11-arm64 with 3 processors, by pbuilder1 Mon May 13 18:10:34 2024 +/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/dolfinx-scaling-test on a named i-capture-the-hostname with 3 processors, by pbuilder2 Wed Apr 12 13:52:23 2023 Using Petsc Release Version 3.18.5, Feb 27, 2023 Max Max/Min Avg Total -Time (sec): 1.669e+01 1.000 1.669e+01 +Time (sec): 1.839e+01 1.000 1.839e+01 Objects: 3.200e+01 1.000 3.200e+01 -Flops: 3.508e+06 1.030 3.449e+06 1.035e+07 -Flops/sec: 2.102e+05 1.030 2.067e+05 6.201e+05 +Flops: 3.469e+06 1.016 3.449e+06 1.035e+07 +Flops/sec: 1.887e+05 1.016 1.876e+05 5.627e+05 MPI Msg Count: 2.800e+01 1.000 2.800e+01 8.400e+01 -MPI Msg Len (bytes): 1.851e+05 1.207 6.207e+03 5.214e+05 +MPI Msg Len (bytes): 1.923e+05 1.245 6.398e+03 5.374e+05 MPI Reductions: 6.700e+01 1.000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) @@ -2822,7 +2840,7 @@ Summary of Stages: ----- Time ------ ----- Flop ------ --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total Count %Total Avg %Total Count %Total - 0: Main Stage: 1.6687e+01 100.0% 1.0348e+07 100.0% 8.400e+01 100.0% 6.207e+03 100.0% 4.900e+01 73.1% + 0: Main Stage: 1.8388e+01 100.0% 1.0347e+07 100.0% 8.400e+01 100.0% 6.398e+03 100.0% 4.900e+01 73.1% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. @@ -2846,30 +2864,30 @@ --- Event Stage 0: Main Stage -BuildTwoSided 4 1.0 4.4408e-03 22.1 0.00e+00 0.0 1.8e+01 4.0e+00 4.0e+00 0 0 21 0 6 0 0 21 0 8 0 -BuildTwoSidedF 2 1.0 4.3657e-03 37.3 0.00e+00 0.0 1.2e+01 2.1e+04 2.0e+00 0 0 14 49 3 0 0 14 49 4 0 -MatMult 7 1.0 7.2865e-03 1.0 1.94e+06 1.0 5.4e+01 4.6e+03 1.0e+00 0 55 64 47 1 0 55 64 47 2 786 -MatConvert 1 1.0 4.0099e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 3 0 0 0 0 4 0 -MatAssemblyBegin 3 1.0 7.9695e-03 2.2 0.00e+00 0.0 1.2e+01 2.1e+04 2.0e+00 0 0 14 49 3 0 0 14 49 4 0 -MatAssemblyEnd 3 1.0 6.2367e-03 1.1 3.68e+03 1.3 0.0e+00 0.0e+00 6.0e+00 0 0 0 0 9 0 0 0 0 12 2 -MatGetRowIJ 2 1.0 2.8000e-06 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatView 1 1.0 5.5636e-04 2.9 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 1 0 0 0 0 2 0 -VecTDot 14 1.0 6.4714e-04 1.1 2.82e+05 1.0 0.0e+00 0.0e+00 1.4e+01 0 8 0 0 21 0 8 0 0 29 1289 -VecNorm 8 1.0 9.2178e-04 1.9 1.61e+05 1.0 0.0e+00 0.0e+00 8.0e+00 0 5 0 0 12 0 5 0 0 16 517 -VecCopy 2 1.0 4.6700e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -VecSet 9 1.0 6.9240e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -VecAXPY 14 1.0 4.4390e-04 1.1 2.82e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 8 0 0 0 0 8 0 0 0 1879 -VecAYPX 6 1.0 1.9746e-04 1.3 1.21e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 3 0 0 0 0 3 0 0 0 1810 -VecScatterBegin 8 1.0 6.6320e-04 1.2 0.00e+00 0.0 7.2e+01 3.7e+03 2.0e+00 0 0 86 51 3 0 0 86 51 4 0 -VecScatterEnd 8 1.0 6.8744e-04 2.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFSetGraph 3 1.0 7.5880e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFSetUp 2 1.0 3.4532e-04 1.4 0.00e+00 0.0 2.4e+01 9.4e+02 2.0e+00 0 0 29 4 3 0 0 29 4 4 0 -SFPack 8 1.0 9.7980e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFUnpack 8 1.0 3.4560e-05 2.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -KSPSetUp 1 1.0 6.4080e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -KSPSolve 1 1.0 3.3915e-01 1.0 2.79e+06 1.0 5.4e+01 4.6e+03 2.3e+01 2 80 64 47 34 2 80 64 47 47 24 -PCSetUp 1 1.0 3.2467e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 2 0 0 0 3 2 0 0 0 4 0 -PCApply 8 1.0 3.2986e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 +BuildTwoSided 4 1.0 2.0275e-03 15.0 0.00e+00 0.0 1.8e+01 4.0e+00 4.0e+00 0 0 21 0 6 0 0 21 0 8 0 +BuildTwoSidedF 2 1.0 2.0298e-03 21.2 0.00e+00 0.0 1.2e+01 2.2e+04 2.0e+00 0 0 14 49 3 0 0 14 49 4 0 +MatMult 7 1.0 7.8201e-03 1.0 1.92e+06 1.0 5.4e+01 4.7e+03 1.0e+00 0 55 64 47 1 0 55 64 47 2 733 +MatConvert 1 1.0 4.1493e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 3 0 0 0 0 4 0 +MatAssemblyBegin 3 1.0 5.0156e-03 1.3 0.00e+00 0.0 1.2e+01 2.2e+04 2.0e+00 0 0 14 49 3 0 0 14 49 4 0 +MatAssemblyEnd 3 1.0 6.5050e-03 1.0 3.74e+03 1.2 0.0e+00 0.0e+00 6.0e+00 0 0 0 0 9 0 0 0 0 12 2 +MatGetRowIJ 2 1.0 3.1400e-06 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatView 1 1.0 4.6253e-04 4.1 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 1 0 0 0 0 2 0 +VecTDot 14 1.0 7.3285e-04 1.3 2.79e+05 1.0 0.0e+00 0.0e+00 1.4e+01 0 8 0 0 21 0 8 0 0 29 1138 +VecNorm 8 1.0 2.8243e-03 6.6 1.59e+05 1.0 0.0e+00 0.0e+00 8.0e+00 0 5 0 0 12 0 5 0 0 16 169 +VecCopy 2 1.0 5.5240e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +VecSet 9 1.0 7.4821e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +VecAXPY 14 1.0 5.3769e-04 1.2 2.79e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 8 0 0 0 0 8 0 0 0 1551 +VecAYPX 6 1.0 2.0406e-04 1.2 1.20e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 3 0 0 0 0 3 0 0 0 1752 +VecScatterBegin 8 1.0 6.5467e-04 1.1 0.00e+00 0.0 7.2e+01 3.8e+03 2.0e+00 0 0 86 51 3 0 0 86 51 4 0 +VecScatterEnd 8 1.0 8.5447e-04 3.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFSetGraph 3 1.0 7.9020e-05 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFSetUp 2 1.0 3.0390e-04 1.2 0.00e+00 0.0 2.4e+01 9.7e+02 2.0e+00 0 0 29 4 3 0 0 29 4 4 0 +SFPack 8 1.0 1.1982e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFUnpack 8 1.0 2.7441e-05 2.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +KSPSetUp 1 1.0 7.1660e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +KSPSolve 1 1.0 3.7182e-01 1.0 2.76e+06 1.0 5.4e+01 4.7e+03 2.3e+01 2 80 64 47 34 2 80 64 47 47 22 +PCSetUp 1 1.0 3.3629e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 2 0 0 0 3 2 0 0 0 4 0 +PCApply 8 1.0 3.6194e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 --- Event Stage 1: Unknown @@ -2891,9 +2909,9 @@ --- Event Stage 1: Unknown ======================================================================================================================== -Average time to get PetscTime(): 7.8e-08 -Average time for MPI_Barrier(): 2.88e-06 -Average time for zero size MPI_Send(): 3.72667e-06 +Average time to get PetscTime(): 8e-08 +Average time for MPI_Barrier(): 2.536e-06 +Average time for zero size MPI_Send(): 4.32e-06 #PETSc Option Table entries: --ndofs 10000 --problem_type poisson @@ -2949,67 +2967,67 @@ Option left: name:--ndofs value: 10000 Option left: name:--problem_type value: poisson Option left: name:--scaling_type value: weak -2024-05-13 18:10:34.885 ( 16.813s) [RANK: 0 ] loguru.cpp:526 INFO| atexit +2023-04-12 13:52:23.434 ( 18.513s) [RANK: 0 ] loguru.cpp:526 INFO| atexit ==== poisson strong scaling (3 processes with ndofs=30000) ==== -2024-05-13 18:10:35.175 ( 0.064s) [RANK: 0 ] utils.cpp:612 INFO| Compute partition of cells across ranks -2024-05-13 18:10:35.175 ( 0.064s) [RANK: 0 ] graphbuild.cpp:533 INFO| Building mesh dual graph -2024-05-13 18:10:35.175 ( 0.064s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph -2024-05-13 18:10:35.281 ( 0.170s) [RANK: 0 ] graphbuild.cpp:89 INFO| Build nonlocal part of mesh dual graph -2024-05-13 18:10:35.292 ( 0.181s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:35.292 ( 0.181s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:35.292 ( 0.181s) [RANK: 0 ] graphbuild.cpp:203 INFO| Number of destination and source ranks in non-local dual graph construction, and ratio to total number of ranks: 2, 2, 0.666667, 0.666667 -2024-05-13 18:10:35.298 ( 0.187s) [RANK: 0 ] graphbuild.cpp:545 INFO| Graph edges (local: 210000, non-local: 1800) -2024-05-13 18:10:35.298 ( 0.187s) [RANK: 0 ] partitioners.cpp:316 INFO| Compute graph partition using PT-SCOTCH -2024-05-13 18:10:49.799 ( 14.688s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 -2024-05-13 18:10:49.800 ( 14.689s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 -2024-05-13 18:10:49.810 ( 14.699s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph -2024-05-13 18:10:49.926 ( 14.815s) [RANK: 0 ] ordering.cpp:202 INFO| GPS pseudo-diameter:(192) 2694-52270 - -2024-05-13 18:10:49.942 ( 14.830s) [RANK: 0 ] Topology.cpp:923 INFO| Create topology -2024-05-13 18:10:49.975 ( 14.864s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 -2024-05-13 18:10:49.975 ( 14.864s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 -2024-05-13 18:10:50.022 ( 14.911s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.026 ( 14.915s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.026 ( 14.915s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.026 ( 14.915s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.035 ( 14.924s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.036 ( 14.924s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.036 ( 14.925s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:50.036 ( 14.925s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:50.037 ( 14.925s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.037 ( 14.925s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.047 ( 14.935s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:50.047 ( 14.936s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:50.047 ( 14.936s) [RANK: 0 ] MPI.h:368 INFO| Number of neighbourhood source ranks in distribute_to_postoffice: 0 -2024-05-13 18:10:50.047 ( 14.936s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.048 ( 14.936s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.048 ( 14.937s) [RANK: 0 ] MPI.h:500 INFO| Neighbourhood destination ranks from post office in distribute_data (rank, num dests, num dests/mpi_size): 0, 2, 0.666667 +2023-04-12 13:52:23.733 ( 0.073s) [RANK: 0 ] utils.cpp:612 INFO| Compute partition of cells across ranks +2023-04-12 13:52:23.734 ( 0.073s) [RANK: 0 ] graphbuild.cpp:533 INFO| Building mesh dual graph +2023-04-12 13:52:23.734 ( 0.073s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph +2023-04-12 13:52:23.851 ( 0.191s) [RANK: 0 ] graphbuild.cpp:89 INFO| Build nonlocal part of mesh dual graph +2023-04-12 13:52:23.852 ( 0.192s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:23.852 ( 0.192s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:23.852 ( 0.192s) [RANK: 0 ] graphbuild.cpp:203 INFO| Number of destination and source ranks in non-local dual graph construction, and ratio to total number of ranks: 2, 2, 0.666667, 0.666667 +2023-04-12 13:52:23.859 ( 0.198s) [RANK: 0 ] graphbuild.cpp:545 INFO| Graph edges (local: 210000, non-local: 1800) +2023-04-12 13:52:23.859 ( 0.198s) [RANK: 0 ] partitioners.cpp:316 INFO| Compute graph partition using PT-SCOTCH +2023-04-12 13:52:38.422 ( 14.761s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 +2023-04-12 13:52:38.422 ( 14.761s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 +2023-04-12 13:52:38.433 ( 14.772s) [RANK: 0 ] graphbuild.cpp:396 INFO| Build local part of mesh dual graph +2023-04-12 13:52:38.547 ( 14.887s) [RANK: 0 ] ordering.cpp:202 INFO| GPS pseudo-diameter:(170) 1839-51611 + +2023-04-12 13:52:38.563 ( 14.902s) [RANK: 0 ] Topology.cpp:923 INFO| Create topology +2023-04-12 13:52:38.587 ( 14.926s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 3 +2023-04-12 13:52:38.587 ( 14.926s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 3 +2023-04-12 13:52:38.631 ( 14.970s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.634 ( 14.974s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.634 ( 14.974s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.634 ( 14.974s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.643 ( 14.983s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.643 ( 14.983s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.644 ( 14.983s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:52:38.644 ( 14.983s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:52:38.644 ( 14.984s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.644 ( 14.984s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.654 ( 14.993s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:52:38.654 ( 14.993s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:52:38.654 ( 14.993s) [RANK: 0 ] MPI.h:368 INFO| Number of neighbourhood source ranks in distribute_to_postoffice: 0 +2023-04-12 13:52:38.655 ( 14.994s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.655 ( 14.994s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.655 ( 14.994s) [RANK: 0 ] MPI.h:500 INFO| Neighbourhood destination ranks from post office in distribute_data (rank, num dests, num dests/mpi_size): 0, 2, 0.666667 UnitCube (30x30x30) to be refined 0 times -2024-05-13 18:10:50.054 ( 14.943s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 2 -2024-05-13 18:10:50.100 ( 14.989s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.104 ( 14.992s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.215 ( 15.104s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.215 ( 15.104s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.222 ( 15.111s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 -2024-05-13 18:10:50.222 ( 15.111s) [RANK: 0 ]topologycomputation.cpp:650 INFO| Computing mesh connectivity 2 - 3 from transpose. -2024-05-13 18:10:50.227 ( 15.116s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 0 -2024-05-13 18:10:50.236 ( 15.125s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.237 ( 15.126s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.237 ( 15.126s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.237 ( 15.126s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.238 ( 15.126s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 -2024-05-13 18:10:50.238 ( 15.127s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 -2024-05-13 18:10:50.238 ( 15.127s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.238 ( 15.127s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.240 ( 15.129s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 -2024-05-13 18:10:50.242 ( 15.131s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 -2024-05-13 18:10:50.242 ( 15.131s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 0 -2024-05-13 18:10:50.244 ( 15.133s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 -2024-05-13 18:10:50.305 ( 15.194s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 -2024-05-13 18:10:50.305 ( 15.194s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 2 -2024-05-13 18:10:50.305 ( 15.194s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 -2024-05-13 18:10:50.306 ( 15.194s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 -2024-05-13 18:10:50.360 ( 15.249s) [RANK: 0 ] SparsityPattern.cpp:389 INFO| Column ghost size increased from 492 to 1316 +2023-04-12 13:52:38.661 ( 15.000s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 2 +2023-04-12 13:52:38.704 ( 15.043s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.705 ( 15.045s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.803 ( 15.142s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.803 ( 15.143s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.810 ( 15.149s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 +2023-04-12 13:52:38.810 ( 15.149s) [RANK: 0 ]topologycomputation.cpp:650 INFO| Computing mesh connectivity 2 - 3 from transpose. +2023-04-12 13:52:38.815 ( 15.154s) [RANK: 0 ]topologycomputation.cpp:746 INFO| Computing mesh entities of dimension 0 +2023-04-12 13:52:38.823 ( 15.163s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.824 ( 15.163s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.824 ( 15.163s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.824 ( 15.163s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.824 ( 15.164s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 0 +2023-04-12 13:52:38.824 ( 15.164s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 0 +2023-04-12 13:52:38.825 ( 15.164s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.825 ( 15.164s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.827 ( 15.166s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 +2023-04-12 13:52:38.829 ( 15.168s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 0 +2023-04-12 13:52:38.829 ( 15.168s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 0 +2023-04-12 13:52:38.830 ( 15.169s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 +2023-04-12 13:52:38.889 ( 15.229s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 2 - 3 +2023-04-12 13:52:38.889 ( 15.229s) [RANK: 0 ]topologycomputation.cpp:786 INFO| Requesting connectivity 3 - 2 +2023-04-12 13:52:38.890 ( 15.229s) [RANK: 0 ] MPI.cpp:154 INFO| Computing communicaton graph edges (using NBX algorithm). Number of input edges: 2 +2023-04-12 13:52:38.890 ( 15.229s) [RANK: 0 ] MPI.cpp:217 INFO| Finished graph edge discovery using NBX algorithm. Number of discovered edges 2 +2023-04-12 13:52:38.942 ( 15.281s) [RANK: 0 ] SparsityPattern.cpp:389 INFO| Column ghost size increased from 472 to 1300 ---------------------------------------------------------------- Test problem summary @@ -3024,7 +3042,7 @@ Total degrees of freedom: 29791 (29.8 thousand) Average degrees of freedom per process: 9930 ---------------------------------------------------------------- -2024-05-13 18:10:50.515 ( 15.403s) [RANK: 0 ] petsc.cpp:677 INFO| PETSc Krylov solver starting to solve system. +2023-04-12 13:52:39.087 ( 15.426s) [RANK: 0 ] petsc.cpp:677 INFO| PETSc Krylov solver starting to solve system. KSP Object: 3 MPI processes type: cg maximum iterations=10000, initial guess is zero @@ -3068,44 +3086,44 @@ [MPI_MAX] Summary of timings | reps wall avg wall tot --------------------------------------------------------------------------------------------------------- -Build BoxMesh | 1 14.882973 14.882973 -Build dofmap data | 2 0.011840 0.023681 -Build sparsity | 1 0.018932 0.018932 -Compute connectivity 2-3 | 1 0.004806 0.004806 -Compute dof reordering map | 2 0.001316 0.002632 -Compute entities of dim = 2 | 1 0.168124 0.168124 -Compute graph partition (SCOTCH) | 1 14.495072 14.495072 -Compute local part of mesh dual graph | 2 0.115831 0.231662 -Compute local-to-local map | 1 0.002189 0.002189 -Compute non-local part of mesh dual graph | 1 0.016460 0.016460 -Compute-local-to-global links for global/local adjacency list | 1 0.002287 0.002287 -Distribute AdjacencyList nodes to destination ranks | 1 0.018181 0.018181 -Distribute row-wise data (scalable) | 1 0.002546 0.002546 -Extract partition boundaries from SCOTCH graph | 1 0.003392 0.003392 -GPS: create_level_structure | 2 0.003632 0.018127 -Get SCOTCH graph data | 1 0.000017 0.000017 -Gibbs-Poole-Stockmeyer ordering | 1 0.033936 0.033936 -Init MPI | 1 0.058303 0.058303 -Init PETSc | 1 0.001518 0.001518 -Init dofmap from element dofmap | 2 0.007556 0.015111 +Build BoxMesh | 1 14.931521 14.931521 +Build dofmap data | 2 0.010982 0.021964 +Build sparsity | 1 0.017757 0.017757 +Compute connectivity 2-3 | 1 0.004427 0.004427 +Compute dof reordering map | 2 0.001235 0.002469 +Compute entities of dim = 2 | 1 0.149126 0.149126 +Compute graph partition (SCOTCH) | 1 14.555608 14.555608 +Compute local part of mesh dual graph | 2 0.112777 0.225554 +Compute local-to-local map | 1 0.001879 0.001879 +Compute non-local part of mesh dual graph | 1 0.019990 0.019990 +Compute-local-to-global links for global/local adjacency list | 1 0.002086 0.002086 +Distribute AdjacencyList nodes to destination ranks | 1 0.021412 0.021412 +Distribute row-wise data (scalable) | 1 0.002430 0.002430 +Extract partition boundaries from SCOTCH graph | 1 0.003130 0.003130 +GPS: create_level_structure | 2 0.002947 0.011788 +Get SCOTCH graph data | 1 0.000015 0.000015 +Gibbs-Poole-Stockmeyer ordering | 1 0.028818 0.028818 +Init MPI | 1 0.066945 0.066945 +Init PETSc | 1 0.001610 0.001610 +Init dofmap from element dofmap | 2 0.007240 0.014480 Init logging | 1 0.000053 0.000053 -PETSc Krylov solver | 1 0.738618 0.738618 -SCOTCH: call SCOTCH_dgraphBuild | 1 0.000498 0.000498 -SCOTCH: call SCOTCH_dgraphHalo | 1 0.033859 0.033859 -SCOTCH: call SCOTCH_dgraphPart | 1 14.470932 14.470932 -SparsityPattern::assemble | 1 0.039974 0.039974 -Topology: create | 1 0.084895 0.084895 -Topology: determine shared index ownership | 1 0.025590 0.025590 -Topology: determine vertex ownership groups (owned, undetermined, unowned) | 1 0.008552 0.008552 -ZZZ Assemble | 1 0.270851 0.270851 -ZZZ Assemble matrix | 1 0.118167 0.118167 -ZZZ Assemble vector | 1 0.023387 0.023387 -ZZZ Create Mesh | 1 14.883223 14.883223 -ZZZ Create RHS function | 1 0.059775 0.059775 -ZZZ Create boundary conditions | 1 0.005579 0.005579 -ZZZ Create facets and facet->cell connectivity | 1 0.173027 0.173027 -ZZZ FunctionSpace | 1 0.013257 0.013257 -ZZZ Solve | 1 0.742462 0.742462 +PETSc Krylov solver | 1 0.701243 0.701243 +SCOTCH: call SCOTCH_dgraphBuild | 1 0.000479 0.000479 +SCOTCH: call SCOTCH_dgraphHalo | 1 0.051595 0.051595 +SCOTCH: call SCOTCH_dgraphPart | 1 14.531644 14.531644 +SparsityPattern::assemble | 1 0.041170 0.041170 +Topology: create | 1 0.071883 0.071883 +Topology: determine shared index ownership | 1 0.016450 0.016450 +Topology: determine vertex ownership groups (owned, undetermined, unowned) | 1 0.008076 0.008076 +ZZZ Assemble | 1 0.256685 0.256685 +ZZZ Assemble matrix | 1 0.109380 0.109380 +ZZZ Assemble vector | 1 0.021687 0.021687 +ZZZ Create Mesh | 1 14.931795 14.931795 +ZZZ Create RHS function | 1 0.057765 0.057765 +ZZZ Create boundary conditions | 1 0.005327 0.005327 +ZZZ Create facets and facet->cell connectivity | 1 0.153631 0.153631 +ZZZ FunctionSpace | 1 0.012130 0.012130 +ZZZ Solve | 1 0.704855 0.704855 *** Number of Krylov iterations: 7 *** Solution norm: 36.6791 **************************************************************************************************************************************************************** @@ -3114,16 +3132,16 @@ ------------------------------------------------------------------ PETSc Performance Summary: ------------------------------------------------------------------ -/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/dolfinx-scaling-test on a named codethink11-arm64 with 3 processors, by pbuilder1 Mon May 13 18:10:51 2024 +/build/fenicsx-performance-tests-0.5.0~git20220731.821823b/obj-aarch64-linux-gnu/dolfinx-scaling-test on a named i-capture-the-hostname with 3 processors, by pbuilder2 Wed Apr 12 13:52:39 2023 Using Petsc Release Version 3.18.5, Feb 27, 2023 Max Max/Min Avg Total -Time (sec): 1.609e+01 1.000 1.609e+01 +Time (sec): 1.606e+01 1.000 1.606e+01 Objects: 3.200e+01 1.000 3.200e+01 -Flops: 3.482e+06 1.015 3.448e+06 1.034e+07 -Flops/sec: 2.164e+05 1.015 2.144e+05 6.431e+05 +Flops: 3.474e+06 1.018 3.448e+06 1.034e+07 +Flops/sec: 2.163e+05 1.018 2.147e+05 6.441e+05 MPI Msg Count: 2.800e+01 1.000 2.800e+01 8.400e+01 -MPI Msg Len (bytes): 2.200e+05 1.329 6.720e+03 5.645e+05 +MPI Msg Len (bytes): 1.983e+05 1.202 6.582e+03 5.529e+05 MPI Reductions: 6.700e+01 1.000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) @@ -3132,7 +3150,7 @@ Summary of Stages: ----- Time ------ ----- Flop ------ --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total Count %Total Avg %Total Count %Total - 0: Main Stage: 1.6085e+01 100.0% 1.0344e+07 100.0% 8.400e+01 100.0% 6.720e+03 100.0% 4.900e+01 73.1% + 0: Main Stage: 1.6062e+01 100.0% 1.0345e+07 100.0% 8.400e+01 100.0% 6.582e+03 100.0% 4.900e+01 73.1% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. @@ -3156,30 +3174,30 @@ --- Event Stage 0: Main Stage -BuildTwoSided 4 1.0 8.5823e-03 63.1 0.00e+00 0.0 1.8e+01 4.0e+00 4.0e+00 0 0 21 0 6 0 0 21 0 8 0 -BuildTwoSidedF 2 1.0 8.5708e-03 71.7 0.00e+00 0.0 1.2e+01 2.3e+04 2.0e+00 0 0 14 49 3 0 0 14 49 4 0 -MatMult 7 1.0 8.1164e-03 1.0 1.93e+06 1.0 5.4e+01 4.9e+03 1.0e+00 0 55 64 47 1 0 55 64 47 2 706 -MatConvert 1 1.0 4.5304e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 3 0 0 0 0 4 0 -MatAssemblyBegin 3 1.0 1.1944e-02 2.6 0.00e+00 0.0 1.2e+01 2.3e+04 2.0e+00 0 0 14 49 3 0 0 14 49 4 0 -MatAssemblyEnd 3 1.0 7.3015e-03 1.1 4.36e+03 1.4 0.0e+00 0.0e+00 6.0e+00 0 0 0 0 9 0 0 0 0 12 2 -MatGetRowIJ 2 1.0 2.9200e-06 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -MatView 1 1.0 5.7784e-04 4.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 1 0 0 0 0 2 0 -VecTDot 14 1.0 9.6158e-04 1.6 2.79e+05 1.0 0.0e+00 0.0e+00 1.4e+01 0 8 0 0 21 0 8 0 0 29 867 -VecNorm 8 1.0 1.1611e-03 3.1 1.59e+05 1.0 0.0e+00 0.0e+00 8.0e+00 0 5 0 0 12 0 5 0 0 16 411 -VecCopy 2 1.0 5.0060e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -VecSet 9 1.0 7.3220e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -VecAXPY 14 1.0 4.9428e-04 1.1 2.79e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 8 0 0 0 0 8 0 0 0 1688 -VecAYPX 6 1.0 2.7304e-04 1.2 1.20e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 3 0 0 0 0 3 0 0 0 1309 -VecScatterBegin 8 1.0 6.6070e-04 1.1 0.00e+00 0.0 7.2e+01 4.0e+03 2.0e+00 0 0 86 51 3 0 0 86 51 4 0 -VecScatterEnd 8 1.0 8.5056e-04 2.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFSetGraph 3 1.0 9.0800e-05 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFSetUp 2 1.0 2.9532e-04 1.2 0.00e+00 0.0 2.4e+01 1.0e+03 2.0e+00 0 0 29 4 3 0 0 29 4 4 0 -SFPack 8 1.0 1.4000e-04 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -SFUnpack 8 1.0 2.2560e-05 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -KSPSetUp 1 1.0 7.7140e-05 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 -KSPSolve 1 1.0 3.7419e-01 1.0 2.77e+06 1.0 5.4e+01 4.9e+03 2.3e+01 2 80 64 47 34 2 80 64 47 47 22 -PCSetUp 1 1.0 3.6346e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 2 0 0 0 3 2 0 0 0 4 0 -PCApply 8 1.0 3.6399e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 +BuildTwoSided 4 1.0 3.9698e-03 11.6 0.00e+00 0.0 1.8e+01 4.0e+00 4.0e+00 0 0 21 0 6 0 0 21 0 8 0 +BuildTwoSidedF 2 1.0 3.9605e-03 13.6 0.00e+00 0.0 1.2e+01 2.3e+04 2.0e+00 0 0 14 49 3 0 0 14 49 4 0 +MatMult 7 1.0 7.3467e-03 1.0 1.92e+06 1.0 5.4e+01 4.9e+03 1.0e+00 0 55 64 47 1 0 55 64 47 2 780 +MatConvert 1 1.0 4.0453e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 3 0 0 0 0 4 0 +MatAssemblyBegin 3 1.0 7.2165e-03 1.9 0.00e+00 0.0 1.2e+01 2.3e+04 2.0e+00 0 0 14 49 3 0 0 14 49 4 0 +MatAssemblyEnd 3 1.0 6.2907e-03 1.0 3.76e+03 1.2 0.0e+00 0.0e+00 6.0e+00 0 0 0 0 9 0 0 0 0 12 2 +MatGetRowIJ 2 1.0 2.8200e-06 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +MatView 1 1.0 5.6317e-04 3.5 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 1 0 0 0 0 2 0 +VecTDot 14 1.0 7.2855e-04 1.2 2.80e+05 1.0 0.0e+00 0.0e+00 1.4e+01 0 8 0 0 21 0 8 0 0 29 1145 +VecNorm 8 1.0 1.4742e-03 3.8 1.60e+05 1.0 0.0e+00 0.0e+00 8.0e+00 0 5 0 0 12 0 5 0 0 16 323 +VecCopy 2 1.0 5.0741e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +VecSet 9 1.0 7.5982e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +VecAXPY 14 1.0 5.0581e-04 1.1 2.80e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 8 0 0 0 0 8 0 0 0 1649 +VecAYPX 6 1.0 2.6706e-04 1.3 1.20e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 3 0 0 0 0 3 0 0 0 1339 +VecScatterBegin 8 1.0 6.6261e-04 1.1 0.00e+00 0.0 7.2e+01 3.9e+03 2.0e+00 0 0 86 51 3 0 0 86 51 4 0 +VecScatterEnd 8 1.0 7.5911e-04 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFSetGraph 3 1.0 1.0542e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFSetUp 2 1.0 3.3653e-04 1.2 0.00e+00 0.0 2.4e+01 1.0e+03 2.0e+00 0 0 29 4 3 0 0 29 4 4 0 +SFPack 8 1.0 1.0478e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +SFUnpack 8 1.0 2.0120e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +KSPSetUp 1 1.0 7.2721e-05 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 +KSPSolve 1 1.0 3.4998e-01 1.0 2.76e+06 1.0 5.4e+01 4.9e+03 2.3e+01 2 80 64 47 34 2 80 64 47 47 24 +PCSetUp 1 1.0 3.5032e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 2 0 0 0 3 2 0 0 0 4 0 +PCApply 8 1.0 3.4050e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 --- Event Stage 1: Unknown @@ -3201,9 +3219,9 @@ --- Event Stage 1: Unknown ======================================================================================================================== -Average time to get PetscTime(): 7.4e-08 -Average time for MPI_Barrier(): 3.26e-06 -Average time for zero size MPI_Send(): 5.16667e-06 +Average time to get PetscTime(): 8e-08 +Average time for MPI_Barrier(): 2.78e-06 +Average time for zero size MPI_Send(): 3.6e-06 #PETSc Option Table entries: --ndofs 30000 --problem_type poisson @@ -3259,7 +3277,7 @@ Option left: name:--ndofs value: 30000 Option left: name:--problem_type value: poisson Option left: name:--scaling_type value: strong -2024-05-13 18:10:51.318 ( 16.207s) [RANK: 0 ] loguru.cpp:526 INFO| atexit +2023-04-12 13:52:39.850 ( 16.189s) [RANK: 0 ] loguru.cpp:526 INFO| atexit make[1]: Leaving directory '/build/fenicsx-performance-tests-0.5.0~git20220731.821823b' create-stamp debian/debhelper-build-stamp dh_testroot -O--buildsystem=cmake -O--sourcedirectory=src @@ -3306,12 +3324,14 @@ dpkg-buildpackage: info: binary-only upload (no source included) dpkg-genchanges: info: including full source code in upload I: copying local configuration +I: user script /srv/workspace/pbuilder/25777/tmp/hooks/B01_cleanup starting +I: user script /srv/workspace/pbuilder/25777/tmp/hooks/B01_cleanup finished I: unmounting dev/ptmx filesystem I: unmounting dev/pts filesystem I: unmounting dev/shm filesystem I: unmounting proc filesystem I: unmounting sys filesystem I: cleaning the build env -I: removing directory /srv/workspace/pbuilder/29246 and its subdirectories -I: Current time: Mon May 13 18:11:10 -12 2024 -I: pbuilder-time-stamp: 1715667070 +I: removing directory /srv/workspace/pbuilder/25777 and its subdirectories +I: Current time: Wed Apr 12 13:52:58 +14 2023 +I: pbuilder-time-stamp: 1681257178