| #
bfc9018b
|
| 02-Jul-2024 |
Junchao Zhang <jczhang@anl.gov> |
Vec: simplify VecDuplicate_MPI_DA
|
| #
3fbf2391
|
| 30-Jul-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'origin/release' with a revert of !7720
|
| #
a847dfd4
|
| 29-Jul-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2024-07-29/minor-error-msg-fix' into 'release'
Minor PETSC_ERR_ARG_OUTOFRANGE error message fix
See merge request petsc/petsc!7725
|
| #
86f116e9
|
| 29-Jul-2024 |
Junchao Zhang <jczhang@anl.gov> |
Vec: clarify the error message in PETSC_ERR_ARG_OUTOFRANGE
|
| #
184b5a2e
|
| 26-Jul-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2023-09-15/fix-log-pcmpi' into 'main'
add optimized shared memory PCMPI and add users manual section on comparing STREAMS performance to PETSc application performance
See merge
Merge branch 'barry/2023-09-15/fix-log-pcmpi' into 'main'
add optimized shared memory PCMPI and add users manual section on comparing STREAMS performance to PETSc application performance
See merge request petsc/petsc!6903
show more ...
|
| #
9f0612e4
|
| 11-Oct-2023 |
Barry Smith <bsmith@mcs.anl.gov> |
Add shared memory support for PCMPI using both shmget() and MPI_Win_allocate_shared()
|
| #
14e7b07e
|
| 21-Jun-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
8d41f8fb
|
| 19-Jun-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2024-06-18/fix-veccupm-resetarray-bug' into 'release'
Fix a VECCUPM VecResetArray bug
See merge request petsc/petsc!7635
|
| #
b7f76605
|
| 18-Jun-2024 |
Junchao Zhang <jczhang@anl.gov> |
Vec: use VecGetArrayWrite for dest. vector in VecCopy
|
| #
a623e290
|
| 12-Jun-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2024-05-29/minor-fortran-stub-cleanup' into 'main'
More improvements to Fortran stubs and interfaces
See merge request petsc/petsc!7598
|
| #
5d83a8b1
|
| 30-May-2024 |
Barry Smith <bsmith@mcs.anl.gov> |
This requires some changes to user FORTRAN code
Pass PETSC_NULL_XXX_ARRAY when passing a NULL as an argument that is expecting an array Pass PETSC_NULL_ENUM when argument returns an enum instead of
This requires some changes to user FORTRAN code
Pass PETSC_NULL_XXX_ARRAY when passing a NULL as an argument that is expecting an array Pass PETSC_NULL_ENUM when argument returns an enum instead of PETSC_NULL_INTEGER Pass arrays (and not scalar values) when the argument is expecting an array; this means replace, for example, the argument v with [v] Use PetscObjectIsNull(obj) to check if the object is NULL, instead of obj == PETSC_NULL_XXX
The compiler will now automatically prevent you from using the wrong argument type for the first three bullets above This will also require an update sowing with the new sowing branch this MR is using.
show more ...
|
| #
3d77ad52
|
| 14-May-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
b3ede365
|
| 10-May-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2024-05-07/fix-vecduplicatevecs-gemv-setting' into 'release'
Fix GEMV setting in VecDuplicateVecs for VecKokkos
See merge request petsc/petsc!7531
|
| #
1ba3db34
|
| 10-May-2024 |
Junchao Zhang <jczhang@anl.gov> |
Vec: use smaller alignment in VecDuplicateVecs GEMV
|
| #
970231d2
|
| 07-Mar-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/clang-format-18' into 'main'
clang-format version 18
See merge request petsc/petsc!6902
|
| #
f4f49eea
|
| 11-Feb-2024 |
Pierre Jolivet <pierre@joliv.et> |
Remove multiple parentheses and extra semicolon
git grep -l -E "\(\([a-zA-Z0-9_]+\)\(\*[a-zA-Z0-9_]+\)[, )]" | xargs sed -r -i'' 's#\(\(([a-zA-Z0-9_]+)\)\(\*([a-zA-Z0-9_]+)\)([, )])#\(\(\1\)\*\2\3#g
Remove multiple parentheses and extra semicolon
git grep -l -E "\(\([a-zA-Z0-9_]+\)\(\*[a-zA-Z0-9_]+\)[, )]" | xargs sed -r -i'' 's#\(\(([a-zA-Z0-9_]+)\)\(\*([a-zA-Z0-9_]+)\)([, )])#\(\(\1\)\*\2\3#g' git grep -l -E "Petsc(.)*\(\(\*[a-zA-Z0-9_]*\)," | xargs sed -r -i'' 's#Petsc([a-zA-Z0-9_]*)\(\(\*([a-zA-Z0-9_]*)\), #Petsc\1(*\2, #g' git grep -l -E "([\!\&\~\*\(]|\)\)|\([^,\*\(]+\**\))\(([a-zA-Z0-9_]+((\.|->)[a-zA-Z0-9_]+|\[[a-zA-Z0-9_ \%\+\*\-]+\])+)\)" | xargs sed -r -i'' 's#([\!\&\~\*\(]|\)\)|\([^,\*\(]+\**\))\(([a-zA-Z0-9_]+((\.|->)[a-zA-Z0-9_]+|\[[a-zA-Z0-9_ \%\+\*\-]+\])+)\)#\1\2#g'
show more ...
|
| #
729ad44d
|
| 29-Jan-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/2024-01-29/fix-err-msg-vecreplacearray' into 'main'
Add better error message in the unsupported VecReplaceArray() after VecDuplicateVecs
Closes #1537
See merge request petsc/
Merge branch 'jczhang/2024-01-29/fix-err-msg-vecreplacearray' into 'main'
Add better error message in the unsupported VecReplaceArray() after VecDuplicateVecs
Closes #1537
See merge request petsc/petsc!7245
show more ...
|
| #
2abc3324
|
| 29-Jan-2024 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Vec: add better error message in the unsupported VecReplaceArray() on the first Vec got from VecDuplicateVecs()
close #1537
|
| #
8f397a5c
|
| 14-Jan-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/remove-deadcode' into 'main'
Remove unused Fortran kernels
See merge request petsc/petsc!7192
|
| #
f6fd6296
|
| 14-Jan-2024 |
Pierre Jolivet <pierre@joliv.et> |
Remove unused Fortran kernels
|
| #
942d34d0
|
| 03-Jan-2024 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/fix-ubsan' into 'main'
Pacify UBSan
See merge request petsc/petsc!7158
|
| #
8e3a54c0
|
| 24-Dec-2023 |
Pierre Jolivet <pierre@joliv.et> |
Sys: add PetscSafePointerPlusOffset() to pacify UBSan
|
| #
314ab5fd
|
| 22-Dec-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2023-06-07/optimize-multivecs-zhang' into 'main'
Optimize VecMDot_Seq as suggested by Junchao Zhang using BLAS 2 gemv
See merge request petsc/petsc!6580
|
| #
b29a8671
|
| 19-Dec-2023 |
Junchao Zhang <jczhang@anl.gov> |
Vec: add GEMV optimizations for VecMDot and friends for VecStandard
Remove KSPPIPEFGMRES from example with skip convergence test since very sensitive to happy ending
Appears to have a sweet spot of
Vec: add GEMV optimizations for VecMDot and friends for VecStandard
Remove KSPPIPEFGMRES from example with skip convergence test since very sensitive to happy ending
Appears to have a sweet spot of much better performance for smallish vectors then matches unrolled code for large vectors
Sample results on Barry's Apple M2 Laptop (using Apple's BLAS)
./ex19 -da_refine 5 -pc_type none -log_view -ksp_gmres_preallocate -ksp_view
Vector length 37,636
VecMDot 1920 1.0 1.9707e-01 1.0 2.23e+09 1.0 0.0e+00 0.0e+00 0.0e+00 25 29 0 0 0 25 29 0 0 0 11291
-vec_mdot_use_gemv
VecMDot 1920 1.0 7.5098e-02 1.0 2.23e+09 1.0 0.0e+00 0.0e+00 0.0e+00 12 29 0 0 0 12 29 0 0 0 29693 VecMDot 1920 1.0 8.1523e-02 1.0 2.23e+09 1.0 0.0e+00 0.0e+00 0.0e+00 12 29 0 0 0 12 29 0 0 0 27353 VecMDot 1920 1.0 7.0889e-02 1.0 2.23e+09 1.0 0.0e+00 0.0e+00 0.0e+00 11 29 0 0 0 11 29 0 0 0 31456
-da_refine 6
Vector length 148,996
VecMDot 4340 1.0 1.7666e+00 1.0 2.00e+10 1.0 0.0e+00 0.0e+00 0.0e+00 20 29 0 0 0 20 29 0 0 0 11319
-vec_mdot_use_gemv
VecMDot 4422 1.0 1.3725e+00 1.0 2.04e+10 1.0 0.0e+00 0.0e+00 0.0e+00 15 29 0 0 0 15 29 0 0 0 14884 VecMDot 4422 1.0 1.4354e+00 1.0 2.04e+10 1.0 0.0e+00 0.0e+00 0.0e+00 16 29 0 0 0 16 29 0 0 0 14231
./ex19 -da_refine 7 -pc_type none -log_view -ksp_gmres_preallocate -ksp_view -vec_mdot_use_gemv -ksp_max_it 100 -snes_max_it 1
Vector length 592,900
VecMDot 100 1.0 1.5915e-01 1.0 1.72e+09 1.0 0.0e+00 0.0e+00 0.0e+00 14 27 0 0 0 14 27 0 0 0 10804
-vec_mdot_use_gemv
VecMDot 100 1.0 1.6854e-01 1.0 1.72e+09 1.0 0.0e+00 0.0e+00 0.0e+00 14 27 0 0 0 14 27 0 0 0 10230 VecMDot 100 1.0 1.5698e-01 1.0 1.72e+09 1.0 0.0e+00 0.0e+00 0.0e+00 14 27 0 0 0 14 27 0 0 0 10983
-da_refine 8
vector length 2,365,444
VecMDot 100 1.0 6.2499e-01 1.0 6.86e+09 1.0 0.0e+00 0.0e+00 0.0e+00 13 27 0 0 0 13 27 0 0 0 10976
-vec_mdot_use_gemv
VecMDot 100 1.0 6.8197e-01 1.0 6.88e+09 1.0 0.0e+00 0.0e+00 0.0e+00 14 27 0 0 0 14 27 0 0 0 10087
show more ...
|
| #
9c14a7f3
|
| 26-Oct-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'Fande-Kong/vecghost_gtl' into 'main'
Add VecGhostGetGhostIS to get the ghost indices of a ghosted vector
See merge request petsc/petsc!5016
|