| #
3d9668e3
|
| 10-May-2023 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
Add MatDenseCUPMSetPreallocation
|
| #
78aafc55
|
| 11-May-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
e1d39e92
|
| 10-May-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/fix-spacing' into 'release'
Docs: fix extra spacing
See merge request petsc/petsc!6443
|
| #
9c443813
|
| 10-May-2023 |
Pierre Jolivet <pierre@joliv.et> |
Docs: fix extra spacing
|
| #
3ec5d0d2
|
| 08-May-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
4d5ef84c
|
| 07-May-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jose/release/fix-man-pages' into 'release'
Fix several manual pages
See merge request petsc/petsc!6423
|
| #
e1c5b2e2
|
| 06-May-2023 |
Jose E. Roman <jroman@dsic.upv.es> |
Fix manpage of MatSetValue()
|
| #
592ea0f6
|
| 02-May-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'adams/mat-remove-matsetvaluesdevice' into 'main'
remove MatSetValuesDevice
See merge request petsc/petsc!6378
|
| #
4c55d725
|
| 02-May-2023 |
Mark Adams <524115-markadams4@users.noreply.gitlab.com> |
remove MatSetValuesDevice
|
| #
81cd56f8
|
| 25-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
e42225a5
|
| 23-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'prj--release-patch-92401' into 'release'
Update petscmat.h
See merge request petsc/petsc!6362
|
| #
eb495b2a
|
| 23-Apr-2023 |
Pierre Jolivet <pierre@joliv.et> |
Update petscmat.h
|
| #
a663c1df
|
| 13-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
517f05f0
|
| 13-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2023-03-30/fix-include-man-pages/release' into 'release'
Fix manual pages in include files
See merge request petsc/petsc!6272
|
| #
16a05f60
|
| 13-Apr-2023 |
Barry Smith <bsmith@mcs.anl.gov> |
Fix manual pages in include files
|
| #
dd874c20
|
| 10-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'hongzh/sell-cuda' into 'main'
SELL-based SpMV
See merge request petsc/petsc!3428
|
| #
b921024e
|
| 06-Apr-2021 |
Hong Zhang <hongzhang@anl.gov> |
Add MatSeqSELLGetAvgSliceSize
It returns the variance of the slice sizes.
|
| #
07e43b41
|
| 10-Sep-2020 |
Hong Zhang <hongzhang@anl.gov> |
Further optimization of MatMult_SeqSELLCUDA
- Add more kernels
- Use multiple threads per row for matrices with narrow slices
- Use multiple blocks per slice for matrices with wide slices
- Add three new APIs to return the irregularity ratio, the maximum slice width and the average slice width
Experiments show that column blocking gives much worse performance for wide matrices and that permutation based on slice width has almost no impact on performance.
|
| #
2d1451d4
|
| 09-Jan-2020 |
Hong Zhang <hongzhang@anl.gov> |
Initial commit for porting SELL to GPU
- Add tiled SpMV and basic SpMV for SeqSELL
- Tested in serial
- Offloadmask is used to determine when the matrix should be copied to GPU
- Use different slice height for CUDA version
- By checking the nonzerostate, PETSc can decide whether the whole matrix needs to be copied or just the values need to be copied
- Make the convert function public so that the very slow MatConvert_Basic can sometimes be avoided. E.g. one can use a two-step convert method (AIJ->SELL, SELL->SELLCUDA) instead of the direct convert AIJ->SELLCUDA
- Make the FLOPS count for SELL the same as that for AIJCUSPARSE
- MatDisAssemble is not needed
- Change slice height from 32 to 16 for GPU
- To overlap communication with MatMult, VecScatterBegin() should be called before MatMult() for the diagonal part
- SLICE_HEIGHT is defined to be 32 to match the warp size of GPU. For other cases, it is still 8.
Funded-by: Project: PETSc for GPU Time: 42 hours Reported-by: Thanks-to:
|
| #
df649aa6
|
| 08-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/remove-sparseelemental' into 'main'
Remove MATSOLVERSPARSELEMENTAL
See merge request petsc/petsc!6296
|
| #
98446d56
|
| 08-Apr-2023 |
Pierre Jolivet <pierre@joliv.et> |
Remove MATSOLVERSPARSELEMENTAL
-pc_factor_mat_solver_type sparseelemental causes segmentation faults, has been unsupported for many years, and there is no interest in anyone supporting or developing the code
|
| #
239bd9ba
|
| 08-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
8e10d8cf
|
| 08-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jolivet/fix-Wzero-as-null-pointer-constant-ci' into 'release'
Fix -Wzero-as-null-pointer-constant
See merge request petsc/petsc!6294
|
| #
f22e26b7
|
| 06-Apr-2023 |
Pierre Jolivet <pierre@joliv.et> |
Fix -Wzero-as-null-pointer-constant + El warnings
|
| #
1f7c7380
|
| 05-Apr-2023 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'hsuh/mat-create-dense-from-vectype' into 'main'
Add VecCreateMatDense(): Create a matrix that matches the type of a Vec
See merge request petsc/petsc!6237
|