| ce496241 | 06-Jan-2021 |
Stefano Zampini <stefano.zampini@gmail.com> |
MATAIJ: fix some more direct usage of values array
Remove unused MatDistribute_MPIAIJ code |
| 33c9ba73 | 06-Jan-2021 |
Stefano Zampini <stefano.zampini@gmail.com> |
MATSEQAIJCUSPARSE: add MatScale support |
| dfa0f9e5 | 05-Jan-2021 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatScale_SeqAIJ: use accessor routines
Reported-by: Jose Roman <jroman@dsic.upv.es> |
| 6f3d89d0 | 21-Dec-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatCreate: When PETSc is configured with device, set boundtocpu to true at creation time |
| 930e68a5 | 12-Jan-2021 |
Mark Adams <524115-markadams4@users.noreply.gitlab.com> |
Adams/landau cleanup Added mass matrix construction to GPU kernel to avoid problem with sparser mass matrix than Jacobian, messing up MatAXPY, rarely but unpredictably. Added Kokkos solver stubs to work with GPU offloaded matrices. Fixed up ex2 for paper Added runex2_[kokkos|cuda] targets to Landau makefile for paper
|
| a952e0d5 | 29-Dec-2020 |
Barry Smith <bsmith@mcs.anl.gov> |
Report resource errors from CUDA, cuBLAS, and cuSparse as PETSC_ERR_GPU_RESOURCE so test harness can retry them.
Convert the error code to a GPU resource error code so that the test harness will rerun the example multiple times.
This should decrease the number of false job failures for CUDA runs
/spend 1.5h
|
| 00702c57 | 29-Dec-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MATSEQAIJCUSPARSE: add PetscInfo diagnostics for buffer size in MatMat operations |
| c215019a | 28-Dec-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatMatBackend: use device memory when populating the COO buffers |
| f0cf5187 | 28-Dec-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MATSEQAIJKOKKOS: fix AXPY, add GetArray support |
| 039c6fba | 27-Dec-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatAXPY: add support for CUSPARSE |
| 41f5e1b1 | 27-Dec-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatAXPY_SeqAIJ: log flops |
| d9d719b4 | 27-Dec-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatAXPY_MPIAIJ: use local mataxpy when SAME_NONZERO_PATTERN is specified |
| cd7f47a8 | 14-Dec-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatMKLPARDISO: factorize schur complement in driver routine |
| abb89eb1 | 05-Dec-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatMatMPIAIJBACKEND: add a few more options
do not copy CPU values down to GPU if not present! |
| 2ed87e7e | 05-Dec-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatGetLocalMatMerge: fix issues with thrust on SUMMIT
I suspect partition_copy is buggy |
| bfcc3627 | 23-Nov-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
Mat: Move COO events out of CUSPARSE class
Fix a few typos in the code |
| 8909a122 | 20-Nov-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatSeqAIJCUSPARSEMergeMats: fix for CUDA < 10 |
| ec446438 | 19-Nov-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
Apply 7 suggestion(s) to 3 file(s) |
| cec0a6c6 | 19-Nov-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatAIJMKL: enable parallel PtaP
Wondering if it ever worked? |
| 4e84afc0 | 19-Nov-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatMPIAIJ: move generic code for MatMat product to base class |
| 6a29ce69 | 19-Nov-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatMPIAIJSetPreallocation_ : minor fixes for cusparse and kokkos |
| 11c5b395 | 17-Nov-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MATSEQAIJCUSPARSE: remove copy to the CPU after GPU operations |
| e589036e | 16-Nov-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatAIJCUSPARSESetGenerateTranspose: convenience function for seq and mpi |
| 7eaca502 | 12-Nov-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
PetscStackCallThrust |
| 57181aed | 16-Nov-2020 |
Stefano Zampini <stefano.zampini@gmail.com> |
MATSEQAIJCUSPARSE: factors need to update the CPU memory |