| #
29882ab9
|
| 04-Jan-2022 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jed/thrust-nosync' into 'main'
thrust: upgrade to 1.16 prerelease for thrust::cuda::par_nosync in vector operations
See merge request petsc/petsc!4676
|
| #
a2cee5fe
|
| 26-Dec-2021 |
Jed Brown <jed@jedbrown.org> |
thrust: upgrade to 1.16 prerelease for async on streams
Headers in this version are less promiscuous so we need more includes.
|
| #
ba70aee8
|
| 10-Dec-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'adams/fix_mat_ex5k' into 'main'
GPU assembly bug fixes. Unified kokkos and CUDA GPU assembly tests to some degree, off diagonal block...
See merge request petsc/petsc!4609
|
| #
49b994a9
|
| 08-Dec-2021 |
Mark Adams <524115-markadams4@users.noreply.gitlab.com> |
fix parrel GPU assembly for Kokkos
|
| #
b9968e31
|
| 18-Nov-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'adams/landau-cuda-opt' into 'main'
Landau optimizations + GPU assembly bug fix
See merge request petsc/petsc!4358
|
| #
365b711f
|
| 25-Sep-2021 |
Mark Adams <ma2325@columbia.edu> |
add hook for CPU cusparse solves
|
| #
99551766
|
| 28-Sep-2021 |
Mark Adams <524115-markadams4@users.noreply.gitlab.com> |
fix 64 bit integer bug with GPU assembly
|
| #
6cb41cfa
|
| 01-Nov-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'rmills/feature-dm-bind-below' into 'main'
Rmills/feature dm bind below
See merge request petsc/petsc!3455
|
| #
56258e06
|
| 06-Oct-2020 |
Richard Tran Mills <rmills@rmills.org> |
In MatMult_MPIAIJCUSPARSE(), if the matrix is bound to the CPU, bind the local vector used for matrix multiplication as well.
|
| #
0baf1215
|
| 20-Oct-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jacobf/2021-09-04/petscdevice-cupminitialization' into 'main'
Replace current CUPM initialization with unified PetscDevice version
See merge request petsc/petsc!4310
|
| #
a4af0cee
|
| 19-Oct-2021 |
Jacob Faibussowitsch <jacob.fai@gmail.com> |
- Replace the current CUPM initialization infrastructure with PetscDevice. - Remove -petsc_use_default_null_stream and -[cuda|hip]_synchronize options - Remove PetscCUDASynchronize and PetscHIPSynchr
- Replace the current CUPM initialization infrastructure with PetscDevice. - Remove -petsc_use_default_null_stream and -[cuda|hip]_synchronize options - Remove PetscCUDASynchronize and PetscHIPSynchronize - Remove PetscCUDAInitialize(), PetscCUDAInitializeCheck(), PetscHIPInitialize(), and PetscHIPInitializeCheck() - Remove PetscCUBLASInitializeHandle(), PetscCUSOLVERDnInitializeHandle(), PetscHIPBLASInitializeHandle(), and PetscHIPSOLVERInitializeHandle(). - Remove petsc_gputimer_begin and petsc_gputimer_begin - Add -device_enable, -device_select and -device_view options - Replace -[cuda|hip]_device with split options -device_enable_[cuda|hip] and -device_select_[cuda|hip] - Replace -[cuda|hip]_view with -device_view_[cuda|hip] - Add PetscDeviceInitType - Add PetscDeviceInitialize() and PetscDeviceInitialized() - Add PetscDeviceView()
show more ...
|
| #
5ce27ff0
|
| 11-Sep-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'stefanozampini/hypre-gpu' into 'main'
Interface HYPRE GPU solvers
See merge request petsc/petsc!4248
|
| #
ae48a8d0
|
| 30-Aug-2021 |
Stefano Zampini <stefano.zampini@gmail.com> |
MATHYPRE: register convertors from CUSPARSE
|
| #
e3c94c14
|
| 18-Aug-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/feature-add-comments-matproduct' into 'main'
Mat/SF: add some comments to code
See merge request petsc/petsc!4231
|
| #
96344195
|
| 13-Aug-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/release'
|
| #
ddea5d60
|
| 12-Aug-2021 |
Junchao Zhang <jczhang@mcs.anl.gov> |
MAT/SF: add some comments to code
|
| #
662169a7
|
| 09-Aug-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/feature-tune-garray-malloc' into 'main'
Mat: remove the unneeded +1 in memory allocation for garray
See merge request petsc/petsc!4214
|
| #
efc51567
|
| 09-Aug-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/fix-matmpiaijcusparse-manual/release' into 'release'
Doc: fix MATMPIAIJCUSPARSE manual
See merge request petsc/petsc!4215
|
| #
6bb69460
|
| 05-Aug-2021 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Doc: fix MATMPIAIJCUSPARSE manual
Reported-by: Victor Eijkhout <eijkhout@tacc.utexas.edu>
|
| #
b3c64f9d
|
| 04-Aug-2021 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Mat: remove the unneeded +1 in memory allocation for garray
|
| #
10f1504d
|
| 10-Jun-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'barry/2020-11-11/cleanup-matsetvaluesdevice' into 'main'
MatSetValuesDevice: Cleanup and simplify code, including example
See merge request petsc/petsc!3411
|
| #
042217e8
|
| 10-Jun-2021 |
Barry Smith <bsmith@mcs.anl.gov> |
MatSetValuesDevice: Cleanup and simplify code, including example
User reported crash of example code. Kernel was passed an ierr that lived in CPU memory
MatSetValuesDevice: do not include private h
MatSetValuesDevice: Cleanup and simplify code, including example
User reported crash of example code. Kernel was passed an ierr that lived in CPU memory
MatSetValuesDevice: do not include private headers from public headers
Feature: MatSetValuesDevice determines automatically from the context (where it is included from) if it is being used from C, CUDA, or Kokkos, PETSC_DEVICE_FUNC_DEC no longer needs to be set before including petscaijdevice.h
Feature: MatSetValuesDevice() now ignores all values outside the global column range.
PetscSplitCSRDataStructure is now a pointer, not a struct, like most PETSc objects, please leave it that way.
Fix all uses of CTABLE that were related to the original MatSetValuesDevice()
Have atomicAdd use Kokkos atomic-add with CPU build when building with Kokkos.
Cuda should now work with --download-openmpi, this is done by updating updateCompilers() to rerun portions of packages/cuda.py after the compilers are reset to use MPI wrappers. This is needed because the resetting of the compilers removes all the compiler flags and packages/cuda.py sets certain values into these flags that was previously lost.
Add MPICXX_INCLUDES, MPICXX_LIBS to fix compile targets for Kokkos examples
'make check' now runs properly for Kokkos test of src/snes/ex3k, fixed bug in the makefile wrt MPI_IS_MPIUNI check
Testing makefile rules: add ex*cu binaries to clean rule
Reported-by: Sam Fagbemi <samkorede24@gmail.com> Thanks-to: Stefano Zampini <stefano.zampini@gmail.com> Thanks-to: Mark Adams <mfadams@lbl.gov>
/spend 16h
show more ...
|
| #
0e5d4aae
|
| 20-May-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/jczhang/fix-gcc4-complex'
Disable petsccomplexfix.h with C++14 and GNU-4
See merge request petsc/petsc!3989
|
| #
81cc8ef4
|
| 19-May-2021 |
Junchao Zhang <jczhang@mcs.anl.gov> |
Remove use of PETSC_SKIP_CXX_COMPLEX_FIX in petsc source code
|
| #
7d01355a
|
| 05-May-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'hongzh/add-cuda-event-timer' into 'main'
Add CUDA and HIP event timers
See merge request petsc/petsc!3852
|