| #
e98e3f39
|
| 29-Mar-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'adams/cusparse-lu-band' into 'main'
Add cuSparse Band LU factorization
See merge request petsc/petsc!3713
|
| #
bddcd29d
|
| 29-Mar-2021 |
Mark Adams <524115-markadams4@users.noreply.gitlab.com> |
Add cuSparse Band LU factorization
|
| #
72f6e323
|
| 29-Mar-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'jczhang/fix-kokkos-memspace' into 'main'
Kokkos: do not redeclare functions if device is host
See merge request petsc/petsc!3774
|
| #
15166ebc
|
| 27-Mar-2021 |
Junchao Zhang <jczhang@mcs.anl.gov> |
LandauKokkos: fix prototype
|
| #
4a5a4fb6
|
| 12-Jan-2021 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'adams/landau-cleanup' into 'master'
Adams/landau cleanup
See merge request petsc/petsc!3542
|
| #
930e68a5
|
| 12-Jan-2021 |
Mark Adams <524115-markadams4@users.noreply.gitlab.com> |
Adams/landau cleanup
Added mass matrix construction to GPU kernel to avoid problem with sparser mass matrix than Jacobian, messing up MatAXPY, rarely but unpredictably.
Added Kokkos solver stubs to work with GPU offloaded matrices.
Fixed up ex2 for paper
Added runex2_[kokkos|cuda] targets to Landau makefile for paper
|
| #
8ff777de
|
| 23-Dec-2020 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'adams/landau-gpu-assembly' into 'master'
Adams/landau gpu assembly
See merge request petsc/petsc!3423
|
| #
a587d139
|
| 23-Dec-2020 |
Mark <cal2princeton@yahoo.com> |
Adding Cuda and Kokkos assembly. Added Device assembly to Landau operator. Added Kokkos test mat/ex5k.
|
| #
47f0c0cd
|
| 17-Nov-2020 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'adams/landau-cuda-opt' into 'master'
Adams/landau cuda opt
See merge request petsc/petsc!3418
|
| #
52cdd6ea
|
| 17-Nov-2020 |
Mark <cal2princeton@yahoo.com> |
Nvidia optimizations
integrating new Cuda kernels
merged Peng's optimizations and cleanup (has debugging code)
wip, adding SOA
|
| #
53d71bcd
|
| 26-Aug-2020 |
Satish Balay <balay@mcs.anl.gov> |
Merge branch 'adams/feature-dmplex-snes-landau-kokkos-barry-rebased' into 'master'
Adams/feature dmplex snes landau kokkos barry rebased
See merge request petsc/petsc!3050
|
| #
e0eea495
|
| 26-Aug-2020 |
Mark <cal2princeton@yahoo.com> |
Add a Landau collision operator, based on DMPlex and PetscFE, that uses p4est. It uses the new Kokkos interface and has a separate Cuda implementation. This could be deployed as 3rd party library, but this is easier to deploy to ECP, among other apps.
|