History log of /petsc/src/mat/impls/aij/seq/seqcusparse/aijcusparse.cu (Results 326 – 350 of 685)
Revision Date Author Comments
# 22f8e9db 18-Oct-2020 Satish Balay <balay@mcs.anl.gov>

Merge branch 'adams/gamg-reduce-opt-cuda' into 'master'

Adams/gamg reduce opt cuda

See merge request petsc/petsc!3339


# c9567895 18-Oct-2020 Mark <cal2princeton@yahoo.com>

Reducing first coarse grid to one process/device in Cuda and manual override of this and other heuristics for process reduction in GAMG coarse grids.


# efba6e65 10-Oct-2020 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/release' into master


# d1debf72 09-Oct-2020 Satish Balay <balay@mcs.anl.gov>

Merge branch 'stefanozampini/fix-gamg-transposegencuda' into 'release'

Fix CUDA performance regression and KOKKOS bugfixes

See merge request petsc/petsc!3310


# d98d7c49 07-Oct-2020 Stefano Zampini <stefano.zampini@gmail.com>

MATMPIAIJCUSPARSE: fix conversions


# c58ef05e 06-Oct-2020 Stefano Zampini <stefano.zampini@gmail.com>

MatInode: set it to false when using device matrices


# 12c380df 28-Sep-2020 Satish Balay <balay@mcs.anl.gov>

Merge branch 'adams/feature-mat-cuda' into 'master'

Adams/feature mat cuda

See merge request petsc/petsc!3137


# 3fa6b06a 02-Sep-2020 Mark Adams <mfadams@lbl.gov>

add cuda matrix for meta data method, assembly


# 95d17da4 28-Sep-2020 Satish Balay <balay@mcs.anl.gov>

Merge branch 'barry/2020-09-25/do-not-log-gpu-transfer-time' into 'master'

Do not log transfer time as GPU time. GPU time is used to get performance ON...

See merge request petsc/petsc!3247


# ae74c241 26-Sep-2020 Barry Smith <bsmith@mcs.anl.gov>

Do not log transfer time as GPU time. GPU time is used to get performance ON the GPU (for example flop rates)
logging transfer times distorts these numbers since they would include the time to copy d

Do not log transfer time as GPU time. GPU time is used to get performance ON the GPU (for example flop rates)
logging transfer times distorts these numbers since they would include the time to copy down the matrix.
The transfer times are already logged in the copy to and from events and the performance flop rates including the
copy times are already logged in the overall flop rate.

Commit-type: bug-fix, style-fix
/spend 5m

show more ...


# 6f7728a9 22-Sep-2020 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/support-cuda-11' into 'master'

Add support of CUDA-11

See merge request petsc/petsc!3207


# afb2bd1c 17-Sep-2020 Junchao Zhang <jczhang@mcs.anl.gov>

Add support of CUDA-11


# fd7c363c 17-Sep-2020 Satish Balay <balay@mcs.anl.gov>

Revert "CUDA: Get PETSc to compile with CUDA 11."

This reverts commit 21a6055d1ede0202436f221d06f48a7b9836052d.


# cf5f8798 09-Sep-2020 Satish Balay <balay@mcs.anl.gov>

Merge branch 'karlrupp/fix-cuda-11-compilation' into 'master'

CUDA: Get PETSc to compile with CUDA 11.

See merge request petsc/petsc!3154


# 21a6055d 08-Sep-2020 Karl Rupp <me@karlrupp.net>

CUDA: Get PETSc to compile with CUDA 11.

CUSPARSE has changed significantly in CUDA 11.
This has lead to a lot of broken code.
This commit disables the AIJCUSPARSE class so that PETSc at least compi

CUDA: Get PETSc to compile with CUDA 11.

CUSPARSE has changed significantly in CUDA 11.
This has lead to a lot of broken code.
This commit disables the AIJCUSPARSE class so that PETSc at least compiles with other GPU features enabled.

show more ...


# fd2eec0d 23-Aug-2020 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/add-kokkos-init' into 'master'

Add Kokkos initialization

See merge request petsc/petsc!3048


# 05035670 19-Aug-2020 Junchao Zhang <jczhang@mcs.anl.gov>

Add support of CUDA/HIP initialization at the same time


# 721b214e 02-Aug-2020 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/feature-delay-cuda-init' into 'master'

Do not init cuda if a code does not create any petsc cuda objects

See merge request petsc/petsc!2987


# cae85d06 21-Jul-2020 Junchao Zhang <jczhang@mcs.anl.gov>

Do lazy petsc cuda initialization and validation

So that we can avoid overburdening the GPU in CI.
See more at https://gitlab.com/petsc/petsc/-/issues/360#note_373723821


# dadd21d5 04-Jul-2020 Satish Balay <balay@mcs.anl.gov>

Merge branch 'barry/2020-06-23/pcfactor-no-ordering-external' into 'master'

Most external factorization packages do not use the ordering provided by...

See merge request petsc/petsc!2913


# 2c7c0729 24-Jun-2020 Barry Smith <bsmith@mcs.anl.gov>

Most external factorization packages do not use the ordering provided by PETSc, therefor only provide it when needed.

Adds MatOrderingType external to indicate not to generate an ordering and use wh

Most external factorization packages do not use the ordering provided by PETSc, therefor only provide it when needed.

Adds MatOrderingType external to indicate not to generate an ordering and use what the package needs
Now -pc_view will not print the wrong PETSc ordering when the ordering is done externally

Saves some memory and compute time.

Commit-type: optimization
Time: 1.2 hours
Reported-by: Junchao Zhang <junchao.zhang@gmail.com>
Thanks-to: Stefano Zampini <stefano.zampini@gmail.com>

show more ...


# 7f3f1200 01-Jul-2020 Satish Balay <balay@mcs.anl.gov>

Merge branch 'stefanozampini/feature-cusparse-notransgen-herm' into 'master'

feature cusparse notransgen herm

See merge request petsc/petsc!2915


# 54da937a 28-Jun-2020 Stefano Zampini <stefano.zampini@gmail.com>

MATSEQAIJCUSPARSE: test multiple transpose generation flags


# 85ba7357 28-Jun-2020 Stefano Zampini <stefano.zampini@gmail.com>

MATSEQAIJCUSPARSE: log transpose generation


# 51c6d536 27-Jun-2020 Stefano Zampini <stefano.zampini@gmail.com>

MATSEQAIJCUSPARSE: remove wrong CPUtoGPU memory logging


1...<<11121314151617181920>>...28