History log of /petsc/src/mat/impls/aij/seq/seqcusparse/aijcusparse.cu (Results 401 – 425 of 685)
Revision Date Author Comments
# c1fb3f03 24-Jul-2019 Stefano Zampini <stefano.zampini@gmail.com>

MATSEQAIJCUSPARSE: optimize write access when possible

In MatMult and MatMultAdd_ with compressed rows, we only write access to the GPU array output


# 489de41d 22-Jul-2019 Stefano Zampini <stefano.zampini@gmail.com>

MatSEQAIJ{CUSPARSE|VIENNACL}: do not copy to the GPU if not at the final stage of assembly


# 142fd69e 25-Jul-2019 Karl Rupp <me@karlrupp.net>

CUDA: Removed check for ancient CUDA version >= 5000


# 301298b4 13-Jul-2019 Mark Adams <ma2325@columbia.edu>

CUDA: Remove ancient checks for CUDA >= 4.2


# b5d17604 17-Jul-2019 Karl Rupp <me@karlrupp.net>

Merge branch 'hannah/gpu-gpu-logging' [PR #1881]

* hannah/gpu-gpu-logging:
This branch adds timers to include GPU data movement (device to device data movement) in GPU logging.


# ce7cfea0 16-Jul-2019 hannah_mairs <hannah.mairs@gmail.com>

removing comments


# 619acb3e 16-Jul-2019 hannah_mairs <hannah.mairs@gmail.com>

adding GPU-GPU data movement to logging


# a8158fb5 14-Jul-2019 Karl Rupp <me@karlrupp.net>

Merge branch 'gropp/doc-fixes-1' [PR #1838]


# 7a71495b 04-Jul-2019 Karl Rupp <me@karlrupp.net>

Merge branch 'hannah/gpu-computation-logging' [PR #1843]

* hannah/gpu-computation-logging:
Adding GPU flop rate and GPU time.


# 7a052e47 03-Jul-2019 hannah_mairs <hannah.mairs@gmail.com>

PetscLogGpuTimeStart -> Begin


# 958c4211 01-Jul-2019 hannah_mairs <hannah.mairs@gmail.com>

Adding Gpu flop rate and GPU time


# 57a5760d 02-Jul-2019 William Gropp <wgropp@illinois.edu>

Merge from doc-fixes branch


# 3b49ee3e 28-Jun-2019 Hannah Morgan <hannah.mairs@gmail.com>

Merged in hannah/gpu-communication-logging (pull request #1814)

Hannah/gpu communication logging

Approved-by: BarryFSmith <bsmith@mcs.anl.gov>
Approved-by: Richard Mills <rtm@eecs.utk.edu>


# 4863603a 28-Jun-2019 Satish Balay <balay@mcs.anl.gov>

Adding vector logging, started matrix logging


# a2b725a8 27-Jun-2019 William Gropp <wgropp@illinois.edu>

Fixes for doctext update


# b6a92dca 26-Jun-2019 BarryFSmith <bsmith@mcs.anl.gov>

Merged in barry/cuda-multigrid-test (pull request #1763)

Various improvements for GPUs (mostly for performance and CUDA)


# fdc842d1 31-May-2019 Barry Smith <bsmith@mcs.anl.gov>

Various improvements for GPUs (mostly for performance and CUDA)

1) Add VecPinToCPU() for CUDA vector and matrices
2) Move initialization of cuBLAS to PetscInitialize() since it takes 1/2 second and

Various improvements for GPUs (mostly for performance and CUDA)

1) Add VecPinToCPU() for CUDA vector and matrices
2) Move initialization of cuBLAS to PetscInitialize() since it takes 1/2 second and distorts timing with -log_view
3) Add logging for DMCreateMatrix (for large meshes this is very large)
4) Add VecGet/RestoreArrayWrite() to prevent unneeded copies from GPU (only implemented so far for CUDA);
added a small number of usages in the source so that snes tutorials ex19 does not do unneeded communication from the GPU
5) Automatically convert MAIJ matrices to AIJ for CUDA since they are not yet supported natively in PETSc's CUDA matrix implementation
6) Pinned objects should still use the CUDA/ViennaCL versions of Destroy to clean up the GPU stuff

Commit-type: feature

show more ...


# 89583661 21-Jun-2019 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/barry/add-petscarrayxxx-ops' [PR #1771]


# f22e0265 19-Jun-2019 Barry Smith <bsmith@mcs.anl.gov>

Remove dead Matlab code, fix typo, fix caste needed for const free

Commit-type: portability-fix
Reported-by: Satish Balay <balay@mcs.anl.gov>


# 580bdb30 07-Jun-2019 Barry Smith <bsmith@mcs.anl.gov>

Add PetscArraycmp(), PetscArraycpy(), PetscArrayzero(), PetscArraymove(), update code to use new forms

Replace a couple multi-mallocs with a single malloc
Replace backwards loops in MatSetValues_*()

Add PetscArraycmp(), PetscArraycpy(), PetscArrayzero(), PetscArraymove(), update code to use new forms

Replace a couple multi-mallocs with a single malloc
Replace backwards loops in MatSetValues_*() with PetscArraymove()

Commit-type: style-fix, feature

show more ...


# 613bfe33 02-Jun-2019 BarryFSmith <bsmith@mcs.anl.gov>

Merged in barry/update-collective-on (pull request #1744)

Update the use of Collective on in the manual pages to reflect the new style


# d083f849 01-Jun-2019 Barry Smith <bsmith@mcs.anl.gov>

Update the use of Collective on in the manual pages to reflect the new style

Commit-type: style-fix, documentation
Thanks-to: Patrick Sanan <patrick.sanan@gmail.com>


# b2134635 27-May-2019 Karl Rupp <me@karlrupp.net>

Merge branch 'barry/rm-readwrite' [PR #1710]

* barry/rm-readwrite:
Remove the ReadWrite notation from Vec array access for GPU vectors to match usage on the CPU.


# f2d70e9d 25-May-2019 Barry Smith <bsmith@mcs.anl.gov>

Remove the ReadWrite notation from Vec array access for GPU vectors to match usage on the CPU

Commit-type: style-fix

Development Tools: Vim, Emacs, Eclipse


# a5a49157 25-Oct-2018 Joseph Pusztay <josephpusztay@Josephs-MacBook-Pro.local>

Merge branch 'master' into jpusztay/feature-swarm-symplectic-example


1...<<11121314151617181920>>...28