| #
c1fb3f03
|
| 24-Jul-2019 |
Stefano Zampini <stefano.zampini@gmail.com> |
MATSEQAIJCUSPARSE: optimize write access when possible
In MatMult and MatMultAdd_ with compressed rows, we only write access to the GPU array output
|
| #
489de41d
|
| 22-Jul-2019 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatSEQAIJ{CUSPARSE|VIENNACL}: do not copy to the GPU if not at the final stage of assembly
|
| #
142fd69e
|
| 25-Jul-2019 |
Karl Rupp <me@karlrupp.net> |
CUDA: Removed check for ancient CUDA version >= 5000
|
| #
301298b4
|
| 13-Jul-2019 |
Mark Adams <ma2325@columbia.edu> |
CUDA: Remove ancient checks for CUDA >= 4.2
|
| #
b5d17604
|
| 17-Jul-2019 |
Karl Rupp <me@karlrupp.net> |
Merge branch 'hannah/gpu-gpu-logging' [PR #1881]
* hannah/gpu-gpu-logging: This branch adds timers to include GPU data movement (device to device data movement) in GPU logging.
|
| #
ce7cfea0
|
| 16-Jul-2019 |
hannah_mairs <hannah.mairs@gmail.com> |
removing comments
|
| #
619acb3e
|
| 16-Jul-2019 |
hannah_mairs <hannah.mairs@gmail.com> |
adding GPU-GPU data movement to logging
|
| #
a8158fb5
|
| 14-Jul-2019 |
Karl Rupp <me@karlrupp.net> |
Merge branch 'gropp/doc-fixes-1' [PR #1838]
|
| #
7a71495b
|
| 04-Jul-2019 |
Karl Rupp <me@karlrupp.net> |
Merge branch 'hannah/gpu-computation-logging' [PR #1843]
* hannah/gpu-computation-logging: Adding GPU flop rate and GPU time.
|
| #
7a052e47
|
| 03-Jul-2019 |
hannah_mairs <hannah.mairs@gmail.com> |
PetscLogGpuTimeStart -> Begin
|
| #
958c4211
|
| 01-Jul-2019 |
hannah_mairs <hannah.mairs@gmail.com> |
Adding Gpu flop rate and GPU time
|
| #
57a5760d
|
| 02-Jul-2019 |
William Gropp <wgropp@illinois.edu> |
Merge from doc-fixes branch
|
| #
3b49ee3e
|
| 28-Jun-2019 |
Hannah Morgan <hannah.mairs@gmail.com> |
Merged in hannah/gpu-communication-logging (pull request #1814)
Hannah/gpu communication logging
Approved-by: BarryFSmith <bsmith@mcs.anl.gov> Approved-by: Richard Mills <rtm@eecs.utk.edu>
|
| #
4863603a
|
| 28-Jun-2019 |
Satish Balay <balay@mcs.anl.gov> |
Adding vector logging, started matrix logging
|
| #
a2b725a8
|
| 27-Jun-2019 |
William Gropp <wgropp@illinois.edu> |
Fixes for doctext update
|
| #
b6a92dca
|
| 26-Jun-2019 |
BarryFSmith <bsmith@mcs.anl.gov> |
Merged in barry/cuda-multigrid-test (pull request #1763)
Various improvements for GPUs (mostly for performance and CUDA)
|
| #
fdc842d1
|
| 31-May-2019 |
Barry Smith <bsmith@mcs.anl.gov> |
Various improvements for GPUs (mostly for performance and CUDA)
1) Add VecPinToCPU() for CUDA vector and matrices
2) Move initialization of cuBLAS to PetscInitialize() since it takes 1/2 second and distorts timing with -log_view
3) Add logging for DMCreateMatrix (for large meshes this is very large)
4) Add VecGet/RestoreArrayWrite() to prevent unneeded copies from GPU (only implemented so far for CUDA); added a small number of usages in the source so that snes tutorials ex19 does not do unneeded communication from the GPU
5) Automatically convert MAIJ matrices to AIJ for CUDA since they are not yet supported natively in PETSc's CUDA matrix implementation
6) Pinned objects should still use the CUDA/ViennaCL versions of Destroy to clean up the GPU stuff
Commit-type: feature
|
| #
89583661
|
| 21-Jun-2019 |
Satish Balay <balay@mcs.anl.gov> |
Merge remote-tracking branch 'origin/barry/add-petscarrayxxx-ops' [PR #1771]
|
| #
f22e0265
|
| 19-Jun-2019 |
Barry Smith <bsmith@mcs.anl.gov> |
Remove dead Matlab code, fix typo, fix cast needed for const free
Commit-type: portability-fix Reported-by: Satish Balay <balay@mcs.anl.gov>
|
| #
580bdb30
|
| 07-Jun-2019 |
Barry Smith <bsmith@mcs.anl.gov> |
Add PetscArraycmp(), PetscArraycpy(), PetscArrayzero(), PetscArraymove(), update code to use new forms
Replace a couple multi-mallocs with a single malloc
Replace backwards loops in MatSetValues_*() with PetscArraymove()
Commit-type: style-fix, feature
|
| #
613bfe33
|
| 02-Jun-2019 |
BarryFSmith <bsmith@mcs.anl.gov> |
Merged in barry/update-collective-on (pull request #1744)
Update the use of Collective on in the manual pages to reflect the new style
|
| #
d083f849
|
| 01-Jun-2019 |
Barry Smith <bsmith@mcs.anl.gov> |
Update the use of Collective on in the manual pages to reflect the new style
Commit-type: style-fix, documentation Thanks-to: Patrick Sanan <patrick.sanan@gmail.com>
|
| #
b2134635
|
| 27-May-2019 |
Karl Rupp <me@karlrupp.net> |
Merge branch 'barry/rm-readwrite' [PR #1710]
* barry/rm-readwrite: Remove the ReadWrite notation from Vec array access for GPU vectors to match usage on the CPU.
|
| #
f2d70e9d
|
| 25-May-2019 |
Barry Smith <bsmith@mcs.anl.gov> |
Remove the ReadWrite notation from Vec array access for GPU vectors to match usage on the CPU
Commit-type: style-fix
Development Tools: Vim, Emacs, Eclipse
|
| #
a5a49157
|
| 25-Oct-2018 |
Joseph Pusztay <josephpusztay@Josephs-MacBook-Pro.local> |
Merge branch 'master' into jpusztay/feature-swarm-symplectic-example
|