| 29f7384a | 01-Aug-2014 |
Karl Rupp <rupp@iue.tuwien.ac.at> |
Merged in dmeiser/petsc/fix-cusp-ex4_par (pull request #181)
A bug fix in `VecScatterBegin` for CUSP vectors |
| 501695b7 | 31-Jul-2014 |
Barry Smith <bsmith@mcs.anl.gov> |
Merge branch 'maint' |
| df3898ee | 31-Jul-2014 |
Barry Smith <bsmith@mcs.anl.gov> |
with with -> with the the -> the |
| 9556c46e | 28-Jul-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Make sure data on CPU is up to date.
If VecCUSPCopyFromGPUSome_Public cannot be called we have to call VecCUSPCopyFromGPU. Otherwise the host data might be out of data or unallocated, as is the case
Make sure data on CPU is up to date.
If VecCUSPCopyFromGPUSome_Public cannot be called we have to call VecCUSPCopyFromGPU. Otherwise the host data might be out of data or unallocated, as is the case in the new test introduced in this branch.
With this fix the new test runex4_cusp2 passes. There are some valgrind errors due to uninitialized values. These are fixed by the additional bug fixes in fix-vecscattercusp_stos and fix-cusp-dmda.
show more ...
|
| 2b185f60 | 28-Jul-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Add a parallel cusp test based on ex4.c.
This test currently segfaults. Here is the valgrind stack trace:
``` ==8014== Invalid read of size 8 ==8014== at 0x57BCEA: Pack_1 (vpscat.c:497) ==8014=
Add a parallel cusp test based on ex4.c.
This test currently segfaults. Here is the valgrind stack trace:
``` ==8014== Invalid read of size 8 ==8014== at 0x57BCEA: Pack_1 (vpscat.c:497) ==8014== by 0x58A89D: VecScatterBegin_1 (vpscat.h:110) ==8014== by 0x572441: VecScatterBegin (vscat.c:1724) ==8014== by 0x407D33: main (ex4.c:35) ==8014== Address 0x0 is not stack'd, malloc'd or (recently) free'd ```
show more ...
|
| 237a2283 | 28-Jul-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Use VecCUSPGetArrayReadWrite to obtain scatter destination array.
VecCUSPGetArrayWrite is not sufficient because the scatter only modifies part of the distination vector. VecCUSPGetArrayReadWrite m
Use VecCUSPGetArrayReadWrite to obtain scatter destination array.
VecCUSPGetArrayWrite is not sufficient because the scatter only modifies part of the distination vector. VecCUSPGetArrayReadWrite makes sure that data on GPU has been synchronized with CPU.
show more ...
|
| b51c2805 | 28-Jul-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Remove tabs. |
| 3dd73acf | 24-Jul-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Add cusp scatter test.
This test fails. |
| 305f76c9 | 26-Jun-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Delete commented out code in cuda related files.
Note that there are still a fairly large number of commented out chunks of code in the rest of the code base:
[dmeiser@ivy petsc]$ grep -R '#if\s0'
Delete commented out code in cuda related files.
Note that there are still a fairly large number of commented out chunks of code in the rest of the code base:
[dmeiser@ivy petsc]$ grep -R '#if\s0' * |grep '^src\|include'| wc -l 87
show more ...
|
| 74be5e5d | 05-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Fixing bug in VecCUSPAllocateCheckHost. |
| ca1b137e | 04-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Bug fixes in host memory allocation/checking in VecGetLocalVector_SeqCUSP/VecRestoreLocalVector_SeqCUSP. |
| 2a3563c6 | 04-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Add memory logging, remove debug statements. |
| a7cb3fb1 | 04-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Need to ensure that host vector is allocated. |
| e2af7706 | 04-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Fix allocation of host memory in VecCUSPAllocateCheckHost. |
| 3743e9c1 | 04-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Tweak valid_GPU_array logic. |
| 7954fc51 | 03-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Fix condition in VecValidValues.
Previous logic was broken in the case where vec-valid_GPU_array == PETSC_CUSP_UNALLOCATED. |
| cffa7766 | 03-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Set getlocalvector and restorelocalvector ops entries in Vec_MPICUSP. |
| 39426757 | 03-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Need to check whether vector has a non-null data ptr before dereferencing it. |
| 95a2de98 | 03-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Bug fix in VecGetLocalVector and VecGetLocalVectorRead. |
| 7976bcbf | 03-Feb-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Adding implementations VecGetLocalVector_SeqCUSP and VecRestoreLocalVector_SeqCUSP. |
| 3523fe5c | 31-Jan-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Add methods for acquiring local vector views to _VecOps.
Conflicts: include/petsc-private/vecimpl.h |
| 65a94282 | 29-Jan-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Checking that the host pointer is valid should be sufficient.
Because the GPU array gets invalidated later in this function. |
| 3cd93fd6 | 29-Jan-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Ensure that host array is allocated.
This commit fixes a bug where a null pointer is returned when ((Vec_Seq*)x->data)->array == 0 and x->valid_GPU_array == PETSC_CUSP_UNALLOCATED. This situation a
Ensure that host array is allocated.
This commit fixes a bug where a null pointer is returned when ((Vec_Seq*)x->data)->array == 0 and x->valid_GPU_array == PETSC_CUSP_UNALLOCATED. This situation arises for vectors generated with VecDuplicate_SeqCUSP.
show more ...
|
| 0f5e66f0 | 29-Jan-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Replace cudaMemcpyAsync with cudaMemcpy.
Also page locked host memory registration. |
| 18096054 | 29-Jan-2014 |
Dominic Meiser <dmeiser@txcorp.com> |
Trying to fix the cudaHostRegister/cudaHostUnregister logic. |