| 803a1b88 | 15-Feb-2018 |
Hong Zhang <hzhang@mcs.anl.gov> |
VecScatter: reorganization
implement several vecnode routines rm mpi3 from _p_VecScatter struct -- nolonger useful rm using native shared memory from src/ksp/ksp/examples/tests/ex56.c
VecScatter: reorganization
implement several vecnode routines rm mpi3 from _p_VecScatter struct -- nolonger useful rm using native shared memory from src/ksp/ksp/examples/tests/ex56.c fix memory bug in VecScatterCreateLocal_PtoS_MPI3() bugfix for VecScatterCopy() -- copy scattertype add flg is_duplicate to _p_VecScatter; use it to prevent using StoP in VecScatterEndMPI3Node() rename VecScatterCreateGeneral_ -> VecScatterCreate_ rename VecScatterCreate_vectype_mpitype() -> VecScatterCreateLocal_vectype_mpitype() check for duplicate indices when mpi3node scatter is created; update tests undo the commit bb8026e126c43792cc81c094ed5610e9a92280e3: add an example for testing MatMult() with node vectors add MPI_Barrier(mscomm) in VecScatterEndMPI3Node for safe, but would lose correctness due to wright to same variable (see run2_5 and run3_5). Until finding an efficient way to deal with variable racing, we may give up this work :-( add test output files bugfix for VecScatterEndMPI3Node; update ex2.c Merge branch 'hzhang/vec_node/master-rebased' of bitbucket.org:petsc/petsc into hzhang/vec_node/master-rebased add an example for testing MatMult() with node vectors bugfix for VecScatterEndMPI3Node() rm checking vec_type in VecScatterCreate_mpi3node() add winarray to Vec_node struct to avoid calls to MPI_Win_shared_query() in scatterEnd() optionize a loop for xv -> yv small bugfix and optimization implement several Vec routines for node type using object state counter for vector to rm 2 MPI_Barrier() small optimization and cleanup small build fix add few vector routines for node type add VecScatterCreate_vectype_private() for cleanup replace MPI_Barrier(comm) by MPI_Barrier(mscomm) add tests for StoP scatter bugfix: src/vec/vscat/examples/ex1 and ex2 are working well now update ex1 and ex2 reverse scatter PtoS works for bs=1 cleanup; fixed memory bug for bs>1 PtoP and PtoS(forward) scatter work for bs>1 mpi3node works for PtoP forward scatter, I can read on-core xvalues in VecScatterEndMPI3Node check vectypes for VecScatterCreate_MPI3Node() add VecScatterCreate_MPI2Node() -- not done yet rename VecScatterCreate_StoS() -> xxx_Seq(); add VecScatterCreate_type(); reorganize cleanup and organization -- all seems working well cleanup; update VecScatterSetFromOptions() add VecScatterSetFromOptions() rm VecScatterCreate_MPI1(); cleanup cleanup VecScatterCreateGeneral_StoP() add VecScatterCreateGeneral_StoP() cleanup VecScatterCreateGeneral_PtoS() VecScatterCreateGeneral_PtoS() works bugfix after rebase -- passed local tests for both mpi1 and mpi3
add VecScatterCreate_StoS() add GetInputISType_private() update src/vec/vscat/examples/ex1.c and its output new VecScatterCreate() as an interface function add src/vec/vscat/interface/dlregisvecscat.c mv vecscatter routines from src/vec/vec/ to src/vec/vscat/ add src/vec/vscat/interface/vscreate.c reorganize files: create src/vec/vscat small fix for mmaij.c add VecScatterType rename VecScatterType -> VecScatterFormat add new VecType VECNODE for creating vector over on-node shared memory
show more ...
|
| 1bc6beb8 | 15-Feb-2018 |
Satish Balay <balay@mcs.anl.gov> |
superlu_dist now sets _LONGINT in superlu_dist_config.h - so remove the duplicate definition from petsc |
| b60407b9 | 29-Jan-2018 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatMPIAIJSetPreallocationCSR_MPIAIJ: fix for empty processes |
| 0f7f3e6a | 13-Feb-2018 |
Hong Zhang <hzhang@mcs.anl.gov> |
Merge branch 'hzhang/mumps-invA' |
| 2366e350 | 12-Feb-2018 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatSuperLU: add support for MatSeqSELL |
| 7ee00b23 | 12-Feb-2018 |
Stefano Zampini <stefano.zampini@gmail.com> |
MatMUMPS: add support for MAT_FACTOR_LU and MatSeqSELL |
| 49ea6917 | 07-Feb-2018 |
Hong Zhang <hongzhang@anl.gov> |
Add AVX and AVX2 versions of MatMult for MATAIJ
Funded-by: Project: PETSc for Theta Time: 2.0 hours Reported-by: Thanks-to: |
| 6a6db293 | 04-Oct-2017 |
Hong Zhang <hongzhang@anl.gov> |
Vectorize MatMult for aijperm
Done in the similar way with aij. It results in worse performance.
Funded-by: Project: PETSc for Theta Time: 4.0 hours Reported-by: Thanks-to: |
| 407323c4 | 04-Oct-2017 |
Hong Zhang <hongzhang@anl.gov> |
Vectorize MatMult for aij
Funded-by: Project: PETSc for Theta Time: 4.0 hours Reported-by: Thanks-to: |
| 5390ecae | 09-Feb-2018 |
Barry Smith <bsmith@mcs.anl.gov> |
Merge branch 'maint' |
| 76a8abe0 | 08-Feb-2018 |
Barry Smith <bsmith@mcs.anl.gov> |
Updates for viewing Load Balance information
Don't display for sequential objects. Label all values with "Load Balance" Want to display load balance for petsc ordering for vectors, not using natural
Updates for viewing Load Balance information
Don't display for sequential objects. Label all values with "Load Balance" Want to display load balance for petsc ordering for vectors, not using natural ordering
Commit-type: feature
show more ...
|
| ef5fdb51 | 07-Feb-2018 |
Barry Smith <bsmith@mcs.anl.gov> |
Add viewer format object for displaying load imbalance of vectors and nonzeros in sparse MPIAIJ matrices
Could be extended to DMs include DMDA and DMPlex
Commit-type: feature Thanks-to: Fande Kong
Add viewer format object for displaying load imbalance of vectors and nonzeros in sparse MPIAIJ matrices
Could be extended to DMs include DMDA and DMPlex
Commit-type: feature Thanks-to: Fande Kong <fande.kong@inl.gov>
show more ...
|
| 89a9c03a | 07-Feb-2018 |
Hong Zhang <hzhang@mcs.anl.gov> |
rename MatMumpsGetMatInverse() -> MatMumpsGetInverse |
| e3f2db6a | 06-Feb-2018 |
Hong Zhang <hzhang@mcs.anl.gov> |
MatMumpsGetMatInverse() works in parallel |
| 14267174 | 05-Feb-2018 |
Hong Zhang <hzhang@mcs.anl.gov> |
fix memory leak; cleanup ex214.c |
| bb599dfd | 04-Feb-2018 |
Hong Zhang <hzhang@mcs.anl.gov> |
add MatMumpsGetMatInverse() -- works in sequential |
| 9481e6e9 | 04-Feb-2018 |
Hong Zhang <hzhang@mcs.anl.gov> |
sequential '-nrhs 12' works now |
| 566a9afd | 03-Feb-2018 |
Barry Smith <bsmith@mcs.anl.gov> |
removed unused variables in cpardiso interface; added casts for address of MPI_Comm since this is their interface
Commit-type: style-fix |
| 680d2628 | 03-Feb-2018 |
Barry Smith <bsmith@mcs.anl.gov> |
Accidently dropped ; when marking variable as const
Commit-type: bug-fix |
| 74fd8ad3 | 02-Feb-2018 |
Barry Smith <bsmith@mcs.anl.gov> |
Need work vector for MatMult() to be ViennaCL
Commit-type: bug-fix Reported-by: nightly tests |
| 671f4814 | 31-Jan-2018 |
Satish Balay <balay@mcs.anl.gov> |
fix: compile error with --with-errorchecking=0
/home/balay/petsc/src/mat/impls/aij/mpi/mpiaij.c: In function ‘matsetvaluesmpiaij_’: /home/balay/petsc/src/mat/impls/aij/mpi/mpiaij.c:5734:11: error: ‘
fix: compile error with --with-errorchecking=0
/home/balay/petsc/src/mat/impls/aij/mpi/mpiaij.c: In function ‘matsetvaluesmpiaij_’: /home/balay/petsc/src/mat/impls/aij/mpi/mpiaij.c:5734:11: error: ‘else’ without a previous ‘if’
show more ...
|
| 87b22cf4 | 31-Jan-2018 |
Hong Zhang <hzhang@mcs.anl.gov> |
add support for mumps computing user-specified set of entries in inv(A) (ICNTL(30)) Requested-by: Marius Buerkle <mbuerkle@web.de> |
| 542ff2ea | 30-Jan-2018 |
Barry Smith <bsmith@mcs.anl.gov> |
Merge branch 'barry/fix-matdensegetarray' |
| ea799195 | 29-Jan-2018 |
Barry Smith <bsmith@mcs.anl.gov> |
change MatSolverType to be a const char* typedef to match other XXType
Commit-type: style-fix Reported-by: Franck Houssen <franck.houssen@inria.fr> |
| ca6d6036 | 28-Jan-2018 |
Barry Smith <bsmith@mcs.anl.gov> |
Merge branch 'maint' |