History log of /petsc/src/mat/impls/aij/mpi/mpicusparse/makefile (Results 51 – 75 of 80)
Revision Date Author Comments
# 898446f9 11-Mar-2015 Shri Abhyankar <abhyshr@mcs.anl.gov>

Merge branch 'master' into shri/ts-is-for-differential-variables

Conflicts:
include/petsc-private/tsimpl.h
src/ts/impls/eimex/eimex.c
src/ts/interface/ts.c


# f2dee214 06-Mar-2015 Barry Smith <bsmith@mcs.anl.gov>

Merge branch 'master' into jed/chem
Note merging a branch with good stuff in it for a year and a half is insane!

Conflicts:
src/ts/examples/tutorials/makefile
src/ts/interface/ts.c


# 4e7385fa 16-Feb-2015 Stefano Zampini <stefano.zampini@gmail.com>

Merge branch 'master' into stefano_zampini/pcbddc-adaptive-selection

Conflicts:
src/ksp/pc/impls/bddc/bddcprivate.c


# a197972a 11-Feb-2015 Barry Smith <bsmith@mcs.anl.gov>

Merge branch 'master' into barry/saws-options-merge-master

Totally confusing manual merge, could be wrong

Conflicts:
include/petscoptions.h
src/dm/impls/da/dacreate.c
src/sys/objects/aoptions.c

Merge branch 'master' into barry/saws-options-merge-master

Totally confusing manual merge, could be wrong

Conflicts:
include/petscoptions.h
src/dm/impls/da/dacreate.c
src/sys/objects/aoptions.c
src/sys/threadcomm/interface/threadcomm.c

show more ...


# 50705773 10-Feb-2015 Barry Smith <bsmith@mcs.anl.gov>

Merge branch 'master' into pr252/dmay/PetscViewerBinarySkipHeaderVecViewLoad
Manual merge of makefile

Conflicts:
src/sys/classes/viewer/examples/tutorials/makefile


# 3fe9ae9b 15-Jan-2015 Mark Adams <mfadams@lbl.gov>

Merge branch 'master' of bitbucket.org:petsc/petsc into mark/gamg-crs-ksp2


# fe998a80 13-Jan-2015 Barry Smith <bsmith@mcs.anl.gov>

manual merge of conflict


# d3c8c067 01-Jan-2015 Barry Smith <bsmith@mcs.anl.gov>

Three directory namespace changes for better --prefix install support

1) include/finclude becomes include/petsc-finclude
2) include/mpiuni becomes include/petsc-mpiuni
3) conf (both $PETSC_DIR/conf

Three directory namespace changes for better --prefix install support

1) include/finclude becomes include/petsc-finclude
2) include/mpiuni becomes include/petsc-mpiuni
3) conf (both $PETSC_DIR/conf and $PETSC_DIR/$PETSC_ARCH/conf become) lib/petsc-conf

show more ...


# 9c925a2c 18-Mar-2014 Shri <abhyshr@mcs.anl.gov>

Merge branch 'master' into shri/ts-events

Conflicts:
src/ts/interface/ts.c


# edbbd480 10-Dec-2013 Barry Smith <bsmith@mcs.anl.gov>

Merge branch 'master' into barry/xcode


# 256ff83f 11-Sep-2013 Barry Smith <bsmith@mcs.anl.gov>

Merge branch 'master' into barry/wirth-fusion-materials

Conflicts:
src/ts/examples/tutorials/advection-diffusion-reaction/ex10.c


# cc85fe4d 04-Sep-2013 Barry Smith <bsmith@mcs.anl.gov>

Merge branch 'barry/dmvecmattypes' into barry/saws

Needed to work with version of PETSc that did not have constant calls to VecSetFromOptions() etc

Conflicts:
src/ksp/ksp/interface/ams/kspams.c
s

Merge branch 'barry/dmvecmattypes' into barry/saws

Needed to work with version of PETSc that did not have constant calls to VecSetFromOptions() etc

Conflicts:
src/ksp/ksp/interface/ams/kspams.c
src/snes/impls/composite/snescomposite.c
src/snes/impls/gs/snesgs.c
src/snes/impls/nasm/nasm.c
src/snes/impls/ngmres/snesngmres.c

show more ...


# c0c93d0e 28-Aug-2013 Matthew G. Knepley <knepley@gmail.com>

Merge branch 'master' into knepley/feature-dmda-section

* master: (287 commits)
Mat ex170: Comments
VTK: Small fix to error message (.vts to .vtu)
VTK: Small fix to error message
Fixed bib e

Merge branch 'master' into knepley/feature-dmda-section

* master: (287 commits)
Mat ex170: Comments
VTK: Small fix to error message (.vts to .vtu)
VTK: Small fix to error message
Fixed bib entries
Bib: Updates
AO: fix erroneous processing of -ao_view and factor into AOViewFromOptions
doc: fix named argument in {Vec,Mat,DM}ViewFromOptions
Sys: add PetscDataTypeFromString() and test code
Mat: Should say that it has a nullspace in MatView()
parms: update tarball with fix for namespace conflict with metis
fix citation 'Golub_Varga_1961'
parmetis: update tarball to parmetis-4.0.2-p5 which fixes an install issue with cygwin
Sys Logging: revert parent traversal
fixed hdf5.py so that if self.libraries.compression is None the code still runs correctly
DMDA: fix bad cast of DM_DA to PetscObject
MatClique: follow DistMultiVec API changes
MatClique: remove unused variables
config cmakeboot: add C++ flags any time compiler is available
config OpenMP: check for C++ flag any time the compiler is available
replaced all left-over uses of a single PetscMalloc() to allocated multiple arrays: replaced with PetscMallocN() The only ones left are when the second array is set into the first array and one ugly usage in the MUMPS interface that cannot be easily fixed
...

show more ...


# b0418fcf 25-Jul-2013 Stefano Zampini <stefano.zampini@gmail.com>

Merge remote-tracking branch 'origin/master' into stefano_zampini/pcbddc-improvelocalsolvers


# 8533652c 25-Jul-2013 Stefano Zampini <stefano.zampini@gmail.com>

Merge remote-tracking branch 'origin/master' into stefano_zampini/pcbddc-mirrorsfix


# 6daa6ed0 25-Jul-2013 Stefano Zampini <stefano.zampini@gmail.com>

Merge remote-tracking branch 'origin/master' into stefano_zampini/pcbddc-constraintssetupimproved


# 72cfe0ad 23-Jul-2013 Karl Rupp <rupp@iue.tuwien.ac.at>

Merge branch 'paulmullowney/txpetscgpu-package-removal'


# 1019ffe6 20-Jul-2013 Matthew G. Knepley <knepley@gmail.com>

CUDA: CUSPARSE still includes CUSP headers, so change the guards


# b06137fd 27-Jun-2013 Paul Mullowney <paulm@txcorp.com>

Removing TXPETSCGPU from veccusp and mpiaijcusparse

In this next step of removing TXPETSCGPU, the host-device and
device-host messaging code has been significantly simplified. In
particular, all met

Removing TXPETSCGPU from veccusp and mpiaijcusparse

In this next step of removing TXPETSCGPU, the host-device and
device-host messaging code has been significantly simplified. In
particular, all methods VecCUSPCopyToGPU/FromGPU now use
a cudaMemcpyAsync with a stream (and a stream synchronize()).
This never hurts you. Moreover, it can help you in the case
of the multi-GPU SpMV as this data transfer will overlap
with the MatMult kernel. The more signficant change comes in
VecCUSPCopyToGPUSome and VecCUSPCopyFromGPUSome. In this code,
the data transfer now moves the smallest contiguous set of
vector data containing ALL the indices in a single asynchronous data
transfer. Then, the stream containing the data transfer is
synchronized (not the entire device). While this can be wasteful
in terms of messaging too much data, it has shown the best
scalability performance across a wide range of matrices. Lastly
the simplicity of the code is a significant advantage over
the old way of doing the data transfer. Some old cold
in these methods is "if 0"-ed out for reference and will be
cleaned up later. One final optimization in the vector code
involves registering the host buffer as page locked--which
is done in VecCUSPAllocateCheck. Then, the buffer must be
unregistered at VecDestroy_SeqCUSP. This shows a nice
speedup in the data transfer for a parallel MatMult.

Also in this commit, I am removing the TXPETSCGPU dependence from
the mpiaijcusparse class--it now depends only on CUDA. In order
for the same stream to be used in the MatMult and MatMultAdd
(necessary for an optimal Multi-GPU SpMV), the stream is built
in the mpiaijcusparse and then passed in the seqaijcusparse data
structure via a new method (MatCUSPARSESetStream). A similar method
is added for the CUSPARSE library handle (context) as I think the
stream needs to be attached to a particular context to work properly.
When running in parallel, multiple GPUs, the references to the handle
in the seqaijcusparse are cleared from the mpiaijcusparse classes with
the method MatCUSPARSEClearHandle. Then, the mpiaijcusparse class
deletes the handle.

One other non-trivial change was made to the seqaijcusparse. The alpha
and beta parameters to the SpMV are now device data which is owned by
the Mat_SEQAIJCUSPARSEMultStruct structure. This enables slightly better
multi-GPU performance as this data does not need to be copied to the
GPU at each kernel launch.

Multi-GPU SpMV now works without TXPETSCGPU and the performance is recovered
as tested on up to 4 GPUs. Code is valgrind clean and cuda-memcheck clean.

Results of tests have been modified to have 1 less digit of precision. This
yields consistent results across different GPUs. Lastly, the parallel test
is set to run on a different matrix (shallow_water1) so that the iteration
actually converges.

show more ...


# e33c197d 11-Jun-2013 Richard Mills <rtm@eecs.utk.edu>

Merged petsc/petsc into rmills/petsc master.


# 20cbb662 11-Apr-2013 Jed Brown <jed@59A2.org>

Merge branch 'jed/fix-locdir'

* jed/fix-locdir:
LOCDIR: fix all outdated paths and normalize trailing slash


# 3e1910f1 10-Apr-2013 Jed Brown <jed@59A2.org>

LOCDIR: fix all outdated paths and normalize trailing slash

Update performed by:

git ls-files '*/makefile' | xargs perl -pi -MFile::Basename \
-e 's!(LOCDIR[[:space:]]+=) *(.+)$!$1." ".dirnam

LOCDIR: fix all outdated paths and normalize trailing slash

Update performed by:

git ls-files '*/makefile' | xargs perl -pi -MFile::Basename \
-e 's!(LOCDIR[[:space:]]+=) *(.+)$!$1." ".dirname($ARGV)."/"!e'

Fortran auto must be fixed using 'make allfortranstubs' and the result
can be checked by 'bin/maint/checkloc.py'.

show more ...


# 4042b796 17-Mar-2013 Jed Brown <jed@59A2.org>

Merge branch 'master' into jed/ts-eimex

Sync to include Git conversion, PETSC_EXTERN, and minor API changes.

Conflicts:
src/ts/interface/tsregall.c


# b769355b 31-Dec-2012 Jed Brown <jed@59A2.org>

merge PetscSF implementation

Hg-commit: 98dccc99445b69cb7a66312149eb16fec359e294


# 5a4671ae 30-Dec-2012 Barry Smith <bsmith@mcs.anl.gov>

removed trailing spaces from all files

Hg-commit: 1cfa55059afe46c5145e0e830cd797b97216abfb


1234