History log of /petsc/src/vec/is/sf/impls/basic/sfpack.c (Results 101 – 125 of 126)
Revision Date Author Comments
# b0d1f38a 22-Mar-2020 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/jczhang/feature-sf-pack-direct'

Directly pass root/leafdata to MPI in SF when possible

See merge request petsc/petsc!2506


# c2a741ee 05-Feb-2020 Junchao Zhang <jczhang@mcs.anl.gov>

Test MPI gpu awareness and use default -use_gpu_aware_mpi 1


# fcc7397d 21-Jan-2020 Junchao Zhang <jczhang@mcs.anl.gov>

Use a 3d submatrix to describe indices in packing


# f01131f0 08-Jan-2020 Junchao Zhang <jczhang@mcs.anl.gov>

Change PetscMemcpyWithMemType to PetscSFLinkMemcpy and make it asynchronous

With that, all CUDA calls in SF are asynchronous and work on link->stream. The reason is that we want to avoid sudden 'join points' caused by synchronous CUDA calls.


# b85e67b7 08-Jan-2020 Junchao Zhang <jczhang@mcs.anl.gov>

Add option -sf_use_stream_aware_mpi


# cd620004 05-Dec-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Refactor SF packing

1) Separate out local communication from remote communication
2) Directly pass root/leafdata to MPI when suitable


# a9e32f46 12-Dec-2019 Satish Balay <balay@mcs.anl.gov>

Merge branch 'stefanozampini/petscpartitioner-target-weights' into 'master'

Fix PetscPartitioner and add support for target partition weights

See merge request petsc/petsc!2333


# fb61b9e4 27-Nov-2019 Stefano Zampini <stefano.zampini@gmail.com>

PetscSortedIntRemoveDups: use PetscCheckSorted


# 203a8786 29-Nov-2019 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jczhang/feature-sf-do-pack-on-gpu' into 'master'

Add support to do pack/unpack on GPU and do MPI on CPU

See merge request petsc/petsc!2205


# 51ccb202 05-Nov-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Add an option -sf_use_pinned_buffer to use non-pageable host memory for send/recv buffer when passing GPU data


# 637e6665 16-Oct-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Rename rkey, lkey to rootdata, leafdata


# 893c5908 30-Oct-2019 Satish Balay <balay@mcs.anl.gov>

Merge branch 'maint'


# 7033dd2d 30-Oct-2019 Satish Balay <balay@mcs.anl.gov>

Merge branch 'jed/fix-destroy-spelling' into 'maint'

Spelling: Destory -> Destroy

See merge request petsc/petsc!2231


# 64f49bab 28-Oct-2019 Jed Brown <jed@jedbrown.org>

Spelling: Destory -> Destroy

Commit-type: style-fix


# 71f2a993 16-Oct-2019 Satish Balay <balay@mcs.anl.gov>

Merge branch 'stefanozampini/fix-sfcompose' into 'master'

Allow sparse leaves in SFCompose operations

See merge request petsc/petsc!2164


# 5ad15460 11-Oct-2019 Junchao Zhang <jczhang@mcs.anl.gov>

SF: treat PETSc builtin datatypes the same way as MPI builtin datatypes

So that we won't duplicate PETSc builtin datatypes like MPIU_2INT. Otherwise,
we cannot simply use "if (type != MPIU_2INT) SETERRQ()" in code.

Add support to unwrap a type created by MPI_Type_contiguous(1,..)
Let dumb types use their own type


# c1acdb04 28-Sep-2019 Satish Balay <balay@mcs.anl.gov>

Merge remote-tracking branch 'origin/jczhang/feature-sf-on-gpu'

Add GPU-aware VecScatter/PetscSF

See merge request petsc/petsc!1995


# eb02082b 25-Sep-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Added SF GPU support


# b23bfdef 13-Aug-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Update pack/unpack routines to do packing/unpacking for all neighbors in at most two routines

One is used to pack data in self-to-self communication; the second is used for remote communication.
That way, on GPU we can use at most two kernels to do packing/unpacking for all neighbors instead of multiple kernels.
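
The idea of handling all neighbors in one routine can be sketched in plain C. This is only an illustration under stated assumptions, not the code in sfpack.c: the function name `pack_all_neighbors`, the CSR-style `offset` array, and the `double` element type are all hypothetical. Because the per-neighbor index lists are stored back to back, a single loop (or, on a GPU, a single kernel over the same range) can fill the whole send buffer.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical sketch: pack entries for every neighbor in one pass.
 * offset has nneighbors + 1 entries; neighbor n owns the index range
 * idx[offset[n]] .. idx[offset[n + 1] - 1]. Since the ranges are
 * contiguous, one loop over [0, offset[nneighbors]) covers them all. */
static void pack_all_neighbors(size_t nneighbors, const size_t *offset,
                               const size_t *idx, const double *data,
                               double *buf)
{
  assert(offset != NULL);
  const size_t total = offset[nneighbors];
  for (size_t i = 0; i < total; i++) {
    buf[i] = data[idx[i]]; /* gather one entry into the contiguous buffer */
  }
}
```

With this layout, the slice of `buf` destined for neighbor `n` starts at `buf + offset[n]`, so each send can point directly into the shared buffer.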


# 1b085a39 29-Jul-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Code style change


# 05393080 25-Jul-2019 Karl Rupp <me@karlrupp.net>

Merge branch 'jczhang/sf-more-opts' [PR #1567]

* jczhang/sf-more-opts:
Add more optimizations in SF and use it as the default for VecScatter.


# 18fb5014 24-Jul-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Code style change


# 33c49614 24-Jul-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Ignore potential overlapping when rootdata=leafdata=NULL

This case happens with
ex62_fetidp_3d_tet – snes.tutorials
ex62_fetidp_3d_hex – snes.tutorials
ex59_bddc_fetidp_ml_eqlimit_2 – ksp.ksp.tutorials
ex59_bddc_fetidp_ml_eqlimit_1 – ksp.ksp.tutorials


# 9d1c8add 23-Jul-2019 Junchao Zhang <jczhang@mcs.anl.gov>

SF: Partially fix a bug when overlapped SF communications have same rootdata or leafdata on some ranks

Now we use two keys (rootdata, leafdata) to identify a pending communication, but that is still not
enough when communications have the same rootdata and leafdata on some ranks. Currently we
error out in these cases. See src/vec/is/sf/examples/tutorials/ex2.c for the cases we can
and cannot handle.
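
A minimal sketch of two-key identification follows, assuming a simple linked list of in-flight communications. The names `PendingLink`, `find_pending`, and `register_pending` are hypothetical and do not reflect PETSc's actual data structures. A second communication with the same (rootdata, leafdata) pair cannot be told apart from the first, so the sketch rejects it, mirroring the "error out" behavior described above.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical record for one in-flight communication. */
typedef struct PendingLink {
  const void *rootdata;
  const void *leafdata;
  struct PendingLink *next;
} PendingLink;

/* Return the pending link whose two keys both match, or NULL if none. */
static PendingLink *find_pending(PendingLink *head, const void *rootdata,
                                 const void *leafdata)
{
  for (PendingLink *l = head; l != NULL; l = l->next) {
    if (l->rootdata == rootdata && l->leafdata == leafdata) return l;
  }
  return NULL;
}

/* Add a link to the pending list. Returns 0 on success, or -1 when a link
 * with the same (rootdata, leafdata) pair is already pending, since the
 * two keys alone could not distinguish the two communications. */
static int register_pending(PendingLink **head, PendingLink *link)
{
  assert(head != NULL && link != NULL);
  if (find_pending(*head, link->rootdata, link->leafdata) != NULL) return -1;
  link->next = *head;
  *head = link;
  return 0;
}
```

Two communications that share only rootdata, or only leafdata, register without conflict under this scheme; only an exact repeat of both keys is refused.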


# da2e4c71 23-Jun-2019 Junchao Zhang <jczhang@mcs.anl.gov>

Replace PetscMemcpy with PetscArraycpy

