| 2db7ab32 | 15-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Doc - fix L-vector indexing in dev docs (#557)
* Doc - fix L-vector indexing in dev docs
* Doc - fix same issue in ElemRestriction doc |
| 95e93d34 | 15-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Improve Developer Documentation (#556)
* doc - document l/e/q vector layouts and object inheritance
* doc - clean up some referencing
* doc - fix some reference errors
* doc - fix enumeration in inheritance list
* doc - include note for dissimilar E/Q layouts
|
| c04a41a7 | 15-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Composite Operator support for AssembleLinearDiag/PBDiag (#552)
* ref - add composite operator support for building diagonal/pb diagonal
* ref - add error for non-composite multi-field operator diagonal/pb diagonal assembly
* tap - add t538 exclusion because OCCA does not support galleries
* tests - adjust test cases for ceed examples for test coverage
* Op - fix documentation
|
| b1d74153 | 12-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
CUDA/MAGMA - add AssembleLinearQFunction (#553)
* CUDA - refactor operator apply for CUDA assemble linear QF impl
* CUDA/MAGMA - add AssembleLinearQFunction
* make style
* CUDA - clean up assembleLinearQF after q/e layout refactor
* CUDA - fallback operator for cuda/gen to cuda/ref
* CUDA - use delegation for cuda/gen preferred memtype
|
| 05fc0d27 | 12-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #555 from CEED/jeremy/ci-badge
README - update Travis badge |
| 70e59d80 | 12-Jun-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
README - update Travis badge |
| 49fd234c | 12-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Convert CUDA ref/reg/shared E-Layout (#554)
* tests - update tests for multiple e-layouts
* CUDA - convert ref and reg restrictions to Q-layout
* CUDA - ref/reg/shared use gen/magma E-Layout for multi element basis apply and operator apply
* CUDA/MAGMA - drop eandqdiffer and separate MAGMA operator code
* CUDA - update operator comment
* reg - clarify read/write dofs/quads
* CUDA - drop dead code
|
| 5f38a0bf | 11-Jun-2020 |
Valeria Barra <39932030+valeriabarra@users.noreply.github.com> |
Merge pull request #551 from CEED/valeria/benchmarking-petsc-bps
PETSc Bps: allow looping over degrees, problems and backends |
| 5f284d84 | 09-Jun-2020 |
Jed Brown <jed@jedbrown.org> |
examples/petsc/bps: reuse the same DMPlex for all desired BPs and CEED resources
DM partitioning and redistribution for low-order discretizations are much more expensive than a solve. This way we get more reuse from the DMPlex.
|
| 981e58ae | 08-Jun-2020 |
valeriabarra <valeriabarra21@gmail.com> |
postprocess_plot.py: style |
| da9108ad | 08-Jun-2020 |
valeriabarra <valeriabarra21@gmail.com> |
PETSc bps & bpsraw: Remove benchmark_mode (not needed anymore) and always disregard first run performance
This is preferred for benchmarking purposes, so that we are preloading the cache. |
| 8d0bb2bb | 08-Jun-2020 |
valeriabarra <valeriabarra21@gmail.com> |
PETSc bpsraw: always print DoFs/Sec in CG info (except when in test mode) |
| 4c583f1f | 08-Jun-2020 |
valeriabarra <valeriabarra21@gmail.com> |
PETSc bps: always print DoFs/Sec in CG info (except when in test mode) |
| 6001b4f4 | 08-Jun-2020 |
valeriabarra <valeriabarra21@gmail.com> |
petsc/bps/README: Use code style to avoid bold type and specify FEM nodes just to avoid confusion with compute nodes |
| 565a3730 | 08-Jun-2020 |
Jed Brown <jed@jedbrown.org> |
examples/petsc/bps: allow looping over problems and CEED resources. |
| c36f77d8 | 08-Jun-2020 |
Jed Brown <jed@jedbrown.org> |
examples/petsc/bps: add -ksp_max_it_clip and apply DRY |
| aa2aa0ae | 06-Jun-2020 |
Jed Brown <jed@jedbrown.org> |
benchmarks: simplify and improve legacy parsing |
| dec49e00 | 06-Jun-2020 |
Jed Brown <jed@jedbrown.org> |
benchmarks: add to autopep8 |
| dd839fb7 | 06-Jun-2020 |
Jed Brown <jed@jedbrown.org> |
benchmarks: parse logs from self-contained examples/petsc/bps.c |
| d13e9b48 | 28-May-2020 |
Jed Brown <jed@jedbrown.org> |
benchmarks: script maintenance
* rename scripts to use _ instead of - (so they can be imported)
* order -> degree (remove ambiguity; the column means polynomial degree)
* postprocess_base.py: add ksp_its and time_per_it
|
| 53a0f73b | 06-Jun-2020 |
Jed Brown <jed@jedbrown.org> |
examples/petsc/bps: support degree lists and local_nodes ranges
* print hostname, total number of ranks, and number of ranks per node
* allow number of ranks per node to be specified (mostly in case one has an old MPI that doesn't support MPI_Comm_split_type) and hostname to be specified at the command line
|
| 1794bc00 | 06-Jun-2020 |
Jed Brown <jed@jedbrown.org> |
make style |
| d965c7a7 | 06-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
CPU Impl of AssemblePointBlockDiagonal (#503)
* Operator - add AssemblePointBlockDiagonal for CPU backends, with test
* CUDA - add point block diagonal not supported message
* make style
* Operator - improve point block description
* OCCA - explicitly remove OCCA fallback to CPU assembly functions, will update after new OCCA backend
* Op - remove gap removal in point block diagonal
* Op - update diagonal assembly documentation
* Update backends/ref/ceed-ref-operator.c
Co-authored-by: Jed Brown <jed@jedbrown.org>
* style - fix extra space in * with nopad
Co-authored-by: Jed Brown <jed@jedbrown.org>
|
| 1e284482 | 05-Jun-2020 |
valeriabarra <valeriabarra21@gmail.com> |
PETSc BPs: Refactor benchmark mode to run in loop |
| 4eb74d6c | 29-May-2020 |
Jed Brown <jed@jedbrown.org> |
travis-ci: MFEM_USE_SIMD=NO on ppc64le to fix build
Suggested-by: Veselin Dobrev <dobrev@llnl.gov> |