| #
6d69246a
|
| 21-Dec-2021 |
Jeremy L Thompson <jeremy@jeremylt.org> |
cuda - separate compile functionality into new header
|
| #
aa616312
|
| 10-Dec-2021 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #855 from CEED/jeremy/drop-oper-e-data
Remove magic number 16
|
| #
bf4cb664
|
| 08-Dec-2021 |
Jeremy L Thompson <jeremy@jeremylt.org> |
op - turn magic number 16 into constant CEED_FIELD_MAX
|
| #
0b548709
|
| 14-Sep-2021 |
Jeremy L Thompson <jeremy@jeremylt.org> |
Merge pull request #806 from CEED/jeremy/get-fields
Promote Field Getters to Public API
|
| #
7e7773b5
|
| 09-Sep-2021 |
Jeremy L Thompson <jeremy@jeremylt.org> |
interface - refactor *GetFields to include number of fields
|
| #
c53bf7d0
|
| 13-Sep-2021 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #810 from CEED/cuda-gen-launch
Adjust cuda-gen launch to limit z dimension of thread block
|
| #
13516544
|
| 13-Sep-2021 |
nbeams <246972+nbeams@users.noreply.github.com> |
Check z dimension thread block limits before launching
|
| #
26513686
|
| 08-Sep-2021 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #804 from CEED/jed/cuda-register-block-size
make CUDA block sizes fit according to number of used registers
|
| #
39532ceb
|
| 07-Sep-2021 |
Jed Brown <jed@jedbrown.org> |
backends/cuda-gen: use occupancy to calculate launch sizes
Choose sizes that actually fit while being big enough to amortize thread block overhead and choosing sizes that permit high occupancy.
htt
backends/cuda-gen: use occupancy to calculate launch sizes
Choose sizes that actually fit while being big enough to amortize thread block overhead and choosing sizes that permit high occupancy.
https://developer.nvidia.com/blog/cuda-pro-tip-occupancy-api-simplifies-launch-configuration/
show more ...
|
| #
eaf62fff
|
| 02-Aug-2021 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Refactor Preconditioning Ingredients (#795)
* op - split preconditioning into separate file
* pc - move FDM to interface level
* pc - initial move of diagonal assembly to interface
* interf
Refactor Preconditioning Ingredients (#795)
* op - split preconditioning into separate file
* pc - move FDM to interface level
* pc - initial move of diagonal assembly to interface
* interface - only return error code or error
* doc - improve preconditioning helper function documentation
* pc - use qdata layout for interface diagonal assembly
* pc - update for consistency in naming, use of CeedChk, and static
* doc - spelling
* style - minor
* doc - add doc for CeedBuildMassLaplace
* doc - update release notes
Co-authored-by: Jed Brown <jed@jedbrown.org>
Co-authored-by: Jed Brown <jed@jedbrown.org>
show more ...
|
| #
81d4e52a
|
| 30-Apr-2021 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #750 from CEED/jeremy/identity
qf - allow identity qfunctions to have in and out EVAL_NONE
|
| #
0b454692
|
| 22-Apr-2021 |
jeremylt <thompson.jeremy.luke@gmail.com> |
op - impl restriction only identity operators for CPU, error on code gen GPU backends
|
| #
874019bc
|
| 31-Mar-2021 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #716 from CEED/jed/install-backend.h
Jed/install backend.h
|
| #
ec3da8bc
|
| 26-Mar-2021 |
Jed Brown <jed@jedbrown.org> |
Install install backend headers under include/ceed/
This makes it possible to distribute source plugins that provide additional backends. It's also used in MFEM, perhaps temporarily.
Deprecate ceed
Install install backend headers under include/ceed/
This makes it possible to distribute source plugins that provide additional backends. It's also used in MFEM, perhaps temporarily.
Deprecate ceed-backend.h, which was not previously installed, but some users accessed it from an in-place build.
Also install CUDA and HIP headers that allow users to provide CUfunction and hipFunction_t.
Co-authored-by: Jeremy L. Thompson <jeremy.thompson@colorado.edu> Requested-by: Andrew T. Barker <barker29@llnl.gov>
show more ...
|
| #
e15f9bd0
|
| 20-Mar-2021 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Error Handling Improvement [fix #696] (#691)
* Operator - add operator/qfunction field compatibility checks
* QFunction - do not allow adding fields to QFunction in use with an operator
* Exam
Error Handling Improvement [fix #696] (#691)
* Operator - add operator/qfunction field compatibility checks
* QFunction - do not allow adding fields to QFunction in use with an operator
* Examples - add some extra exclusion markers in ceed example
* error - add error enum
* error - update error enum names and numbering
* error - use CEED_ERROR_BACKEND in all backend errors
* error - begin classifying interface errors
* error - update backends to use CEED_ERROR_SUCCESS and CeedChkBackend
* error - use new errors in gallery
* error - add some unsaved modifications
* error - improve documentation
* error - define CEED_ERROR_SUCCESS in GPU JiT; we really should have a common header to pipe defines to the JiT code
* error - more error code editing
* error - fix error string
* operator - fix setting field qpts
* basis - add input/output dimension error checking
* python - move basis utility methods to ceed object, no basis required or used
* python - force exit with negative error code
* make style-py
* rust - initial work to add error handling logic
* rust - add ceed.resource method
* rust - add results for methods that may fail
* rust - also format doctests
* minor - drop unused CeedChk()
* error - rename terminal/nonterminal to major/minor
* rust - set ErrorStore as default errorhandler
* python - revert error handing change for python
* python - use success error code from C bindings
* error - only upgrade error code in backend if positive
show more ...
|
| #
3d576824
|
| 29-Jan-2021 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
headers - clearify includes to not rely on transitive includes (#701)
* headers - clearify includes to not rely on transitive includes
* style - add header recommendations from 'include-what-you-
headers - clearify includes to not rely on transitive includes (#701)
* headers - clearify includes to not rely on transitive includes
* style - add header recommendations from 'include-what-you-use'
* style - apply 'include-what-you-use' changes to CUDA backends
* style - 'include-what-you-use' for hip backends
* style - drop ceed.h includes in gallery qf source
* docs - add dev notes for header files
* style - header style and alphabetize
show more ...
|
| #
2a8ae808
|
| 17-Sep-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
Merge branch 'main' into icl/hip-magma
|
| #
18d499f1
|
| 17-Sep-2020 |
Yohann <dudouit1@llnl.gov> |
Enable under-integration for cuda-shared and cuda-gen backends (#620)
* Support under integration in cuda-shared.
* Add under-integration to the cuda-gen backend.
* Fix bugs when under-integ i
Enable under-integration for cuda-shared and cuda-gen backends (#620)
* Support under integration in cuda-shared.
* Add under-integration to the cuda-gen backend.
* Fix bugs when under-integ in cuda-shared.
* Factor some code.
* Factor some code in cuda-gen.
* Guard more carefully.
* Introduce T1d.
* Fix a bug in readQuads3d
* Fix bugs in 3D.
* Fix a typo
* Safety init.
* Try something with ContractZ3d.
* Guard the add
* revert add.
* Add more thread guards
* Same as previous
* Fix a bug in add.
* style.
* Check that the bases are tensor in cuda-gen.
* move isTensor
* Add T1d to cuda-gen and guard contractions.
* Fix typos.
* add guards in 1d.
* Rewrite weight functions.
* typo
* CUDA - fix cuda-gen collocated check
* make style.
Co-authored-by: jeremylt <thompson.jeremy.luke@gmail.com>
show more ...
|
| #
777ff853
|
| 14-Aug-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
QFunction Context Data Object (#596)
* Ctx - create context object for QFunction context data
* Context - rename UserContext -> QFunctionContext
* Ctx - add lcov markers
* Ctx - fix leak in
QFunction Context Data Object (#596)
* Ctx - create context object for QFunction context data
* Context - rename UserContext -> QFunctionContext
* Ctx - add lcov markers
* Ctx - fix leak in identity QFunctions
* Hip/Cuda - rename sync functions for vector/context
* Tests - lcov marker update
* QFunction - drop unused function
* Python - fix copy-paste errors
* Ctx - update notes for Fortran usage
* Fortran - drop unneeded cast
Co-authored-by: Jed Brown <jed@jedbrown.org>
* Interface - use void* for SetData interfaces
* Make - use call quiet for NVCC
* Interface - use void* for GetData interfaces
* Make - add quiet call option for examples
* Makefile - create common makefile to reduce duplication/complexity in example makefiles
Co-authored-by: Jed Brown <jed@jedbrown.org>
show more ...
|
| #
ddc5c1c2
|
| 27-Jul-2020 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #594 from CEED/jed/hip-lcov-excl
backends/hip: exclude testing of unimplemented code
|
| #
e9f4dca0
|
| 27-Jul-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
Cuda - add LCOV markers
|
| #
77ad9f29
|
| 15-Jul-2020 |
nbeams <246972+nbeams@users.noreply.github.com> |
Merge branch 'main' into natalie/hip-ref-v2
|
| #
d2d7e99c
|
| 26-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #564 from CEED/jeremy/cuda-diag
CUDA - LinearAssemble*Diagonal
|
| #
868092e3
|
| 19-Jun-2020 |
Jeremy L Thompson <thompson.jeremy.luke@gmail.com> |
CUDA - inital version of LinearAssemble*
|
| #
4d36c801
|
| 24-Jun-2020 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #568 from CEED/jeremy/cuda-destroy
Small CUDA Tidying
|