History log of /libCEED/rust/libceed-sys/c-src/backends/cuda-gen/ceed-cuda-gen-operator.c (Results 76 – 100 of 134)
Revision Date Author Comments
# 6d69246a 21-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

cuda - separate compile functionality into new header


# aa616312 10-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #855 from CEED/jeremy/drop-oper-e-data

Remove magic number 16


# bf4cb664 08-Dec-2021 Jeremy L Thompson <jeremy@jeremylt.org>

op - turn magic number 16 into constant CEED_FIELD_MAX


# 0b548709 14-Sep-2021 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #806 from CEED/jeremy/get-fields

Promote Field Getters to Public API


# 7e7773b5 09-Sep-2021 Jeremy L Thompson <jeremy@jeremylt.org>

interface - refactor *GetFields to include number of fields


# c53bf7d0 13-Sep-2021 Jed Brown <jed@jedbrown.org>

Merge pull request #810 from CEED/cuda-gen-launch

Adjust cuda-gen launch to limit z dimension of thread block


# 13516544 13-Sep-2021 nbeams <246972+nbeams@users.noreply.github.com>

Check z dimension thread block limits before launching


# 26513686 08-Sep-2021 Jed Brown <jed@jedbrown.org>

Merge pull request #804 from CEED/jed/cuda-register-block-size

make CUDA block sizes fit according to number of used registers


# 39532ceb 07-Sep-2021 Jed Brown <jed@jedbrown.org>

backends/cuda-gen: use occupancy to calculate launch sizes

Choose sizes that actually fit while being big enough to amortize thread
block overhead and choosing sizes that permit high occupancy.

htt

backends/cuda-gen: use occupancy to calculate launch sizes

Choose sizes that actually fit while being big enough to amortize thread
block overhead and choosing sizes that permit high occupancy.

https://developer.nvidia.com/blog/cuda-pro-tip-occupancy-api-simplifies-launch-configuration/

show more ...


# eaf62fff 02-Aug-2021 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Refactor Preconditioning Ingredients (#795)

* op - split preconditioning into separate file

* pc - move FDM to interface level

* pc - initial move of diagonal assembly to interface

* interf

Refactor Preconditioning Ingredients (#795)

* op - split preconditioning into separate file

* pc - move FDM to interface level

* pc - initial move of diagonal assembly to interface

* interface - only return error code or error

* doc - improve preconditioning helper function documentation

* pc - use qdata layout for interface diagonal assembly

* pc - update for consistency in naming, use of CeedChk, and static

* doc - spelling

* style - minor

* doc - add doc for CeedBuildMassLaplace

* doc - update release notes

Co-authored-by: Jed Brown <jed@jedbrown.org>

Co-authored-by: Jed Brown <jed@jedbrown.org>

show more ...


# 81d4e52a 30-Apr-2021 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #750 from CEED/jeremy/identity

qf - allow identity qfunctions to have in and out EVAL_NONE


# 0b454692 22-Apr-2021 jeremylt <thompson.jeremy.luke@gmail.com>

op - impl restriction only identity operators for CPU, error on code gen GPU backends


# 874019bc 31-Mar-2021 Jed Brown <jed@jedbrown.org>

Merge pull request #716 from CEED/jed/install-backend.h

Jed/install backend.h


# ec3da8bc 26-Mar-2021 Jed Brown <jed@jedbrown.org>

Install install backend headers under include/ceed/

This makes it possible to distribute source plugins that provide
additional backends. It's also used in MFEM, perhaps temporarily.

Deprecate ceed

Install install backend headers under include/ceed/

This makes it possible to distribute source plugins that provide
additional backends. It's also used in MFEM, perhaps temporarily.

Deprecate ceed-backend.h, which was not previously installed, but some
users accessed it from an in-place build.

Also install CUDA and HIP headers that allow users to provide CUfunction
and hipFunction_t.

Co-authored-by: Jeremy L. Thompson <jeremy.thompson@colorado.edu>
Requested-by: Andrew T. Barker <barker29@llnl.gov>

show more ...


# e15f9bd0 20-Mar-2021 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Error Handling Improvement [fix #696] (#691)

* Operator - add operator/qfunction field compatibility checks

* QFunction - do not allow adding fields to QFunction in use with an operator

* Exam

Error Handling Improvement [fix #696] (#691)

* Operator - add operator/qfunction field compatibility checks

* QFunction - do not allow adding fields to QFunction in use with an operator

* Examples - add some extra exclusion markers in ceed example

* error - add error enum

* error - update error enum names and numbering

* error - use CEED_ERROR_BACKEND in all backend errors

* error - begin classifying interface errors

* error - update backends to use CEED_ERROR_SUCCESS and CeedChkBackend

* error - use new errors in gallery

* error - add some unsaved modifications

* error - improve documentation

* error - define CEED_ERROR_SUCCESS in GPU JiT; we really should have a common header to pipe defines to the JiT code

* error - more error code editing

* error - fix error string

* operator - fix setting field qpts

* basis - add input/output dimension error checking

* python - move basis utility methods to ceed object, no basis required or used

* python - force exit with negative error code

* make style-py

* rust - initial work to add error handling logic

* rust - add ceed.resource method

* rust - add results for methods that may fail

* rust - also format doctests

* minor - drop unused CeedChk()

* error - rename terminal/nonterminal to major/minor

* rust - set ErrorStore as default errorhandler

* python - revert error handing change for python

* python - use success error code from C bindings

* error - only upgrade error code in backend if positive

show more ...


# 3d576824 29-Jan-2021 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

headers - clearify includes to not rely on transitive includes (#701)

* headers - clearify includes to not rely on transitive includes

* style - add header recommendations from 'include-what-you-

headers - clearify includes to not rely on transitive includes (#701)

* headers - clearify includes to not rely on transitive includes

* style - add header recommendations from 'include-what-you-use'

* style - apply 'include-what-you-use' changes to CUDA backends

* style - 'include-what-you-use' for hip backends

* style - drop ceed.h includes in gallery qf source

* docs - add dev notes for header files

* style - header style and alphabetize

show more ...


# 2a8ae808 17-Sep-2020 nbeams <246972+nbeams@users.noreply.github.com>

Merge branch 'main' into icl/hip-magma


# 18d499f1 17-Sep-2020 Yohann <dudouit1@llnl.gov>

Enable under-integration for cuda-shared and cuda-gen backends (#620)

* Support under integration in cuda-shared.

* Add under-integration to the cuda-gen backend.

* Fix bugs when under-integ i

Enable under-integration for cuda-shared and cuda-gen backends (#620)

* Support under integration in cuda-shared.

* Add under-integration to the cuda-gen backend.

* Fix bugs when under-integ in cuda-shared.

* Factor some code.

* Factor some code in cuda-gen.

* Guard more carefully.

* Introduce T1d.

* Fix a bug in readQuads3d

* Fix bugs in 3D.

* Fix a typo

* Safety init.

* Try something with ContractZ3d.

* Guard the add

* revert add.

* Add more thread guards

* Same as previous

* Fix a bug in add.

* style.

* Check that the bases are tensor in cuda-gen.

* move isTensor

* Add T1d to cuda-gen and guard contractions.

* Fix typos.

* add guards in 1d.

* Rewrite weight functions.

* typo

* CUDA - fix cuda-gen collocated check

* make style.

Co-authored-by: jeremylt <thompson.jeremy.luke@gmail.com>

show more ...


# 777ff853 14-Aug-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

QFunction Context Data Object (#596)

* Ctx - create context object for QFunction context data

* Context - rename UserContext -> QFunctionContext

* Ctx - add lcov markers

* Ctx - fix leak in

QFunction Context Data Object (#596)

* Ctx - create context object for QFunction context data

* Context - rename UserContext -> QFunctionContext

* Ctx - add lcov markers

* Ctx - fix leak in identity QFunctions

* Hip/Cuda - rename sync functions for vector/context

* Tests - lcov marker update

* QFunction - drop unused function

* Python - fix copy-paste errors

* Ctx - update notes for Fortran usage

* Fortran - drop unneeded cast

Co-authored-by: Jed Brown <jed@jedbrown.org>

* Interface - use void* for SetData interfaces

* Make - use call quiet for NVCC

* Interface - use void* for GetData interfaces

* Make - add quiet call option for examples

* Makefile - create common makefile to reduce duplication/complexity in example makefiles

Co-authored-by: Jed Brown <jed@jedbrown.org>

show more ...


# ddc5c1c2 27-Jul-2020 Jed Brown <jed@jedbrown.org>

Merge pull request #594 from CEED/jed/hip-lcov-excl

backends/hip: exclude testing of unimplemented code


# e9f4dca0 27-Jul-2020 Jeremy L Thompson <thompson.jeremy.luke@gmail.com>

Cuda - add LCOV markers


# 77ad9f29 15-Jul-2020 nbeams <246972+nbeams@users.noreply.github.com>

Merge branch 'main' into natalie/hip-ref-v2


# d2d7e99c 26-Jun-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #564 from CEED/jeremy/cuda-diag

CUDA - LinearAssemble*Diagonal


# 868092e3 19-Jun-2020 Jeremy L Thompson <thompson.jeremy.luke@gmail.com>

CUDA - inital version of LinearAssemble*


# 4d36c801 24-Jun-2020 Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com>

Merge pull request #568 from CEED/jeremy/cuda-destroy

Small CUDA Tidying


123456