Lines Matching refs:backend
102 - OCCA backend updated to latest OCCA release; DPC++ and OMP OCCA modes enabled.
103 …e OCCA parser, typedefs are required to use pointers to arrays in QFunctions with the OCCA backend.
133 …red` and `/gpu/cuda/gen` as well as `/gpu/hip/shared` and `/gpu/hip/gen` backend to improve mainta…
166 - Warning added when compiling OCCA backend to alert users that this backend is experimental.
167 - `ceed-backend.h`, `ceed-hash.h`, and `ceed-khash.h` removed. Users should use `ceed/backend.h`, `…
187 - Added support for non-tensor $H(\text{div})$ elements, to include CPU backend implementations and…
195 - Create backend constant `CEED_FIELD_MAX` to reduce magic numbers in codebase.
223 …ckends have been removed. These backends were intially added to test the backend inheritance mecha…
261 - Replace various uses of {code}`Ceed*Get*Status` with {code}`Ceed*Is*` in the backend API to match…
273 - New HIP backend: `/gpu/hip/ref`.
278 - OCCA backend rebuilt to facilitate future performance enhancements.
287 - The `/gpu/cuda/reg` backend has been removed, with its core features moved into `/gpu/cuda/ref` a…
314 - New backend: `/cpu/self/memcheck/serial`.
317 - Fix some missing edge cases in CUDA backend.
321 - MAGMA backend performance optimization and non-tensor bases.
375 the family of backends, of which, the new `cuda-gen` backend achieves state-of-the-art
377 can define Q-Functions in a single source code independently of the targeted backend
388 `/cpu/self/ref/memcheck` backend (which relies upon the
412 | `/cpu/self/ref/memcheck` | Memcheck backend, undefined value checks |
461 four new CPU backends, two new GPU backends, CPU backend optimizations, initial
471 The `/gpu/cuda/ref` backend is a reference CUDA backend, providing reasonable
472 performance for most problem configurations. The `/gpu/cuda/reg` backend uses a simple
475 backend unroll loops and map memory address to registers. The `/gpu/cuda/reg` backend
533 non-tensor bases, backend optimization, and improved Fortran interface. This release
542 used in an optimized CPU backend. Although this is typically not visible to the user,
544 This CPU backend also implements an algebraic factorization of tensor product gradients
587 A MAGMA backend (which relies upon the
591 As any other backend, the MAGMA backend provides extended basic data structures for
636 This release also contained major improvements in the OCCA backend (including a new
637 `/ocl/occa` backend) and new examples. The standalone libCEED example was modified to
680 implementation for them serves as the basis for specialized backend implementations.