History log of /libCEED/ (Results 2776 – 2800 of 5646)
Revision Date Author Comments
e03682af 13-Sep-2021 Jeremy L Thompson <jeremy@jeremylt.org>

rust - add is_some() and is_none() for vector/elemrestriction/basis/qfunction opts

c68be7a2 13-Sep-2021 Jeremy L Thompson <jeremy@jeremylt.org>

rust - remove unwrap() in documentation in favor of ?

8ef3655d 13-Sep-2021 David Medina <dmed256@gmail.com>

[#808][OCCA] Avoid verifying objects in destructors (#809)

[#808][OCCA] Avoid verifying objects in destructors

* Update backends/occa/ceed-occa-qfunction.cpp

Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>


c33c11c1 10-Sep-2021 Will Pazner <will.e.p@gmail.com>

[Julia] Bump minimum_libceed_version. Change version comparison logic.

Add ceedversion_ge, which performs the same comparison as the macro
CEED_VERSION_GE. Non-release builds compare as infinity.
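
As a rough illustration of the comparison described in this commit, the sketch below models a version check in which a non-release build compares as infinity. It is written in plain Python rather than Julia, and the names `version_key` and `version_ge` are hypothetical; they are not part of LibCEED.jl or libCEED.

```python
import math


def version_key(major, minor, patch, release):
    """Return a sortable key; a non-release build compares as infinity."""
    if not release:
        return (math.inf, math.inf, math.inf)
    return (major, minor, patch)


def version_ge(current, required):
    """Check whether `current` is at least `required`.

    current  -- (major, minor, patch, release) of the loaded library
    required -- (major, minor, patch) minimum version
    """
    return version_key(*current) >= tuple(required)


if __name__ == "__main__":
    print(version_ge((0, 9, 0, True), (0, 9, 0)))   # release build, equal version
    print(version_ge((0, 8, 1, True), (0, 9, 0)))   # release build, older version
    print(version_ge((0, 1, 0, False), (0, 9, 0)))  # development build
```

Under this model a development build satisfies any minimum version requirement, while a release build is compared component by component.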

443fcf8a 10-Sep-2021 Will Pazner <will.e.p@gmail.com>

[Julia] update bindings

e9b533fb 09-Sep-2021 Jeremy L Thompson <jeremy@jeremylt.org>

doc - add advanced function classification

f479eb23 09-Sep-2021 Jeremy L Thompson <jeremy@jeremylt.org>

doc - add single precision release note

f04ea552 09-Sep-2021 Jeremy L Thompson <jeremy@jeremylt.org>

qf/op - make immutability conditions explicit

28567f8f 09-Sep-2021 Jeremy L Thompson <jeremy@jeremylt.org>

interface - add CeedOperatorGetFieldName

43bbe138 09-Sep-2021 Jeremy L Thompson <jeremy@jeremylt.org>

interface - promote field getters to public API

7e7773b5 09-Sep-2021 Jeremy L Thompson <jeremy@jeremylt.org>

interface - refactor *GetFields to include number of fields

c53bf7d0 13-Sep-2021 Jed Brown <jed@jedbrown.org>

Merge pull request #810 from CEED/cuda-gen-launch

Adjust cuda-gen launch to limit z dimension of thread block

13516544 13-Sep-2021 nbeams <246972+nbeams@users.noreply.github.com>

Check z dimension thread block limits before launching
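
A minimal sketch of the kind of check this commit describes, assuming the launch assigns one element to each z index of the thread block. The function name `clamp_block_z` is hypothetical, and the default limit of 64 is the z-dimension maximum reported by current CUDA devices; a real backend would query the device for this limit rather than hard-code it.

```python
def clamp_block_z(num_elem, requested_z, max_block_z=64):
    """Return (block_z, grid_size) with block_z within the device limit.

    num_elem    -- number of elements to process
    requested_z -- elements per block the caller would like to use
    max_block_z -- device limit on the z dimension of a thread block
    """
    if num_elem < 1 or requested_z < 1:
        raise ValueError("num_elem and requested_z must be positive")
    block_z = min(requested_z, max_block_z)
    grid_size = (num_elem + block_z - 1) // block_z  # ceiling division
    return block_z, grid_size


if __name__ == "__main__":
    print(clamp_block_z(1000, 128))  # request exceeds the limit, so z is clamped
    print(clamp_block_z(10, 4))      # request already fits
```

Clamping the z dimension and growing the grid keeps the total amount of work unchanged while avoiding an invalid launch configuration.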

26513686 08-Sep-2021 Jed Brown <jed@jedbrown.org>

Merge pull request #804 from CEED/jed/cuda-register-block-size

make CUDA block sizes fit according to number of used registers

39532ceb 07-Sep-2021 Jed Brown <jed@jedbrown.org>

backends/cuda-gen: use occupancy to calculate launch sizes

Choose sizes that actually fit while being big enough to amortize thread
block overhead and choosing sizes that permit high occupancy.

https://developer.nvidia.com/blog/cuda-pro-tip-occupancy-api-simplifies-launch-configuration/


44abf3e8 07-Sep-2021 Jed Brown <jed@jedbrown.org>

backends/cuda: record cudaDeviceProp struct instead of just max block size

4853cbf0 05-Sep-2021 Jed Brown <jed@jedbrown.org>

backends/cuda: choose block size based on number of registers used by kernel

Complicated QFunctions, such as those in solid mechanics, use too many
registers to launch blocks of 1024 threads (hardware max on
Volta/Ampere). We ask the kernel how large a block it can use and select
that block size. As a refinement, we could consider making the block
sizes smaller if there are fewer blocks than SMs (strong scaling limit).
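
To make the register constraint concrete, here is a simplified Python model of why a kernel that uses many registers per thread cannot launch with 1024 threads. The commit itself asks the CUDA runtime how large a block the kernel can use; the function `max_block_size` below is a hypothetical stand-in, and the default of 65536 registers per block is an assumed device value. The model ignores register allocation granularity.

```python
def max_block_size(regs_per_thread, regs_per_block=65536,
                   hw_max_threads=1024, warp_size=32):
    """Estimate the largest block size that fits in the register budget.

    regs_per_thread -- registers the compiled kernel uses per thread
    regs_per_block  -- register budget available to one block (assumed)
    hw_max_threads  -- hardware maximum threads per block
    warp_size       -- block sizes are rounded down to a multiple of this
    """
    if regs_per_thread < 1:
        raise ValueError("regs_per_thread must be positive")
    fit = regs_per_block // regs_per_thread  # threads the budget allows
    fit = (fit // warp_size) * warp_size     # round down to whole warps
    return max(warp_size, min(hw_max_threads, fit))


if __name__ == "__main__":
    for regs in (32, 64, 128, 255):
        print(regs, max_block_size(regs))
```

In this model a kernel using 64 or fewer registers per thread can still use the 1024-thread hardware maximum, while heavier kernels are limited to smaller blocks.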


a784c500 02-Sep-2021 Jed Brown <jed@jedbrown.org>

Merge pull request #787 from CEED/yohann/cuda-gen/restrict

Use more `__restrict__` in `cuda-gen` backend.

80a9ef05 02-Sep-2021 Natalie Beams <246972+nbeams@users.noreply.github.com>

Allow CeedScalar to be single precision (#788)

One can modify `ceed.h` to include `ceed-f32.h` and then use single precision. This is tested for C in CI and has been tested by developers with Rust, Julia, and Python. This interface is evolving and should be considered experimental at this time (thus lack of automated build support).

* Introduce CeedScalarType enum

* WIP changes to allow different definitions of CeedScalar

* Introduce new header files for float and double

* Only use avx tensor contract and MAGMA non-tensor basis if CeedScalar is double

* WIP changes to allow CeedScalar to be float

* WIP start trying to adjust test tolerances for float or double

* fix typos in comments

* install ceed-f32/64 headers

* Fix missing casts for hipMAGMA element restrictions

* make CeedQFunctionContextGetContextSize available for Python bindings

* Changes to Python bindings to allow CeedScalar to be float

* WIP adjust Python tests for float or double

* make style

* remove QFunctionContextGetContextSize from backend header

* Use quotes instead of <> in include statement

* Remove unnecessary includes

* Update tolerances for tests

* [Julia] allow CeedScalar to be Float32

* [Julia] Use Preferences instead of custom build configuration

# Conflicts:
# julia/LibCEED.jl/src/C.jl

* [Makefile] Change definition of CC_VENDOR so it works with cross-compilation

* [Julia] Use Preferences in CI

# Conflicts:
# .github/workflows/julia-test-with-style.yml

* [Julia] Update docs about preferences

* [Julia] Add test/Project.toml workaround for Preferences

* Add CeedGetScalarType to get the type of CeedScalar at runtime

* [Julia] Move functions from Ceed.jl to LibCEED.jl

* [Julia] Add support for getting library path and scalar type at runtime

* [Julia] Minor change to checking if CUDA is loaded

* [Julia] Check correct CeedScalar types in basis functions

* [Julia] Fix tests comparing with output file

* [Julia] Change devtests to use CeedScalar instead of Float64

* Update test 402 so context will be same size in double or float

* Update tolerances for ceed examples

* [Julia] CUDA fixes

* remove unused variable in t208

* SchurDecomposition: do not compute tau on final iteration

* Update tolerances for some basis tests (for single precision)

* Make style

* Python style fixes for basis test

* Add single precision output for t300 and t320 and adjust checks; skip t541 in single

* Add LCOV exclusions after moving to new line

* fix spacing

* Python: make CEED_EPSILON available as libceed.EPSILON

* Python: optional parameter to specify different output file for test comparison

* Python: update tests' use of EPSILON and change test_300 output file for single precision

* Python: add convenience function for getting dtype corresponding to CeedScalar

* rust - add single precision support

* [Julia] Fall back on Float64 if CeedGetScalarType is not available

* [Julia] style

* Adjust tolerance for t301

* xsmm - add single precision support

* avx - add single precision support

* Add initial single precision support for MAGMA non-tensor basis

* Skip t300 and t320 in single precision; revert Python t300 changes

* Revert output changes for t300 and t320 in junit

* [Julia] Changes to autogenerated bindings for mixed precision

* [Julia] style

* [Julia] Check scalar type when changing libceed library path

The check is also performed when the package is loaded. This prevents having to
restart the Julia session twice

* [Julia] Require JLLWrappers version 1.3

This is needed to use Preferences to change the library path

* Add documentation page for precision development

Co-authored-by: Will Pazner <will.e.p@gmail.com>

* Cleanup from merge: remove old README

* Return CEED_ALIGN to backend.h

* Make Fortran compiler (FC) optional; empty skips Fortran tests

Use in Python and Rust builds, which may not have a Fortran compiler
installed and thus would produce confusing output.

* Add single precision CI test for Noether

Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>

Co-authored-by: Will Pazner <will.e.p@gmail.com>
Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>
Co-authored-by: Jed Brown <jed@jedbrown.org>



.gitlab-ci.yml
Makefile
backends/avx/ceed-avx-blocked.c
backends/avx/ceed-avx-serial.c
backends/avx/ceed-avx-tensor-f32.c
backends/avx/ceed-avx-tensor-f64.c
backends/avx/ceed-avx.h
backends/cuda-gen/ceed-cuda-gen-operator-build.cpp
backends/cuda-shared/ceed-cuda-shared-basis.c
backends/cuda-shared/kernels/cuda-shared-basis.cu
backends/cuda/ceed-cuda-basis.c
backends/cuda/ceed-cuda-vector.c
backends/cuda/ceed-cuda.c
backends/hip-gen/ceed-hip-gen-operator-build.cpp
backends/hip-shared/ceed-hip-shared-basis.c
backends/hip/ceed-hip-basis.c
backends/hip/ceed-hip-compile.cpp
backends/hip/ceed-hip-vector.c
backends/magma/ceed-magma-basis.c
backends/magma/ceed-magma.h
backends/magma/kernels/common/elem_restriction.h
backends/magma/kernels/common/interp_device.h
backends/magma/kernels/common/magma_common_device.h
backends/magma/kernels/cuda/magma_drestrictApply.cu
backends/magma/kernels/hip/magma_drestrictApply.hip.cpp
backends/magma/magma_sgemm_nontensor.c
backends/xsmm/ceed-xsmm-blocked.c
backends/xsmm/ceed-xsmm-serial.c
backends/xsmm/ceed-xsmm-tensor_f32.c
backends/xsmm/ceed-xsmm-tensor_f64.c
backends/xsmm/ceed-xsmm.h
doc/sphinx/source/index.md
doc/sphinx/source/precision.md
examples/ceed/ex1-volume.c
examples/ceed/ex2-surface.c
examples/rust/ex1-volume/src/main.rs
examples/rust/ex1-volume/src/transform.rs
examples/rust/ex2-surface/src/main.rs
examples/rust/ex2-surface/src/transform.rs
examples/rust/mesh/src/lib.rs
include/ceed/backend.h
include/ceed/ceed-f32.h
include/ceed/ceed-f64.h
include/ceed/ceed.h
include/ceed/fortran.h
interface/ceed-basis.c
interface/ceed-qfunctioncontext.c
interface/ceed.c
julia/LibCEED.jl/Project.toml
julia/LibCEED.jl/gen/generator.toml
julia/LibCEED.jl/src/Basis.jl
julia/LibCEED.jl/src/C.jl
julia/LibCEED.jl/src/Cuda.jl
julia/LibCEED.jl/src/Globals.jl
julia/LibCEED.jl/src/LibCEED.jl
julia/LibCEED.jl/src/generated/libceed_bindings.jl
julia/LibCEED.jl/test/output/Float32/b1.out
julia/LibCEED.jl/test/output/Float32/b2.out
julia/LibCEED.jl/test/output/Float32/b3.out
julia/LibCEED.jl/test/output/Float64/b1.out
julia/LibCEED.jl/test/output/Float64/b2.out
julia/LibCEED.jl/test/output/Float64/b3.out
julia/LibCEED.jl/test/runtests.jl
python/__init__.py
python/build_ceed_cffi.py
python/ceed.py
python/ceed_basis.py
python/ceed_constants.py
python/ceed_qfunctioncontext.py
python/ceed_vector.py
python/tests/buildmats.py
python/tests/libceed-qfunctions.c
python/tests/test-1-vector.py
python/tests/test-2-elemrestriction.py
python/tests/test-3-basis.py
python/tests/test-4-qfunction.py
python/tests/test-5-operator.py
rust/libceed-sys/build.rs
rust/libceed/src/basis.rs
rust/libceed/src/elem_restriction.rs
rust/libceed/src/lib.rs
rust/libceed/src/operator.rs
rust/libceed/src/qfunction.rs
rust/libceed/src/vector.rs
setup.py
tests/junit.py
tests/t100-vector.c
tests/t101-vector.c
tests/t103-vector.c
tests/t104-vector.c
tests/t105-vector.c
tests/t106-vector.c
tests/t108-vector.c
tests/t109-vector.c
tests/t114-vector.c
tests/t119-vector.c
tests/t200-elemrestriction.c
tests/t201-elemrestriction.c
tests/t204-elemrestriction.c
tests/t205-elemrestriction.c
tests/t206-elemrestriction.c
tests/t207-elemrestriction.c
tests/t209-elemrestriction.c
tests/t300-basis.c
tests/t301-basis.c
tests/t303-basis.c
tests/t304-basis.c
tests/t305-basis.c
tests/t310-basis.c
tests/t311-basis.c
tests/t312-basis.c
tests/t314-basis.c
tests/t315-basis.c
tests/t316-basis.c
tests/t317-basis.c
tests/t320-basis.c
tests/t321-basis.c
tests/t322-basis.c
tests/t323-basis.c
tests/t324-basis.c
tests/t325-basis.c
tests/t401-qfunction.c
tests/t402-qfunction.c
tests/t501-operator.c
tests/t502-operator.c
tests/t503-operator.c
tests/t505-operator.c
tests/t506-operator.c
tests/t507-operator.c
tests/t511-operator.c
tests/t521-operator.c
tests/t522-operator.c
tests/t524-operator.c
tests/t530-operator.c
tests/t531-operator.c
tests/t532-operator.c
tests/t533-operator.c
tests/t534-operator.c
tests/t535-operator.c
tests/t536-operator.c
tests/t537-operator.c
tests/t538-operator.c
tests/t540-operator.c
tests/t541-operator.c
tests/t550-operator.c
tests/t551-operator.c
tests/t552-operator.c
tests/t553-operator.c
tests/t560-operator.c
tests/t561-operator.c
tests/t562-operator.c
tests/t563-operator.c
tests/t564-operator.c
tests/t565-operator.c
tests/tap.sh

ae718e2f 02-Sep-2021 Jed Brown <jed@jedbrown.org>

doc: add note on restrict qualified semantics for QFunctions

3c17d89b 29-Aug-2021 Jed Brown <jed@jedbrown.org>

Merge pull request #803 from CEED/jed/test-junit

testing updates: junit classname, bpsraw tolerances, CUDA on lv

96e2ca22 29-Aug-2021 Jed Brown <jed@jedbrown.org>

gitlab-ci: update lv configuration

* CUDA-11.2+ bug circumvented so use latest 11.4
* Run minimal CPU tests in parallel so parallelism applies to all compilation
* Run GPU tests sequentially to avoid cudaGetDevice returning CUDA_ERROR_NOT_INITIALIZED
* This is weird because nvidia-smi -q reports very low resource utilization
* MPS and retrying within the same process failed, though other
processes can get a device


bc251d84 28-Aug-2021 Jed Brown <jed@jedbrown.org>

cuda/ref: modify weight kernels to avoid CUDA-11.2+ bug on RTX 2080 (issue #802)

b868981d 18-Aug-2021 Jed Brown <jed@jedbrown.org>

examples/fluids: make interface/tests support PETSc main

b9ce5a03 17-Aug-2021 Jed Brown <jed@jedbrown.org>

gitlab-ci: simpler handling of success for codecov upload
