History log of /libCEED/ (Results 2526 – 2550 of 5646)
Revision Date Author Comments
(<<< Hide modified files)
(Show modified files >>>)
c47bfe2b16-Feb-2022 Jed Brown <jed@jedbrown.org>

backends/cuda-shared: limit 1D thread counts

We need to avoid this error:

CUDA_ERROR_LAUNCH_OUT_OF_RESOURCES: max_threads_per_block 512 on block size (24,1,32), shared_size 0, num_regs 106

A pro

backends/cuda-shared: limit 1D thread counts

We need to avoid this error:

CUDA_ERROR_LAUNCH_OUT_OF_RESOURCES: max_threads_per_block 512 on block size (24,1,32), shared_size 0, num_regs 106

A proper solution is to use cuOccupancyMaxPotentialBlockSize to place a
number of elements per block that stays within resource limits. This
would involve a bit more refactoring to do cleanly.

show more ...

63d3996f16-Feb-2022 Jed Brown <jed@jedbrown.org>

backends/cuda: more informative error reporting

f190906a16-Feb-2022 Jed Brown <jed@jedbrown.org>

xsmm: support for 1.17 headers

3a8b50de15-Feb-2022 Jed Brown <jed@jedbrown.org>

CI: update lv for cuda-11.6

2361c88813-Feb-2022 Jed Brown <jed@jedbrown.org>

Merge pull request #895 from CEED/jeremylt/stray-char

qf - remove stray character

8d000c7712-Feb-2022 Jed Brown <jed@jedbrown.org>

Merge pull request #897 from CEED/jed/vec-zero-sized

Vector: error-free path for get/take array when size=0

e076c21912-Feb-2022 Jed Brown <jed@jedbrown.org>

tests: add zero sized array tests

50c643e112-Feb-2022 Jed Brown <jed@jedbrown.org>

Vector: error-free path for get/take array when size=0

Among other things, this is important so that parallel callers can
easily keep collective control flow even when some subdomains (materials
or

Vector: error-free path for get/take array when size=0

Among other things, this is important so that parallel callers can
easily keep collective control flow even when some subdomains (materials
or boundary surfaces) are size 0 on some ranks.

show more ...

edfb5f2310-Feb-2022 Jeremy L Thompson <jeremy@jeremylt.org>

qf - remove stray character

8c11b84209-Feb-2022 Jeremy L Thompson <jeremy@jeremylt.org>

Merge pull request #892 from CEED/jeremy/small-leak

op - fix small leak in composite ctx label

f2adece308-Feb-2022 Jeremy L Thompson <jeremy@jeremylt.org>

test - add test for label that doesn't exist

60801d1908-Feb-2022 Jed Brown <jed@jedbrown.org>

Merge pull request #894 from CEED/will/julia-style

Update JuliaFormatter version

cdf9579108-Feb-2022 Will Pazner <will.e.p@gmail.com>

[julia] Update JuliaFormatter version

a48e5f4307-Feb-2022 Jeremy L Thompson <jeremy@jeremylt.org>

op - fix small leak in composite ctx label

c6e1a27907-Feb-2022 Jed Brown <jed@jedbrown.org>

Merge pull request #889 from CEED/rezgar/oriented-restr

Element Restriction Oriented

86e1ed6505-Feb-2022 nbeams <246972+nbeams@users.noreply.github.com>

Add launch bounds to HIP QFunction kernels

f71aa81b01-Feb-2022 nbeams <246972+nbeams@users.noreply.github.com>

add launch bounds to magma kernels;
add macro definition for y-dim of magma basis kernel threadblocks

Co-authored-by: Ahmad Abdelfattah <ahmad@icl.utk.edu>

b3c5430c01-Feb-2022 nbeams <246972+nbeams@users.noreply.github.com>

Add flag to use atomic adds on supported AMD GPU hardware

000294e304-Feb-2022 rezgarshakeri <rezgar.shakeri@colorado.edu>

updated ceed-ref-restriction.c

b435c5a604-Feb-2022 rezgarshakeri <rezgar.shakeri@colorado.edu>

Added CeedElemRestrictionIsOriented function

c774505304-Feb-2022 Rezgar Shakeri <42816410+rezgarshakeri@users.noreply.github.com>

Update interface/ceed-elemrestriction.c

Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>

d4b88fd203-Feb-2022 rezgarshakeri <rezgar.shakeri@colorado.edu>

tests: deleted CeedVectorSetValue in restriction tests

61e7462c03-Feb-2022 Rezgar Shakeri <42816410+rezgarshakeri@users.noreply.github.com>

Update interface/ceed-elemrestriction.c

Co-authored-by: Jeremy L Thompson <jeremy@jeremylt.org>

4dd06d3302-Feb-2022 rezgarshakeri <rezgar.shakeri@colorado.edu>

update ceed-elemrestriction.c: fixed formatting

cf6be90701-Feb-2022 rezgarshakeri <rezgar.shakeri@colorado.edu>

tests: added t220-elemrestriction.c

1...<<101102103104105106107108109110>>...226