| 074be161 | 03-Jun-2019 |
Yohann Dudouit <yohann.dudouit@gmail.com> |
Optimization of weight kernel and dynamic allocation of shared memory.
- First optimization of weight kernel, kernels are now coalesce but might not be fully using SMs (need to batch elements per
Optimization of weight kernel and dynamic allocation of shared memory.
- First optimization of weight kernel, kernels are now coalesce but might not be fully using SMs (need to batch elements per block) - Switch to dynamic shared memory allocation in order to batch elements for interpolation and gradient in cuda-shared backend. - Add GetPreferedMemoryType for cuda-reg and cuda-shared backends. (Can be removed in the future with delegation of this function)
show more ...
|
| d3232bb7 | 30-May-2019 |
Yohann Dudouit <yohann.dudouit@gmail.com> |
Optimization of cuda-reg restriction. |
| 9ef20713 | 17-May-2019 |
Yohann Dudouit <yohann.dudouit@gmail.com> |
Start the optimization of the Cuda restriction operator. |
| 196a75e4 | 10-Jun-2019 |
Jed Brown <jed@jedbrown.org> |
Makefile: distclean removes config.mk; NVCC ?= |
| 57213447 | 10-Jun-2019 |
Valeria Barra <valeria.barra@colorado.edu> |
Added append option to tee for CC, FC, and NVCC |
| c74a5ab7 | 05-Jun-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #268 from CEED/petsc-bp-bugfix
PETSc BP4/6 Typo: Error -> Error3 |
| 4b5b4ec1 | 04-Jun-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Typo: Error -> Error3 |
| d3c6b40f | 02-Jun-2019 |
Jed Brown <jed@jedbrown.org> |
make configure: cache user variables in config.mk |
| 1571c99d | 02-Jun-2019 |
Valeria Barra <valeria.barra@colorado.edu> |
Add -include directive to Makefile to allow import of custom build configurations |
| 4bee41bb | 31-May-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #264 from CEED/get-memtype-fix
Add delegation to CeedGetPreferedMemType |
| c263cd57 | 31-May-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Add delegation to CeedGetPreferedMemType |
| 918855fa | 31-May-2019 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #263 from CEED/valeria/update-postprocessingscripts
Update postprocessing scripts |
| e113e564 | 31-May-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #262 from CEED/occa-clang-tidy
OCCA Backend clang-tidy fixes |
| 103dcb42 | 31-May-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
OCCA backend update note |
| 4c77ad0b | 30-May-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
Travis: clang-tidy only on linux |
| bec1c034 | 30-May-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
NOLINT for OCCA tensor contract false positive |
| d1f4761f | 29-May-2019 |
Valeria Barra <valeria.barra@colorado.edu> |
Re-add test_required_examples for proper title and pdf file name |
| 79f79454 | 29-May-2019 |
Valeria Barra <valeria.barra@colorado.edu> |
Update postprocessing scripts |
| f5ef5ec0 | 29-May-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
OCCA Backend clang-tidy fixes |
| 0c59ef15 | 29-May-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Add PETSc BP2, BP4, BP5, BP6 (#243)
* Add PETSc BP2, BP4, BP5, BP6
* Documentation update
* Update make benchmark
* Minor changes
* Add PETSc bpData structure
* Update make benchmarks
Add PETSc BP2, BP4, BP5, BP6 (#243)
* Add PETSc BP2, BP4, BP5, BP6
* Documentation update
* Update make benchmark
* Minor changes
* Add PETSc bpData structure
* Update make benchmarks
* Benchmark - use default qextra
* Documentation fix
* Clean up PETSc bps.c, add benchmarking first run
* Drop -ksp_max_it from benchmarkscript
show more ...
|
| 4febc4f3 | 29-May-2019 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #261 from CEED/valeria/update-gitignore
Update top level .gitignore |
| e1834370 | 29-May-2019 |
Valeria Barra <valeria.barra@colorado.edu> |
Update top level .gitignore |
| 1856ee7c | 29-May-2019 |
Jeremy L Thompson <25011573+jeremylt@users.noreply.github.com> |
Merge pull request #239 from CEED/decorator
Add delegates for specific objects |
| ad9dbbe3 | 29-May-2019 |
Jed Brown <jed@jedbrown.org> |
Merge pull request #240 from valeriabarra/rescaled-ns-example
Rescaled ns example |
| 68ba6316 | 28-May-2019 |
jeremylt <jeremy.thompson@colorado.edu> |
README: Drop tmpl backend from list |