honee/doc/auxiliary.md

*0fb1909eSJames Wright# Auxiliary Functionality
*0fb1909eSJames WrightThis section documents functionality that is not apart of the core PDE solver, but is used for other miscellaneous tasks, such as statistics collection or in-situ machine learning.
*0fb1909eSJames Wright
*0fb1909eSJames Wright(aux-statistics)=
*0fb1909eSJames Wright## Statistics Collection
*0fb1909eSJames WrightFor scale-resolving simulations (such as LES and DNS), statistics for a simulation are more often useful than time-instantaneous snapshots of the simulation itself.
*0fb1909eSJames WrightTo make this process more computationally efficient, averaging in the spanwise direction, if physically correct, can help reduce the amount of simulation time needed to get converged statistics.
*0fb1909eSJames Wright
*0fb1909eSJames WrightFirst, let's more precisely define what we mean by spanwise average.
*0fb1909eSJames WrightDenote $\langle \phi \rangle$ as the Reynolds average of $\phi$, which in this case would be a average over the spanwise direction and time:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\langle \phi \rangle(x,y) = \frac{1}{L_z + (T_f - T_0)}\int_0^{L_z} \int_{T_0}^{T_f} \phi(x, y, z, t) \mathrm{d}t \mathrm{d}z
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright
*0fb1909eSJames Wrightwhere $z$ is the spanwise direction, the domain has size $[0, L_z]$ in the spanwise direction, and $[T_0, T_f]$ is the range of time being averaged over.
*0fb1909eSJames WrightNote that here and in the code, **we assume the spanwise direction to be in the $z$ direction**.
*0fb1909eSJames Wright
*0fb1909eSJames WrightTo discuss the details of the implementation we'll first discuss the spanwise integral, then the temporal integral, and lastly the statistics themselves.
*0fb1909eSJames Wright
*0fb1909eSJames Wright### Spanwise Integral
*0fb1909eSJames WrightThe function $\langle \phi \rangle (x,y)$ is represented on a 2-D finite element grid, taken from the full domain mesh itself.
*0fb1909eSJames WrightIf isoperiodicity is set, the periodic face is extracted as the spanwise statistics mesh.
*0fb1909eSJames WrightOtherwise the negative z face is used.
*0fb1909eSJames WrightWe'll refer to this mesh as the *parent grid*, as for every "parent" point in the parent grid, there are many "child" points in the full domain.
*0fb1909eSJames WrightDefine a function space on the parent grid as $\mathcal{V}_p^\mathrm{parent} = \{ \bm v(\bm x) \in H^{1}(\Omega_e^\mathrm{parent}) \,|\, \bm v(\bm x_e(\bm X)) \in P_p(\bm{I}), e=1,\ldots,N_e \}$.
*0fb1909eSJames WrightWe enforce that the order of the parent FEM space is equal to the full domain's order.
*0fb1909eSJames Wright
*0fb1909eSJames WrightMany statistics are the product of 2 or more solution functions, which results in functions of degree higher than the parent FEM space, $\mathcal{V}_p^\mathrm{parent}$.
*0fb1909eSJames WrightTo represent these higher-order functions on the parent FEM space, we perform an $L^2$ projection.
*0fb1909eSJames WrightDefine the spanwise averaged function as:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\langle \phi \rangle_z(x,y,t) = \frac{1}{L_z} \int_0^{L_z} \phi(x, y, z, t) \mathrm{d}z
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright
*0fb1909eSJames Wrightwhere the function $\phi$ may be the product of multiple solution functions and $\langle \phi \rangle_z$ denotes the spanwise average.
*0fb1909eSJames WrightThe projection of a function $u$ onto the parent FEM space would look like:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\bm M u_N = \int_0^{L_x} \int_0^{L_y} u \psi^\mathrm{parent}_N \mathrm{d}y \mathrm{d}x
*0fb1909eSJames Wright$$
*0fb1909eSJames Wrightwhere $\bm M$ is the mass matrix for $\mathcal{V}_p^\mathrm{parent}$, $u_N$ the coefficients of the projected function, and $\psi^\mathrm{parent}_N$ the basis functions of the parent FEM space.
*0fb1909eSJames WrightSubstituting the spanwise average of $\phi$ for $u$, we get:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\bm M [\langle \phi \rangle_z]_N = \int_0^{L_x} \int_0^{L_y} \left [\frac{1}{L_z} \int_0^{L_z} \phi(x,y,z,t) \mathrm{d}z \right ] \psi^\mathrm{parent}_N(x,y) \mathrm{d}y \mathrm{d}x
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright
*0fb1909eSJames WrightThe triple integral in the right hand side is just an integral over the full domain
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\bm M [\langle \phi \rangle_z]_N = \frac{1}{L_z} \int_\Omega \phi(x,y,z,t) \psi^\mathrm{parent}_N(x,y) \mathrm{d}\Omega
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright
*0fb1909eSJames WrightWe need to evaluate $\psi^\mathrm{parent}_N$ at quadrature points in the full domain.
*0fb1909eSJames WrightTo do this efficiently, **we assume and exploit the full domain grid to be a tensor product in the spanwise direction**.
*0fb1909eSJames WrightThis assumption means quadrature points in the full domain have the same $(x,y)$ coordinate location as quadrature points in the parent domain.
*0fb1909eSJames WrightThis also allows the use of the full domain quadrature weights for the triple integral.
*0fb1909eSJames Wright
*0fb1909eSJames Wright### Temporal Integral/Averaging
*0fb1909eSJames WrightTo calculate the temporal integral, we do a running average using left-rectangle rule.
*0fb1909eSJames WrightAt the beginning of each simulation, the time integral of a statistic is set to 0, $\overline{\phi} = 0$.
*0fb1909eSJames WrightPeriodically, the integral is updated using left-rectangle rule:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$\overline{\phi}_\mathrm{new} = \overline{\phi}_{\mathrm{old}} + \phi(t_\mathrm{new}) \Delta T$$
*0fb1909eSJames Wrightwhere $\phi(t_\mathrm{new})$ is the statistic at the current time and $\Delta T$ is the time since the last update.
*0fb1909eSJames WrightWhen stats are written out to file, this running sum is then divided by $T_f - T_0$ to get the time average.
*0fb1909eSJames Wright
*0fb1909eSJames WrightWith this method of calculating the running time average, we can plug this into the $L^2$ projection of the spanwise integral:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\bm M [\langle \phi \rangle]_N = \frac{1}{L_z + (T_f - T_0)} \int_\Omega \int_{T_0}^{T_f} \phi(x,y,z,t) \psi^\mathrm{parent}_N \mathrm{d}t \mathrm{d}\Omega
*0fb1909eSJames Wright$$
*0fb1909eSJames Wrightwhere the integral $\int_{T_0}^{T_f} \phi(x,y,z,t) \mathrm{d}t$ is calculated on a running basis.
*0fb1909eSJames Wright
*0fb1909eSJames Wright
*0fb1909eSJames Wright### Running
*0fb1909eSJames WrightAs the simulation runs, it takes a running time average of the statistics at the full domain quadrature points.
*0fb1909eSJames WrightThis running average is only updated at the interval specified by `-ts_monitor_turbulence_spanstats_collect_interval` as number of timesteps.
*0fb1909eSJames WrightThe $L^2$ projection problem is only solved when statistics are written to file, which is controlled by `-ts_monitor_turbulence_spanstats_viewer_interval`.
*0fb1909eSJames WrightNote that the averaging is not reset after each file write.
*0fb1909eSJames WrightThe average is always over the bounds $[T_0, T_f]$, where $T_f$ in this case would be the time the file was written at and $T_0$ is the solution time at the beginning of the run.
*0fb1909eSJames Wright
*0fb1909eSJames Wright:::{list-table} Spanwise Turbulent Statistics Runtime Options
*0fb1909eSJames Wright:header-rows: 1
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - Option
*0fb1909eSJames Wright  - Description
*0fb1909eSJames Wright  - Default value
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-ts_monitor_turbulence_spanstats_collect_interval`
*0fb1909eSJames Wright  - Number of timesteps between statistics collection
*0fb1909eSJames Wright  - `1`
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-ts_monitor_turbulence_spanstats_viewer`
*0fb1909eSJames Wright  - Sets the PetscViewer for the statistics file writing, such as `cgns:output-%d.cgns` (requires PETSc `--download-cgns`). Also turns the statistics collection on.
*0fb1909eSJames Wright  -
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-ts_monitor_turbulence_spanstats_viewer_interval`
*0fb1909eSJames Wright  - Number of timesteps between statistics file writing (`-1` means only at end of run)
*0fb1909eSJames Wright  - `-1`
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-ts_monitor_turbulence_spanstats_viewer_cgns_batch_size`
*0fb1909eSJames Wright  - Number of frames written per CGNS file if the CGNS file name includes a format specifier (`%d`).
*0fb1909eSJames Wright  - `20`
*0fb1909eSJames Wright:::
*0fb1909eSJames Wright
*0fb1909eSJames Wright### Turbulent Statistics
*0fb1909eSJames Wright
*0fb1909eSJames WrightThe focus here are those statistics that are relevant to turbulent flow.
*0fb1909eSJames WrightThe terms collected are listed below, with the mathematical definition on the left and the label (present in CGNS output files) is on the right.
*0fb1909eSJames Wright
*0fb1909eSJames Wright| Math                           | Label                           |
*0fb1909eSJames Wright| -----------------              | --------                        |
*0fb1909eSJames Wright| $\langle \rho \rangle$         | MeanDensity                     |
*0fb1909eSJames Wright| $\langle p \rangle$            | MeanPressure                    |
*0fb1909eSJames Wright| $\langle p^2 \rangle$          | MeanPressureSquared             |
*0fb1909eSJames Wright| $\langle p u_i \rangle$        | MeanPressureVelocity[$i$]       |
*0fb1909eSJames Wright| $\langle \rho T \rangle$       | MeanDensityTemperature          |
*0fb1909eSJames Wright| $\langle \rho T u_i \rangle$   | MeanDensityTemperatureFlux[$i$] |
*0fb1909eSJames Wright| $\langle \rho u_i \rangle$     | MeanMomentum[$i$]               |
*0fb1909eSJames Wright| $\langle \rho u_i u_j \rangle$ | MeanMomentumFlux[$ij$]          |
*0fb1909eSJames Wright| $\langle u_i \rangle$          | MeanVelocity[$i$]               |
*0fb1909eSJames Wright
*0fb1909eSJames Wrightwhere [$i$] are suffixes to the labels. So $\langle \rho u_x u_y \rangle$ would correspond to MeanMomentumFluxXY.
*0fb1909eSJames WrightThis naming convention is chosen to align with the CGNS standard naming style.
*0fb1909eSJames Wright
*0fb1909eSJames WrightTo get second-order statistics from these terms, simply use the identity:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\langle \phi' \theta' \rangle = \langle \phi \theta \rangle - \langle \phi \rangle \langle \theta \rangle
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright
*0fb1909eSJames Wright(aux-differential-filtering)=
*0fb1909eSJames Wright## Differential Filtering
*0fb1909eSJames Wright
*0fb1909eSJames WrightThere is the option to filter the solution field using differential filtering.
*0fb1909eSJames WrightThis was first proposed in {cite}`germanoDiffFilterLES1986`, using an inverse Hemholtz operator.
*0fb1909eSJames WrightThe strong form of the differential equation is
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\overline{\phi} - \nabla \cdot (\beta (\bm{D}\bm{\Delta})^2 \nabla \overline{\phi} ) = \phi
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright
*0fb1909eSJames Wrightfor $\phi$ the scalar solution field we want to filter, $\overline \phi$ the filtered scalar solution field, $\bm{\Delta} \in \mathbb{R}^{3 \times 3}$ a symmetric positive-definite rank 2 tensor defining the width of the filter, $\bm{D}$ is the filter width scaling tensor (also a rank 2 SPD tensor), and $\beta$ is a kernel scaling factor on the filter tensor.
*0fb1909eSJames WrightThis admits the weak form:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\int_\Omega \left( v \overline \phi + \beta \nabla v \cdot (\bm{D}\bm{\Delta})^2 \nabla \overline \phi \right) \,d\Omega
*0fb1909eSJames Wright- \cancel{\int_{\partial \Omega} \beta v \nabla \overline \phi \cdot (\bm{D}\bm{\Delta})^2 \bm{\hat{n}} \,d\partial\Omega} =
*0fb1909eSJames Wright\int_\Omega v \phi \, , \; \forall v \in \mathcal{V}_p
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright
*0fb1909eSJames WrightThe boundary integral resulting from integration-by-parts is crossed out, as we assume that $(\bm{D}\bm{\Delta})^2 = \bm{0} \Leftrightarrow \overline \phi = \phi$ at boundaries (this is reasonable at walls, but for convenience elsewhere).
*0fb1909eSJames Wright
*0fb1909eSJames Wright### Filter Width Tensor, Δ
*0fb1909eSJames WrightFor homogenous filtering, $\bm{\Delta}$ is defined as the identity matrix.
*0fb1909eSJames Wright
*0fb1909eSJames Wright:::{note}
*0fb1909eSJames WrightIt is common to denote a filter width dimensioned relative to the radial distance of the filter kernel.
*0fb1909eSJames WrightNote here we use the filter *diameter* instead, as that feels more natural (albeit mathematically less convenient).
*0fb1909eSJames WrightFor example, under this definition a box filter would be defined as:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames WrightB(\Delta; \bm{r}) =
*0fb1909eSJames Wright\begin{cases}
*0fb1909eSJames Wright1 & \Vert \bm{r} \Vert \leq \Delta/2 \\
*0fb1909eSJames Wright0 & \Vert \bm{r} \Vert > \Delta/2
*0fb1909eSJames Wright\end{cases}
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright:::
*0fb1909eSJames Wright
*0fb1909eSJames WrightFor inhomogeneous anisotropic filtering, we use the finite element grid itself to define $\bm{\Delta}$.
*0fb1909eSJames WrightThis is set via `-diff_filter_grid_based_width`.
*0fb1909eSJames WrightSpecifically, we use the filter width tensor defined in {cite}`prakashDDSGSAnisotropic2022`.
*0fb1909eSJames WrightFor finite element grids, the filter width tensor is most conveniently defined by $\bm{\Delta} = \bm{g}^{-1/2}$ where $\bm g = \nabla_{\bm x} \bm{X} \cdot \nabla_{\bm x} \bm{X}$ is the metric tensor.
*0fb1909eSJames Wright
*0fb1909eSJames Wright### Filter Width Scaling Tensor, $\bm{D}$
*0fb1909eSJames WrightThe filter width tensor $\bm{\Delta}$, be it defined from grid based sources or just the homogenous filtering, can be scaled anisotropically.
*0fb1909eSJames WrightThe coefficients for that anisotropic scaling are given by `-diff_filter_width_scaling`, denoted here by $c_1, c_2, c_3$.
*0fb1909eSJames WrightThe definition for $\bm{D}$ then becomes
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\bm{D} =
*0fb1909eSJames Wright\begin{bmatrix}
*0fb1909eSJames Wright    c_1 & 0        & 0        \\
*0fb1909eSJames Wright    0        & c_2 & 0        \\
*0fb1909eSJames Wright    0        & 0        & c_3 \\
*0fb1909eSJames Wright\end{bmatrix}
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright
*0fb1909eSJames WrightIn the case of $\bm{\Delta}$ being defined as homogenous, $\bm{D}\bm{\Delta}$ means that $\bm{D}$ effectively sets the filter width.
*0fb1909eSJames Wright
*0fb1909eSJames WrightThe filtering at the wall may also be damped, to smoothly meet the $\overline \phi = \phi$ boundary condition at the wall.
*0fb1909eSJames WrightThe selected damping function for this is the van Driest function {cite}`vandriestWallDamping1956`:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\zeta = 1 - \exp\left(-\frac{y^+}{A^+}\right)
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright
*0fb1909eSJames Wrightwhere $y^+$ is the wall-friction scaled wall-distance ($y^+ = y u_\tau / \nu = y/\delta_\nu$), $A^+$ is some wall-friction scaled scale factor, and $\zeta$ is the damping coefficient.
*0fb1909eSJames WrightFor this implementation, we assume that $\delta_\nu$ is constant across the wall and is defined by `-diff_filter_friction_length`.
*0fb1909eSJames Wright$A^+$ is defined by `-diff_filter_damping_constant`.
*0fb1909eSJames Wright
*0fb1909eSJames WrightTo apply this scalar damping coefficient to the filter width tensor, we construct the wall-damping tensor from it.
*0fb1909eSJames WrightThe construction implemented currently limits damping in the wall parallel directions to be no less than the original filter width defined by $\bm{\Delta}$.
*0fb1909eSJames WrightThe wall-normal filter width is allowed to be damped to a zero filter width.
*0fb1909eSJames WrightIt is currently assumed that the second component of the filter width tensor is in the wall-normal direction.
*0fb1909eSJames WrightUnder these assumptions, $\bm{D}$ then becomes:
*0fb1909eSJames Wright
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright\bm{D} =
*0fb1909eSJames Wright\begin{bmatrix}
*0fb1909eSJames Wright    \max(1, \zeta c_1) & 0         & 0                  \\
*0fb1909eSJames Wright    0                  & \zeta c_2 & 0                  \\
*0fb1909eSJames Wright    0                  & 0         & \max(1, \zeta c_3) \\
*0fb1909eSJames Wright\end{bmatrix}
*0fb1909eSJames Wright$$
*0fb1909eSJames Wright
*0fb1909eSJames Wright### Filter Kernel Scaling, β
*0fb1909eSJames WrightWhile we define $\bm{D}\bm{\Delta}$ to be of a certain physical filter width, the actual width of the implied filter kernel is quite larger than "normal" kernels.
*0fb1909eSJames WrightTo account for this, we use $\beta$ to scale the filter tensor to the appropriate size, as is done in {cite}`bullExplicitFilteringExact2016`.
*0fb1909eSJames WrightTo match the "size" of a normal kernel to our differential kernel, we attempt to have them match second order moments with respect to the prescribed filter width.
*0fb1909eSJames WrightTo match the box and Gaussian filters "sizes", we use $\beta = 1/10$ and $\beta = 1/6$, respectively.
*0fb1909eSJames Wright$\beta$ can be set via `-diff_filter_kernel_scaling`.
*0fb1909eSJames Wright
*0fb1909eSJames Wright### Runtime Options
*0fb1909eSJames Wright
*0fb1909eSJames Wright:::{list-table} Differential Filtering Runtime Options
*0fb1909eSJames Wright:header-rows: 1
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - Option
*0fb1909eSJames Wright  - Description
*0fb1909eSJames Wright  - Default value
*0fb1909eSJames Wright  - Unit
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-diff_filter_monitor`
*0fb1909eSJames Wright  - Enable differential filter TSMonitor
*0fb1909eSJames Wright  - `false`
*0fb1909eSJames Wright  - boolean
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-diff_filter_grid_based_width`
*0fb1909eSJames Wright  - Use filter width based on the grid size
*0fb1909eSJames Wright  - `false`
*0fb1909eSJames Wright  - boolean
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-diff_filter_width_scaling`
*0fb1909eSJames Wright  - Anisotropic scaling for filter width in wall-aligned coordinates (snz)
*0fb1909eSJames Wright  - `1,1,1`
*0fb1909eSJames Wright  - `m`
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-diff_filter_kernel_scaling`
*0fb1909eSJames Wright  - Scaling to make differential kernel size equivalent to other filter kernels
*0fb1909eSJames Wright  - `0.1`
*0fb1909eSJames Wright  - `m^2`
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-diff_filter_wall_damping_function`
*0fb1909eSJames Wright  - Damping function to use at the wall for anisotropic filtering (`none`, `van_driest`)
*0fb1909eSJames Wright  - `none`
*0fb1909eSJames Wright  - string
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-diff_filter_wall_damping_constant`
*0fb1909eSJames Wright  - Constant for the wall-damping function. $A^+$ for `van_driest` damping function.
*0fb1909eSJames Wright  - 25
*0fb1909eSJames Wright  -
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-diff_filter_friction_length`
*0fb1909eSJames Wright  - Friction length associated with the flow, $\delta_\nu$. Used in wall-damping functions
*0fb1909eSJames Wright  - 0
*0fb1909eSJames Wright  - `m`
*0fb1909eSJames Wright:::
*0fb1909eSJames Wright
*0fb1909eSJames Wright(aux-in-situ-ml)=
*0fb1909eSJames Wright## *In Situ* Machine-Learning Model Training
*0fb1909eSJames WrightTraining machine-learning models normally uses *a priori* (already gathered) data stored on disk.
*0fb1909eSJames WrightThis is computationally inefficient, particularly as the scale of the problem grows and the data that is saved to disk reduces to a small percentage of the total data generated by a simulation.
*0fb1909eSJames WrightOne way of working around this to to train a model on data coming from an ongoing simulation, known as *in situ* (in place) learning.
*0fb1909eSJames Wright
*0fb1909eSJames WrightThis is implemented in the code using [SmartSim](https://www.craylabs.org/docs/overview.html).
*0fb1909eSJames WrightBriefly, the fluid simulation will periodically place data for training purposes into a database that a separate process uses to train a model.
*0fb1909eSJames WrightThe database used by SmartSim is [Redis](https://redis.com/modules/redis-ai/) and the library to connect to the database is called [SmartRedis](https://www.craylabs.org/docs/smartredis.html).
*0fb1909eSJames WrightMore information about how to utilize this code in a SmartSim configuration can be found on [SmartSim's website](https://www.craylabs.org/docs/overview.html).
*0fb1909eSJames Wright
*0fb1909eSJames WrightTo use this code in a SmartSim *in situ* setup, first the code must be built with SmartRedis enabled.
*0fb1909eSJames WrightThis is done by specifying the installation directory of SmartRedis using the `SMARTREDIS_DIR` environment variable when building:
*0fb1909eSJames Wright
*0fb1909eSJames Wright```
*0fb1909eSJames Wrightmake SMARTREDIS_DIR=~/software/smartredis/install
*0fb1909eSJames Wright```
*0fb1909eSJames Wright
*0fb1909eSJames Wright### SGS Data-Driven Model *In Situ* Training
*0fb1909eSJames WrightCurrently the code is only setup to do *in situ* training for the SGS data-driven model.
*0fb1909eSJames WrightTraining data is split into the model inputs and outputs.
*0fb1909eSJames WrightThe model inputs are calculated as the same model inputs in the SGS Data-Driven model described {ref}`earlier<sgs-dd-model>`.
*0fb1909eSJames WrightThe model outputs (or targets in the case of training) are the subgrid stresses.
*0fb1909eSJames WrightBoth the inputs and outputs are computed from a filtered velocity field, which is calculated via {ref}`aux-differential-filtering`.
*0fb1909eSJames WrightThe settings for the differential filtering used during training are described in {ref}`aux-differential-filtering`.
*0fb1909eSJames WrightThe training will create multiple sets of data per each filter width defined in `-sgs_train_filter_widths`.
*0fb1909eSJames WrightThose scalar filter widths correspond to the scaling correspond to $\bm{D} = c \bm{I}$, where $c$ is the scalar filter width.
*0fb1909eSJames Wright
*0fb1909eSJames WrightThe SGS *in situ* training can be enabled using the `-sgs_train_enable` flag.
*0fb1909eSJames WrightData can be processed and placed into the database periodically.
*0fb1909eSJames WrightThe interval between is controlled by `-sgs_train_write_data_interval`.
*0fb1909eSJames WrightThere's also the choice of whether to add new training data on each database write or to overwrite the old data with new data.
*0fb1909eSJames WrightThis is controlled by `-sgs_train_overwrite_data`.
*0fb1909eSJames Wright
*0fb1909eSJames WrightThe database may also be located on the same node as a MPI rank (collocated) or located on a separate node (distributed).
*0fb1909eSJames WrightIt's necessary to know how many ranks are associated with each collocated database, which is set by `-smartsim_collocated_database_num_ranks`.
*0fb1909eSJames Wright
*0fb1909eSJames Wright### Runtime Options
*0fb1909eSJames Wright:::{list-table} *In Situ* Machine-Learning Training Runtime Options
*0fb1909eSJames Wright:header-rows: 1
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - Option
*0fb1909eSJames Wright  - Description
*0fb1909eSJames Wright  - Default value
*0fb1909eSJames Wright  - Unit
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-sgs_train_enable`
*0fb1909eSJames Wright  - Whether to enable *in situ* training of data-driven SGS model. Require building with SmartRedis.
*0fb1909eSJames Wright  - `false`
*0fb1909eSJames Wright  - boolean
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-sgs_train_write_data_interval`
*0fb1909eSJames Wright  - Number of timesteps between writing training data into SmartRedis database
*0fb1909eSJames Wright  - `1`
*0fb1909eSJames Wright  -
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-sgs_train_overwrite_data`
*0fb1909eSJames Wright  - Whether new training data should overwrite old data on database
*0fb1909eSJames Wright  - `true`
*0fb1909eSJames Wright  - boolean
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-sgs_train_filter_widths`
*0fb1909eSJames Wright  - List of scalar values for different filter widths to calculate for training data
*0fb1909eSJames Wright  -
*0fb1909eSJames Wright  - `m`
*0fb1909eSJames Wright
*0fb1909eSJames Wright* - `-smartsim_collocated_num_ranks`
*0fb1909eSJames Wright  - Number of MPI ranks associated with each collocated database (i.e. ranks per node)
*0fb1909eSJames Wright  - `1`
*0fb1909eSJames Wright  -
*0fb1909eSJames Wright:::