<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="/rss.xsl.xml"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
    <title>Changes in mpicudamatimpl.h</title>
    <description></description>
    <language>en</language>
    <copyright>Copyright 2025</copyright>
    <generator>Java</generator>
<item>
        <title>9dd11ecf0918283bb567d8b33a92f53ac4ea7840 - Merge branch &apos;jacobf/2023-08-17/header-guard-check&apos; into &apos;main&apos;</title>
        <link>http://opengrok.kc2vjw.com:8080/history/petsc/src/mat/impls/sell/mpi/mpicuda/mpicudamatimpl.h#9dd11ecf0918283bb567d8b33a92f53ac4ea7840</link>
        <description>Merge branch &apos;jacobf/2023-08-17/header-guard-check&apos; into &apos;main&apos;
            Check header guards
            See merge request petsc/petsc!6822

            List of files:
            /petsc/src/mat/impls/sell/mpi/mpicuda/mpicudamatimpl.h</description>
        <pubDate>Fri, 25 Aug 2023 13:49:03 +0000</pubDate>
        <dc:creator>Satish Balay &lt;balay@mcs.anl.gov&gt;</dc:creator>
    </item>
<item>
        <title>a496304597bacff3545e802853d69e8765312868 - Convert all header guards to pragma once</title>
        <link>http://opengrok.kc2vjw.com:8080/history/petsc/src/mat/impls/sell/mpi/mpicuda/mpicudamatimpl.h#a496304597bacff3545e802853d69e8765312868</link>
        <description>Convert all header guards to pragma once

            List of files:
            /petsc/src/mat/impls/sell/mpi/mpicuda/mpicudamatimpl.h</description>
        <pubDate>Fri, 18 Aug 2023 15:13:00 +0000</pubDate>
        <dc:creator>Jacob Faibussowitsch &lt;jacob.fai@gmail.com&gt;</dc:creator>
    </item>
<item>
        <title>3ea99036a5fedea4d39e7e77471d0ab500c249d7 - Fix some malformed if !defined() header guards</title>
        <link>http://opengrok.kc2vjw.com:8080/history/petsc/src/mat/impls/sell/mpi/mpicuda/mpicudamatimpl.h#3ea99036a5fedea4d39e7e77471d0ab500c249d7</link>
        <description>Fix some malformed if !defined() header guards

            List of files:
            /petsc/src/mat/impls/sell/mpi/mpicuda/mpicudamatimpl.h</description>
        <pubDate>Thu, 17 Aug 2023 21:56:14 +0000</pubDate>
        <dc:creator>Jacob Faibussowitsch &lt;jacob.fai@gmail.com&gt;</dc:creator>
    </item>
<item>
        <title>dd874c20e1b8bb56639592ba136e1f877322cbbd - Merge branch &apos;hongzh/sell-cuda&apos; into &apos;main&apos;</title>
        <link>http://opengrok.kc2vjw.com:8080/history/petsc/src/mat/impls/sell/mpi/mpicuda/mpicudamatimpl.h#dd874c20e1b8bb56639592ba136e1f877322cbbd</link>
        <description>Merge branch &apos;hongzh/sell-cuda&apos; into &apos;main&apos;
            SELL-based SpMV
            See merge request petsc/petsc!3428

            List of files:
            /petsc/src/mat/impls/sell/mpi/mpicuda/mpicudamatimpl.h</description>
        <pubDate>Mon, 10 Apr 2023 23:51:00 +0000</pubDate>
        <dc:creator>Satish Balay &lt;balay@mcs.anl.gov&gt;</dc:creator>
    </item>
<item>
        <title>2d1451d43b73a0495cd81c074cbc1e0206888947 - Initial commit for porting SELL to GPU</title>
        <link>http://opengrok.kc2vjw.com:8080/history/petsc/src/mat/impls/sell/mpi/mpicuda/mpicudamatimpl.h#2d1451d43b73a0495cd81c074cbc1e0206888947</link>
        <description>Initial commit for porting SELL to GPU
            - Add tiled SPMV and basic SpMV for SeqSELL
            - Tested in serial
            - Offloadmask is used to determine when the matrix should be copied to GPU
            - Use different slice height for CUDA version
            - By checking the nonzerostate, PETSc can decide if the whole matrix needs to be copied or just the values need to be copied
            - Make the convert function public so that the very slow MatConvert_Basic can be avoided sometimes. E.g. one can use a two-step convert method: AIJ-&gt;SELL, SELL-&gt;SELLCUDA instead of the direct convert AIJ-&gt;SELLCUDA
            - Make the FLOPS count for SELL same as that for AIJCUSPARSE.
            - MatDisAssemble is not needed.
            - Change slice height from 32 to 16 for GPU
            - To overlap communication with MatMult, VecScatterBegin() should be called before MatMult() for the diagonal part.
            - SLICE_HEIGHT is defined to be 32 to match the warp size of GPU. For other cases, it is still 8.
            Funded-by:
            Project: PETSc for GPU
            Time: 42 hours
            Reported-by:
            Thanks-to:

            List of files:
            /petsc/src/mat/impls/sell/mpi/mpicuda/mpicudamatimpl.h</description>
        <pubDate>Thu, 09 Jan 2020 04:05:36 +0000</pubDate>
        <dc:creator>Hong Zhang &lt;hongzhang@anl.gov&gt;</dc:creator>
    </item>
</channel>
</rss>
