Federico Busato – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2023-05-24T00:25:03Z http://www.open-lab.net/blog/feed/ Federico Busato <![CDATA[Accelerating Matrix Multiplication with Block Sparse Format and NVIDIA Tensor Cores]]> http://www.open-lab.net/blog/?p=24706 2023-05-24T00:25:03Z 2021-03-19T16:24:28Z Sparse-matrix dense-matrix multiplication (SpMM) is a fundamental linear algebra operation and a building block for more complex algorithms such as finding the...]]>

Sparse-matrix dense-matrix multiplication (SpMM) is a fundamental linear algebra operation and a building block for more complex algorithms such as finding the solutions of linear systems, computing eigenvalues through the preconditioned conjugate gradient, and multiple right-hand sides Krylov subspace iterative solvers. SpMM is also an important kernel used in many domains such as fluid dynamics…

Source

]]>
21
Federico Busato <![CDATA[Exploiting NVIDIA Ampere Structured Sparsity with cuSPARSELt]]> http://www.open-lab.net/blog/?p=22602 2022-08-21T23:40:49Z 2020-12-08T19:34:58Z Deep neural networks achieve outstanding performance in a variety of fields, such as computer vision, speech recognition, and natural language processing. The...]]>

Deep neural networks achieve outstanding performance in a variety of fields, such as computer vision, speech recognition, and natural language processing. The computational power needed to process these neural networks is rapidly increasing, so efficient models and computation are crucial. Neural network pruning, removing unnecessary model parameters to yield a sparse network, is a useful way to…

Source

]]>
10
���˳���97caoporen����