ManagedCuda-CUBLAS 12.8.60

dotnet add package ManagedCuda-CUBLAS --version 12.8.60

NuGet\Install-Package ManagedCuda-CUBLAS -Version 12.8.60
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="ManagedCuda-CUBLAS" Version="12.8.60" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.

For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.
Directory.Packages.props:
<PackageVersion Include="ManagedCuda-CUBLAS" Version="12.8.60" />
Project file:
<PackageReference Include="ManagedCuda-CUBLAS" />

paket add ManagedCuda-CUBLAS --version 12.8.60

#r "nuget: ManagedCuda-CUBLAS, 12.8.60"
The #r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

#:package ManagedCuda-CUBLAS@12.8.60
The #:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.

Install as a Cake Addin:
#addin nuget:?package=ManagedCuda-CUBLAS&version=12.8.60

Install as a Cake Tool:
#tool nuget:?package=ManagedCuda-CUBLAS&version=12.8.60

managedCuda started as a hobby project to make CUDA easily accessible from C#; at that time, CUDA was at version 3. Now, more than 10 years later, managedCuda is still alive and I regularly update it to the latest CUDA versions. To support further development, I switched from the LGPL license to a dual GPLv3 / commercial license, starting with managedCuda for CUDA version 12. If you plan to use managedCuda 12 in a commercial project, please contact me by mail: managedcuda@articimaging.eu. If you use the open-source license and want to contribute to future development, you can donate a beer via PayPal.

Official nuget packages

One can find multiple packages for managedCuda on nuget, but the official packages are:

  • ManagedCuda-12 (core library without dependencies)
  • CUBLAS (wrapper for cuBlas library, depends on ManagedCuda-12)
  • CUFFT (wrapper for cuFFT library, depends on ManagedCuda-12)
  • CURAND (wrapper for cuRand library, depends on ManagedCuda-12)
  • CUSOLVE (wrapper for cuSolver library, depends on ManagedCuda-12)
  • CUSPARSE (wrapper for cuSparse library, depends on ManagedCuda-12)
  • NPP (wrapper for NPP library, depends on ManagedCuda-12)
  • NVJITLINK (wrapper for nvJitLink library, depends on ManagedCuda-12)
  • NVJPEG (wrapper for nvjpeg library, depends on ManagedCuda-12)
  • NVRTC (wrapper for nvrtc library, depends on ManagedCuda-12)

managedCuda

managedCuda aims to make it easy to integrate NVIDIA's CUDA into .NET applications written in C#, Visual Basic or any other .NET language.

For this it includes:

  • A complete wrapper for the CUDA Driver API, version 12.8 (a 1:1 representation of cuda.h in C#)
  • Based on this, wrapper classes for CUDA context, kernel, device variable, etc.
  • Wrappers for graphics interop with DirectX and OpenGL, via SlimDX and OpenTK respectively
  • CUDA vector types like int2, float3 etc. with ToString() methods and operators (+, -, *, /)
  • Define your own types: CudaDeviceVariable accepts any user defined type if it is a value type, i.e. a struct in C#
  • Includes CUDA libraries: CUBLAS, CUFFT, CURAND, CUSOLVER, CUSPARSE, NPP, NvJPEG, NvJitLink and NVRTC
  • Compatibility with .NET Framework 4.8 (might be dropped in a future version) and .NET Core 3.1 or later.
  • Native Linux support on .NET Core 3.1 or later: the native library names are switched automatically.
  • Access device memory directly per element using [] operator:
CudaDeviceVariable<float> devVar = new CudaDeviceVariable<float>(64);
devVar[0] = 1.0f;
devVar[1] = 2.0f;
float hostVar1 = devVar[0];
float hostVar2 = devVar[1];
  • Implicit conversion operators: allocate and initialize device or host arrays in a single line of code:
float3[] array_host = new float3[100];
for (int i = 0; i < 100; i++)
{
	array_host[i] = new float3(i, i+1, i+2);
}
//alloc device memory and copy data:
CudaDeviceVariable<float3> array_device = array_host;
//alloc host array and copy data: 
float3[] array_host2 = array_device; 
  • NPPs extension methods for CudaDeviceVariable. Add a reference to the NPP library and include the ManagedCuda.NPP.NPPsExtensions namespace:
Random rand = new Random();
int length = 256;

//init some random values
double[] randoms = new double[length];
for (int i = 0; i < length; i++)
{
	randoms[i] = rand.NextDouble();
}

//Alloc device memory
CudaDeviceVariable<double> a = randoms;
CudaDeviceVariable<double> b = new CudaDeviceVariable<double>(length);
b.Set(10.0); //NPPs method
int size = a.MeanGetBufferSize(); //NPPs method
//Alloc temporary memory for NPPs mean method
CudaDeviceVariable<byte> buffer = new CudaDeviceVariable<byte>(size);
CudaDeviceVariable<double> mean = new CudaDeviceVariable<double>(1);

a.Mul(b); //NPPs method
a.DivC(10.0); //NPPs method
a.Mean(mean, buffer); //NPPs method

//Copy data back to host
double m = mean;
double[] res = a;

//Clean up
mean.Dispose();
buffer.Dispose();
b.Dispose();
a.Dispose();
  • The 'per thread default stream' feature is available as a compiler directive of the managedCuda main library: compile the library with the option "_PerThreadDefaultStream" to enable it.
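
As a small illustration of the user-defined types bullet above, the following sketch defines a plain C# struct of the kind that could serve as an element type. The name Particle and its fields are hypothetical and not part of managedCuda. The explicit sequential layout is an assumption made here so that the host-side byte layout is predictable; the source only states that the type must be a value type. The sketch uses only the .NET base library, so it runs without a GPU.

```csharp
using System;
using System.Runtime.InteropServices;

// Hypothetical element type for illustration only (not part of managedCuda).
// Sequential layout keeps the fields in declaration order in unmanaged memory.
[StructLayout(LayoutKind.Sequential)]
public struct Particle
{
    public float X;
    public float Y;
    public float Z;
    public int Id;
}

public static class Program
{
    public static void Main()
    {
        // Size in bytes of one element as seen by unmanaged code:
        // three 4-byte floats plus one 4-byte int.
        int elementSize = Marshal.SizeOf<Particle>();
        Console.WriteLine(elementSize); // prints 16

        // With managedCuda referenced and a CUDA device available, a struct
        // like this could then be used as the element type, following the
        // pattern shown in the list above:
        // CudaDeviceVariable<Particle> devParticles = new CudaDeviceVariable<Particle>(64);
    }
}
```

A matching struct definition is needed on the CUDA kernel side as well; keeping both definitions field-for-field identical is the caller's responsibility.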

Note about Cuda context

NVIDIA changed the CUDA context behavior in the CUDA libraries (NPP, cuFFT, etc.), which is why it is highly recommended to use a PrimaryContext instead of a CudaContext when using managedCuda together with CUDA libraries. To create a PrimaryContext in managedCuda, use the following lines of code:

int deviceID = 0;
PrimaryContext ctx = new PrimaryContext(deviceID);
// Make the context current on the calling CPU thread; mandatory for a PrimaryContext
ctx.SetCurrent();
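
Since this package wraps cuBLAS, a minimal sketch of combining a PrimaryContext with a CUBLAS call may help. It is written from memory of the managedCuda API and has not been verified against version 12.8.60: the CudaBlas class, its Gemm overload, the Operation enum, and IDisposable support on PrimaryContext and CudaBlas are all assumptions to check against the library. It needs a CUDA-capable device and the ManagedCuda-12 and ManagedCuda-CUBLAS packages.

```csharp
using System;
using ManagedCuda;
using ManagedCuda.CudaBlas;

public static class Program
{
    public static void Main()
    {
        int deviceID = 0;
        // Assumption: PrimaryContext implements IDisposable.
        using (PrimaryContext ctx = new PrimaryContext(deviceID))
        {
            // Mandatory for a PrimaryContext, as noted above.
            ctx.SetCurrent();

            const int n = 2;
            // cuBLAS expects column-major storage.
            float[] aHost = { 1f, 3f, 2f, 4f }; // A = [[1, 2], [3, 4]]
            float[] bHost = { 1f, 0f, 0f, 1f }; // B = 2x2 identity

            // Implicit conversion allocates device memory and copies the data.
            using (CudaDeviceVariable<float> a = aHost)
            using (CudaDeviceVariable<float> b = bHost)
            using (CudaDeviceVariable<float> c = new CudaDeviceVariable<float>(n * n))
            using (CudaBlas blas = new CudaBlas())
            {
                // C = 1.0 * A * B + 0.0 * C (assumed Gemm signature).
                blas.Gemm(Operation.NonTranspose, Operation.NonTranspose,
                          n, n, n, 1.0f, a, n, b, n, 0.0f, c, n);

                // Implicit conversion copies the result back to the host.
                float[] cHost = c;
                Console.WriteLine(string.Join(", ", cHost));
            }
        }
    }
}
```

Because B is the identity matrix, the values copied back should equal those of A, which makes the sketch easy to check by eye.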

NppStreamContext

In order to use the NppStreamContext-API of NPP, initialize a NppStreamContext like this:

// Optional: if no stream is set, NPP works on the default null-stream
CudaStream cudaStream = new CudaStream();
NPPNativeMethods.NPPCore.nppSetStream(cudaStream);
NppStreamContext nppCtx = NPPNativeMethods.NPPCore.nppGetStreamContext();

Compatible and additional computed target framework versions

  • .NET: net6.0 and net7.0 are compatible. Computed: net5.0, net5.0-windows, net8.0, net9.0, net10.0, and the platform-specific variants of net6.0 through net10.0 (android, ios, maccatalyst, macos, tvos, windows; browser from net8.0 on).
  • .NET Core: netcoreapp3.1 is compatible.
  • .NET Framework: net48 is compatible; net481 was computed.

Dependencies per target framework:
  • .NETCoreApp 3.1

    • No dependencies.
  • .NETFramework 4.8

    • No dependencies.
  • All Frameworks

  • net6.0

    • No dependencies.
  • net7.0

    • No dependencies.

NuGet packages (2)

Showing the top 2 NuGet packages that depend on ManagedCuda-CUBLAS:

  • DeepNet: Deep learning library for F#. Provides symbolic model differentiation, automatic differentiation and compilation to CUDA GPUs. Includes optimizers and model blocks used in deep learning. Make sure to set the platform of your project to x64.
  • Sigma.Core: Rocket powered machine learning. Create, compare, adapt, improve - neural networks at the speed of thought.

GitHub repositories (4)

Showing the top 4 popular GitHub repositories that depend on ManagedCuda-CUBLAS:

  • kunzmi/ImageStackAlignator: Implementation of Google's Handheld Multi-Frame Super-Resolution algorithm (from Pixel 3 and Pixel 4 camera)
  • GoodAI/BrainSimulator: Brain Simulator is a platform for visual prototyping of artificial intelligence architectures.
  • dje-dev/Ceres: Ceres - an MCTS chess engine for research and recreation
  • ThinkingTransistor/Sigma: Rocket powered machine learning. Create, compare, adapt, improve - artificial intelligence at the speed of thought.

Version   Downloads   Last Updated
12.8.60   534         3/3/2025
12.4.57   1,028       3/22/2024
12.3.54   328         1/2/2024
12.2.52   407         7/13/2023
12.1.51   567         5/18/2023
12.0.48   510         1/29/2023
11.4.47   2,201       8/19/2021
11.0.43   1,665       7/16/2020
10.2.41   1,219       4/26/2020
10.0.31   16,730      10/28/2018
8.0.22    7,417       5/28/2017
8.0.13    24,175      10/23/2016