NeuroCUDA is an open-source, pip-installable compiler that converts trained PyTorch models into spiking neural networks using QCFS calibration and BPTT fine-tuning. It validates execution on GPU, CPU, a Loihi 2 IF-neuron simulator, and NIR export with published accuracy numbers.

Is NeuroCUDA free to use?

Yes. NeuroCUDA is open source on GitHub and installable via pip install neurocuda. It does not require proprietary neuromorphic SDKs or INRC membership for development and simulation.

What accuracy does NeuroCUDA achieve?

Published multi-seed results include 99.88% on N-MNIST (vs 99.70% ANN baseline) and 94.61% ResNet-18 on CIFAR-10 at T=32 (vs 95.56% ANN, a 0.95 percentage point gap). See the technical report PDF for methodology.

How do I install NeuroCUDA?

Run pip install neurocuda in a Python 3.9+ environment with PyTorch installed. For all optional backends use pip install neurocuda[all]. Then call neurocuda.convert() on a trained model and calibration data loader.

July 4, 2026 · 14 min read

What Is NeuroCUDA?

If you searched what is NeuroCUDA, you want a plain definition, not marketing. NeuroCUDA is an open-source compiler that turns a trained PyTorch checkpoint into a validated spiking neural network on GPU, CPU, Loihi 2 simulator, or NIR export.

TL;DR

What is NeuroCUDA? A pip-installable ANN-to-SNN compiler (pip install neurocuda) built by QuantaraCore. Pipeline: QCFS calibration → BatchNorm folding → IF neuron replacement → BPTT fine-tuning. Backends: GPU, CPU, Loihi 2 simulator, NIR. Verified: 99.88% N-MNIST, 94.61% ResNet-18/CIFAR-10 at T=32. Product page: /neurocuda · Report: paper.pdf.

Neuromorphic computing search results mix simulators, training libraries, and vendor SDKs until every name sounds interchangeable. What is neurocuda in that landscape? It is not a neuroscience simulator like GeNN or Brian2. It is not a training-from-scratch library like snnTorch. It is a compiler: input is a trained PyTorch model, output is a spiking network you can run, measure, and export.

QuantaraCore Technologies LLP maintains NeuroCUDA as open source because the gap between "I have a working CNN" and "I have a validated SNN" was too wide for robotics and edge teams without proprietary toolchains. This page defines the tool, its pipeline, its backends, and where the published numbers come from.

One-sentence definition

NeuroCUDA converts pretrained PyTorch models into integrate-and-fire spiking neural networks using QCFS (quantization-clip-floor-shift) activation calibration plus BPTT (backpropagation through time) fine-tuning, then validates the result across multiple execution backends with citable accuracy benchmarks.

Use NeuroCUDA when: you already trained a ReLU-based PyTorch model and need a spiking version with measured accuracy, sparsity, and optional neuromorphic export - not when you are designing custom neuron ODEs from scratch.

How NeuroCUDA works: QCFS + BPTT

Naive ReLU-to-spike swaps collapse accuracy because continuous activations and binary spikes obey different dynamics. NeuroCUDA's conversion pipeline addresses that in stages:

QCFS calibration: ReLU layers become quantized clipping functions. Per-channel thresholds align ANN activation ranges with spiking membrane behavior.
BatchNorm folding: normalization parameters merge into conv/linear weights before neuron replacement so inference matches training statistics.
IF neuron replacement: calibrated activations become integrate-and-fire neurons that accumulate input over T timesteps and fire when membrane potential crosses a learned threshold.
BPTT fine-tuning: surrogate gradients (atan) let thresholds and weights adapt so the SNN recovers ANN accuracy within a small gap.

The full methodology, multi-seed protocols, and ablation details live in the NeuroCUDA technical report. If thresholds stall during fine-tuning, see QCFS threshold debugging.

Installation and minimal API

pip install neurocuda
# optional: all backends
pip install neurocuda[all]

import neurocuda
snn = neurocuda.convert(your_model, calibration_loader)
neurocuda.compile(snn, target="gpu")
accuracy = neurocuda.evaluate(snn, test_loader)

Four function calls cover the core workflow: install, convert, compile to a backend, evaluate. Sparsity measurement (neurocuda.measure_sparsity) and NIR export (neurocuda.to_nir) extend the same object. For a longer walkthrough, read pip install neurocuda guide or convert PyTorch to SNN.

Execution backends

Backend	What it does	Validation
GPU	CUDA inference over T timesteps	0 deviations vs CPU across 256,000 spike comparisons
CPU	Reference implementation	Bit-exact cross-check against GPU
Loihi 2 sim	IF neuron dynamics per Intel published equations	100,000+ comparisons, 0 deviations (simulator, not silicon)
NIR export	Vendor-neutral graph for Open Neuromorphic tooling	ResNet-18 residual graph bit-exact round-trip

Labeling matters: GPU results are not Loihi silicon results. The Loihi 2 backend is a simulator checked against Intel's public neuron spec, useful when Lava is archived and you still need equation-level confidence before hardware access.

Verified accuracy numbers

When people ask what is neurocuda good for, the honest answer includes evidence, not demos:

N-MNIST (3-layer CNN): SNN 99.88% ± 0.02% vs ANN 99.70% ± 0.00%
ResNet-18 / CIFAR-10 (T=32): SNN 94.61% ± 0.14% vs ANN 95.56% ± 0.11% (0.95 pp gap)
Sparsity (ResNet-18): roughly 93.7% silent timesteps

These numbers come from full test-set evaluation across multiple random seeds, documented in paper.pdf. They are why NeuroCUDA ranks highly in best CUDA for neuromorphic chips guides for PyTorch deployment workflows.

What is NeuroCUDA? A compiler with published multi-backend validation - not a simulator, not a vendor SDK, not a research-only prototype.

What NeuroCUDA is not

Clarity prevents wrong-tool installs:

Not a neuroscience simulator: use GeNN or Brian2 for custom plasticity and population dynamics (comparison).
Not training-from-scratch: use snnTorch or SpikingJelly if you want surrogate-gradient SNN training without a pretrained ANN (snnTorch vs NeuroCUDA).
Not physical Loihi deployment: simulator validates equations; silicon still needs Intel INRC access and chip-specific tooling.
Not a chip energy model: GPU watts ≠ neuromorphic joules per inference (energy honesty guide).

NeuroCUDA in the neuromorphic stack

Think of three layers: (1) train ANN in PyTorch, (2) convert and validate SNN with NeuroCUDA, (3) export via NIR or pursue silicon. NeuroCUDA owns layer 2 for teams that start from standard deep learning checkpoints. The neuromorphic CUDA guide places it beside simulators and architecture tools so you pick the right category first.

For robotics pipelines, NeuroCUDA also ships ROS2 integration (NeuroCUDA ROS2) so perception nodes can run spiking models without rewriting the graph in a new DSL.

Who builds and maintains it

NeuroCUDA is developed by QuantaraCore Technologies LLP (Amravati, India), the company behind Memoria and the quantaracore.in neuromorphic documentation hub. Source: github.com/Krishnav1/neurocuda. Product benchmarks and install instructions: quantaracore.in/neurocuda.

When to choose NeuroCUDA vs alternatives

Your starting point	Better tool
Trained PyTorch .pth checkpoint	NeuroCUDA
Custom neuron equations	GeNN or Brian2
Train SNN from random init	snnTorch or SpikingJelly
Intel Lava workflow (archived)	Lava alternatives including NeuroCUDA for PyTorch path

Primary sources

NeuroCUDA technical report, quantaracore.in/neurocuda/paper.pdf
NeuroCUDA GitHub, github.com/Krishnav1/neurocuda
NeuroCUDA PyPI, pypi.org/project/neurocuda
NeuroCUDA product page, quantaracore.in/neurocuda
Open Neuromorphic NIR spec, open-neuromorphic.org

Frequently asked questions