cutlass — package search across npm, PyPI, crates.io, RubyGems, Go, Maven & NuGet

BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

crates.io matches

Showing 12 of 26 · Rust

See all crates.io →

cutlassv0.1.0

crates.io

Macro based library to take the boilerplate out of configuration handling

Abandoned. Last published 6 years ago.

baracuda-cutlass-sysv0.0.1-alpha.63

crates.io

Header acquisition for NVIDIA CUTLASS as a baracuda workspace dependency. Sparse-checkout fetch with file-locked caching; emits cargo:include for downstream build.rs consumers.

Maintained. Niche but maintained, actively maintained.

baracuda-cutlassv0.0.1-alpha.63

crates.io

Safe Rust wrapper for compiled CUTLASS kernels: plan-based GEMM and grouped GEMM with caller-supplied workspace, typed device-buffer arguments, and capture-safe launch.

Maintained. Niche but maintained, actively maintained.

baracuda-cutlass-kernels-sysv0.0.1-alpha.63

crates.io

Compiled CUTLASS template instantiations for the baracuda ecosystem. Hosts curated .cu kernel sources, builds them via baracuda-forge, exposes extern "C" entry points for the safe baracuda-cutlass crate.

Maintained. Niche but maintained, actively maintained.

atomr-accel-cutlassv0.10.0

crates.io

CUTLASS kernel-template instantiation via NVRTC for atomr-accel. Provides GEMM, grouped GEMM, implicit-GEMM convolution, and EVT (epilogue visitor tree) actors that JIT CUTLASS C++ templates against the per-arch toolchain pinned by atomr-accel-cuda's NvrtcActor.

Maintained. Niche but maintained, actively maintained.

baracuda-forgev0.0.1-alpha.63

crates.io

Build-time CUDA kernel compiler for the baracuda ecosystem: nvcc-driven incremental builds, parallel compilation, GPU auto-detection, and CUTLASS / custom git dependency support.

Maintained. Niche but maintained, actively maintained.

baracudav0.0.1-alpha.63

crates.io

Idiomatic Rust wrappers for the NVIDIA CUDA stack (Driver API, Runtime API, NVRTC, cuBLAS, cuDNN, NCCL, NVML, ...). Umbrella crate.

Maintained. Niche but maintained, actively maintained.

atomr-accelv0.10.0

crates.io

Backend-agnostic compute-acceleration core. Defines the AccelBackend trait, AccelRef<T> typed pointers, AccelError enum, and CompletionStrategy — the abstraction layer that lets atomr-accel-cuda (NVIDIA), and future ROCm / Metal / oneAPI / Vulkan backends plug into the same actor surface.

Maintained. Niche but maintained, actively maintained.

atomr-accel-cubv0.10.0

crates.io

CUB-backed device-wide reductions, scans, sorts, histograms, and selects, surfaced as an atomr actor compiled per-(op, dtype) via NVRTC against the atomr-accel-cuda Phase 0.6 disk cache.

Maintained. Niche but maintained, actively maintained.

atomr-accel-cudav0.10.0

crates.io

GPU acceleration via the actor model. Wraps NVIDIA CUDA libraries (cuBLAS, cuDNN, cuFFT, cuRAND, cuSOLVER, cuSPARSE, cuTENSOR, cuBLASLt, NVRTC, NCCL) as supervised atomr actors with generation-validated buffers and a uniform async surface.

Maintained. Niche but maintained, actively maintained.

atomr-accel-flashattnv0.10.0

crates.io

FlashAttention v2 + v3 kernel templates for atomr-accel — fp16/bf16/fp8, causal, varlen, ALiBi, sliding window, sink tokens, MQA/GQA, paged KV-cache, and chunked prefill, dispatched through NVRTC + Phase 0.6 cubin cache.

Maintained. Niche but maintained, actively maintained.

atomr-accel-telemetryv0.10.0

crates.io

GPU observability for atomr-accel: NVTX ranges, NVML metrics actor, and CUPTI activity tracing. Implements the KernelTrace hooks exported by atomr-accel-cuda and registers atomr-telemetry probes.

Maintained. Niche but maintained, actively maintained.