Results for baracuda-kernels

JupyterLab - Code Console

kernelspecs v2.0.1

Abandoned. Last published 5 years ago.

Find Jupyter kernelspecs on a system

@thi.ng/geom v8.3.34

Functional, polymorphic API for 2D geometry types & SVG generation

jupyterlab_nb_venv_kernels_ui_extension v1.2.28

JupyterLab extension that lets the user right-click the kernel launcher button and choose 'Show in File Browser' or 'Open Terminal at location' to navigate to that location or open a terminal there, respectively

@thi.ng/pixel-dither v1.1.203

Extensible image dithering w/ various algorithm presets

@ruvector/ruvllm v2.5.5

Self-learning LLM runtime — TurboQuant KV-cache (6-8x compression), SONA adaptive learning, FlashAttention, speculative decoding, GGUF inference

@amoutonbrady/baracuda v1.4.0

Abandoned. Last published 6 years ago.

An alternative to JSX to work with HyperScript view layer

@bitbybit-dev/core v1.0.2

Bit By Bit Developers Core CAD API to Program Geometry

ml-kernel v4.0.0

A factory for kernel functions

@thi.ng/pixel-convolve v1.1.35

Extensible bitmap image convolution, kernel presets, normal map & image pyramid generation

@ruvector/rvf v0.2.1

RuVector Format — unified TypeScript SDK for vector intelligence

@bitbybit-dev/threejs v1.0.2

Bit By Bit Developers THREEJS CAD Library to Program Geometry

@thi.ng/cellular v1.0.64

Highly customizable 1D cellular automata, shared env, multiple rules, arbitrary sized/shaped neighborhoods, short term memory, cell states etc.

@bitbybit-dev/babylonjs v1.0.2

Bit By Bit Developers BABYLONJS CAD Library to Program Geometry

simsimd v6.5.5

Aging — last published 7 months ago — check before adopting.

Portable mixed-precision BLAS-like vector math library for x86 and ARM

jscolorengine v1.4.4

Javascript ICC Profile Color Engine with additional features for color management and analysis

@musical-patterns/pattern-hafuhafu v1.0.212

Abandoned. Last published 3 years ago.

rhythmic circularity; blocks within themselves

@nteract/fs-kernels v2.1.9

Abandoned. Last published 6 years ago.

A manager for the filesystem aspects of Jupyter kernels

@bitbybit-dev/playcanvas v1.0.2

Bit By Bit Developers PlayCanvas CAD Library to Program Geometry

@objectstack/core v8.0.0

Microkernel Core for ObjectStack

@mittalsuraj18/opencode-ipython-plugin v0.1.3

IPython kernel integration plugin for OpenCode - execute Python code with persistent kernels, rich output, and helper prelude

joplin-plugin-jopyter v1.0.0

Add runnable Python code blocks to your Joplin notes!

ijavascript v5.2.1

Abandoned. Last published 4 years ago.

IJavascript is a Javascript kernel for the Jupyter notebook

@scalebox/sdk v5.3.0

A JavaScript SDK for executing multi-language code in controlled sandboxes, supporting both synchronous and asynchronous modes, as well as multi-language kernels (Python, R, Node.js, Deno/TypeScript, Java/IJAVA, Bash)

crates.io matches

Showing 12 of 16 · Rust

baracuda-kernels v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Unified ML op facade for the baracuda CUDA ecosystem. Exposes every primitive an ML framework would expect (union of PyTorch torch.* + nn.functional and JAX lax.* / numpy ops) through a single Plan-based Rust surface, internally dispatching to baracuda-cutlass, the baracuda-* NVIDIA-library wrappers, or bespoke baracuda-kernels-sys kernels.

baracuda v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Idiomatic Rust wrappers for the NVIDIA CUDA stack (Driver API, Runtime API, NVRTC, cuBLAS, cuDNN, NCCL, NVML, ...). Umbrella crate.

baracuda-kernels-types v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Shared type vocabulary for the baracuda ML kernel facade: Element / IntElement / FpElement / BiasElement trait hierarchy, layout / epilogue / activation tags, MatrixRef / TensorRef views, PlanPreference, PrecisionGuarantee, and Workspace. Lifted from baracuda-cutlass so that baracuda-kernels and the per-library wrapper crates can share one vocabulary.

baracuda-kernels-sys v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Compiled bespoke .cu kernel template instantiations for the baracuda ML kernel facade plus C-ABI FFI facades for the library-backed plans (cuDNN conv/pool, cuSOLVER linalg, cuFFT/cuRAND, CUTLASS GEMM re-export). Hosts curated CUDA kernel sources (int8/FP8/int4/bin GEMM RRR, elementwise, reduce, norm, attention, …), builds them via baracuda-forge, exposes extern "C" entry points for the safe baracuda-kernels crate. CUTLASS template kernels live in the sibling baracuda-cutlass-kernels-sys crate and are re-exported here under the unified baracuda_kernels_gemm_* namespace.

baracuda-cutlass v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Safe Rust wrapper for compiled CUTLASS kernels: plan-based GEMM and grouped GEMM with caller-supplied workspace, typed device-buffer arguments, and capture-safe launch.

baracuda-ozimmu-sys v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Build + raw FFI bindings to baracuda's clean-fork of Hiroyuki Ootomo's ozIMMU — the Ozaki-scheme FP64 GEMM library that synthesizes a DGEMM from S² int8 tensor-core matmuls. Phase 44b internalized the upstream sources under `cuda/` (no more `vendor/` subdir; cutf submodule eliminated). Linked statically into the baracuda CUDA stack; consumed by the safe wrapper crate `baracuda-ozimmu`. MIT-licensed (original ozIMMU MIT — see `ATTRIBUTION.md`).

baracuda-optim v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Optimizer kernels (Adam / LAMB / SGD) for the baracuda CUDA stack, built on the multi_tensor_apply idiom vendored from NVIDIA Apex (BSD-3-Clause). One launch over thousands of parameter tensors — critical for the optimizer step on large-model training stacks. NEW in Phase 49; deliberate scope expansion (training-framework-adjacent). Off-by-default in baracuda-kernels via the `optim` cargo feature so inference-only consumers don't pay the FFI surface cost.

baracuda-flashinfer v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Safe, typed Rust wrappers for NVIDIA FlashInfer's inference-serving kernels: batched paged-KV attention decode, decode-time KV-cache append, cascade / prefix-cache attention-state merge, and sort-free top-K / top-P / min-P sampling. The canonical vLLM-style serving surface for the baracuda CUDA stack. Apache-2.0 (FlashInfer upstream).

baracuda-forge v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Build-time CUDA kernel compiler for the baracuda ecosystem: nvcc-driven incremental builds, parallel compilation, GPU auto-detection, and CUTLASS / custom git dependency support.

baracuda-transformer-engine-sys v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Build + raw FFI bindings to baracuda's port of NVIDIA TransformerEngine's FP8 cast/transpose + delayed-scaling recipe primitives. Cast/recipe subset only — `normalization` / `fused_rope` / `fused_attn` / `fused_softmax` / `activation` / `gemm` deliberately skipped (overlap existing baracuda Phase 3/5/14/17/30/31/36/41/42). NO cuDNN dep (recipe + cast paths don't need it; `fused_attn` would, and we skip it); NO pybind11 (the safe wrapper lives in `baracuda-transformer-engine` and exposes a raw C ABI defined in `csrc/baracuda_te_shim.cu`). Apache-2.0 per upstream — see `ATTRIBUTION.md`.

baracuda-megatron v0.0.1-alpha.65

Maintained. Niche but actively maintained.

Megatron-LM-style tensor-parallel primitives (Column / Row Parallel Linear) for the baracuda CUDA stack. Pure-composition crate — local GEMM via baracuda-cublas + cross-rank collectives via baracuda-nccl. No new CUDA kernels. NEW in Phase 57; deliberate scope expansion (distributed-training-framework-adjacent). Off-by-default in baracuda-kernels via the `megatron_tp` cargo feature so non-distributed consumers don't pay the dep surface cost. Algorithmic reference: Shoeybi et al. arXiv:1909.08053 (NVIDIA Megatron-LM, Apache-2.0).

baracuda-nvrtc v0.0.1-alpha.65