# Release notes

- ## Changes from 4.1.0 to 4.1.1
-
- XXX version-specific blurb XXX
-
## Changes from 4.1.1 to 4.1.2

- - Update `c-blosc2` version
+ - A new fast path in `src/blosc2/linalg.py` that uses the matmul prefilter machinery in `src/blosc2/blosc2_ext.pyx`.
+ - The fast path is only used for supported cases:
+   - `blosc2.NDArray` inputs
+   - 2-D only
+   - floating-point only
+   - matching dtypes
+   - aligned chunk/block layouts that satisfy the current kernel assumptions
+ - All other valid cases fall back to the existing chunk-by-chunk implementation in `src/blosc2/linalg.py`.
+ - Some benchmarks for the supported cases show significant speedups over the chunked implementation:
+   - aligned 400x400 float32: about 3.7x faster than chunked
+   - aligned 400x400 float64: about 3.0x faster
+   - aligned 800x800 float32: about 1.5x faster
+   - misaligned case: auto correctly stays on the chunked path

## Changes from 4.1.0 to 4.1.1

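The supported-case list in the added notes can be read as a dispatch predicate: take the fast path only when every condition holds, otherwise fall back to the chunked implementation. The sketch below is illustrative only and does not reproduce the code in `src/blosc2/linalg.py`. `ArrayInfo`, `select_matmul_path`, and `_layout_aligned` are hypothetical names, and the divisibility-based alignment rule is an assumption, since the notes do not spell out the kernel assumptions.

```python
from dataclasses import dataclass

# Assumption: the dtypes treated as "floating-point" for this sketch.
FLOAT_DTYPES = {"float32", "float64"}


@dataclass(frozen=True)
class ArrayInfo:
    """Hypothetical stand-in for the metadata of one matmul operand."""
    is_ndarray: bool  # True when the operand is a blosc2.NDArray
    shape: tuple
    dtype: str
    chunks: tuple
    blocks: tuple


def _layout_aligned(info):
    # Assumed alignment rule (not taken from the release notes):
    # blocks tile each chunk exactly and chunks tile the array exactly.
    return all(
        c % b == 0 and s % c == 0
        for s, c, b in zip(info.shape, info.chunks, info.blocks)
    )


def select_matmul_path(a, b):
    """Return "fast" when every listed condition holds, else "chunked"."""
    checks = (
        a.is_ndarray and b.is_ndarray,                        # NDArray inputs
        len(a.shape) == 2 and len(b.shape) == 2,              # 2-D only
        a.dtype in FLOAT_DTYPES and b.dtype in FLOAT_DTYPES,  # floating-point only
        a.dtype == b.dtype,                                   # matching dtypes
        _layout_aligned(a) and _layout_aligned(b),            # aligned layouts
    )
    return "fast" if all(checks) else "chunked"


aligned = ArrayInfo(True, (400, 400), "float32", (200, 200), (100, 100))
misaligned = ArrayInfo(True, (400, 400), "float32", (150, 150), (100, 100))

print(select_matmul_path(aligned, aligned))     # fast
print(select_matmul_path(aligned, misaligned))  # chunked
```

Keeping the decision in a single predicate that defaults to the chunked path means any input outside the supported set still gets a correct result, which matches the fallback behaviour described in the notes.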