Commit f2678e8

Some notes for the forthcoming release
1 parent 22ff4be commit f2678e8

1 file changed

Lines changed: 13 additions & 5 deletions

RELEASE_NOTES.md

@@ -1,12 +1,20 @@
 # Release notes
 
-## Changes from 4.1.0 to 4.1.1
-
-XXX version-specific blurb XXX
-
 ## Changes from 4.1.1 to 4.1.2
 
-- Update `c-blosc2` version
+- A new fast path for src/blosc2/linalg.py that uses the matmul prefilter machinery in src/blosc2/blosc2_ext.pyx.
+- The fast path is only used for supported cases:
+  - blosc2.NDArray inputs
+  - 2-D only
+  - floating-point only
+  - matching dtypes
+  - aligned chunk/block layouts that satisfy the current kernel assumptions
+- All other valid cases fall back to the existing chunk-by-chunk implementation in src/blosc2/linalg.py.
+- Some benchmarks for the supported cases show significant speedups over the chunked implementation:
+  - aligned 400x400 float32: about 3.7x faster over chunked
+  - aligned 400x400 float64: about 3.0x
+  - aligned 800x800 float32: about 1.5x
+  - misaligned case: auto correctly stays on chunked
 
 ## Changes from 4.1.0 to 4.1.1
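
The dispatch rule described in the added notes (use the fast path only when every listed condition holds, otherwise fall back to the chunked implementation) can be sketched roughly as below. This is a hypothetical illustration, not the code in src/blosc2/linalg.py: the names `ArrayInfo`, `fast_path_eligible`, and `choose_matmul_path` are invented for this sketch, "floating-point" is narrowed to float32/float64, and "aligned" is assumed to mean that each dimension divides evenly into chunks and each chunk into blocks. The notes do not spell out the real kernel assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ArrayInfo:
    """Hypothetical stand-in for the metadata of a blosc2.NDArray operand."""
    shape: tuple
    chunks: tuple
    blocks: tuple
    dtype: str  # plain string such as "float32" keeps the sketch dependency-free


def _is_aligned(info):
    # ASSUMPTION: "aligned" means every dimension splits evenly into chunks
    # and every chunk splits evenly into blocks. The real kernel may differ.
    return all(
        s % c == 0 and c % b == 0
        for s, c, b in zip(info.shape, info.chunks, info.blocks)
    )


def fast_path_eligible(a, b):
    """Return True when both operands meet the conditions listed in the notes."""
    for info in (a, b):
        if len(info.shape) != 2:  # 2-D only
            return False
        if info.dtype not in ("float32", "float64"):  # assumed float dtypes
            return False
        if not _is_aligned(info):  # assumed alignment rule, see above
            return False
    return a.dtype == b.dtype  # matching dtypes


def choose_matmul_path(a, b):
    """Pick "fast" for supported cases, otherwise the "chunked" fallback."""
    return "fast" if fast_path_eligible(a, b) else "chunked"
```

Under these assumptions, two 400x400 float32 operands with 200x200 chunks and 100x100 blocks would take the fast path, while a 150x150 chunk shape (which does not divide 400) or a float32/float64 mix would stay on the chunked implementation.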
