Skip to content

Commit 003de03

Browse files
committed
Update blog
1 parent 6438beb commit 003de03

1 file changed

Lines changed: 1 addition & 0 deletions

File tree

content/blog/2025-08-25-1756113601.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -53,6 +53,7 @@ For 10k iterations:
5353
| 2.3s | ggml + CUDA | Windows |
5454
| 5.3s | ggml + Vulkan | Windows |
5555

56+
5657
It's interesting that `torch.compile()` was slower than plain torch on both Windows and Ubuntu Linux (WSL). And plain torch was pretty close to TensorRT and TensorRT RTX on Windows.
5758

5859
Maybe the model (and data) is too small? I'll pick a more representative model next - the Unet of a Stable Diffusion 1.5 model.

0 commit comments

Comments
 (0)