Optimize xilinx_async_bram_patch.tcl by eirikwitt · Pull Request #350 · vortexgpgpu/vortex

eirikwitt · 2026-05-12T14:49:24Z

When running my synthesis-to-bitstream flow targeting a U250 FPGA, the runtime can reach hundreds of hours for large configurations. More than 90% of this time is spent running the xilinx_async_bram_patch.tcl script.

This patch reduces the runtime complexity from O(x²) to O(x) by not searching through all cells when finding descendants.
For a 16-core, 2-cluster configuration, this cuts the worst-case runtime by 94%, from ~120 hours to ~7 hours.
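
To illustrate the idea behind the change, here is a minimal Python sketch. It is not the Tcl code from this patch. It assumes cells are identified by hierarchical names separated by `/` and that every intermediate level appears in the cell list. All function names and the sample cell names are hypothetical. The first function scans every cell for each lookup, while the second walks only the subtree below the requested cell using an index built once.

```python
from collections import defaultdict


def descendants_by_full_scan(all_cells, root):
    """Check every cell in the design against the root's hierarchical prefix."""
    prefix = root + "/"
    return [cell for cell in all_cells if cell.startswith(prefix)]


def build_child_index(all_cells):
    """Map each parent path to its direct children in one pass over the cells."""
    children = defaultdict(list)
    for cell in all_cells:
        parent, sep, _ = cell.rpartition("/")
        if sep:  # top-level cells have no parent to register under
            children[parent].append(cell)
    return children


def descendants_by_index(children, root):
    """Walk only the subtree under root, never touching unrelated cells."""
    found = []
    stack = [root]
    while stack:
        node = stack.pop()
        kids = children.get(node, [])
        found.extend(kids)
        stack.extend(kids)
    return found


# Hypothetical cell names, for illustration only.
cells = [
    "core0",
    "core0/cache",
    "core0/cache/ram0",
    "core0/cache/ram1",
    "core1",
    "core1/cache",
    "core1/cache/ram0",
]

index = build_child_index(cells)
scan_result = sorted(descendants_by_full_scan(cells, "core0"))
index_result = sorted(descendants_by_index(index, "core0"))
print(scan_result == index_result)
```

With the full scan, every lookup costs time proportional to the total number of cells, so repeating it for many cells grows quadratically. With the index, each lookup costs time proportional to the size of the subtree it returns.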

I do not have access to working FPGAs to test the generated bitstreams, but I verified my changes by running this modified version of the script on a wide range of configurations.

NB: This was not tested using Vortex's makefiles, but with a modified version of Chipyard's makefiles as part of an attempt to run Vortex using FireSim, so results may vary.

Optimize xilinx_async_bram_patch.tcl

9aa078c

This reduces the runtime complexity from O(x²) to O(x) by not searching through all cells when finding descendants.

eirikwitt wants to merge 1 commit into vortexgpgpu:master from eirikwitt:xillinx-bram-patching-optimization
