Skip to content

Add colbert-pooling sample app: ColBERT with hierarchical token pooling#1889

Draft
thomasht86 wants to merge 1 commit intomasterfrom
thomasht86/colbert-pooling-binarized
Draft

Add colbert-pooling sample app: ColBERT with hierarchical token pooling#1889
thomasht86 wants to merge 1 commit intomasterfrom
thomasht86/colbert-pooling-binarized

Conversation

@thomasht86
Copy link
Copy Markdown
Contributor

PoolingColBertEmbedder runs ColBERT-Zero ONNX model inside Vespa and applies Ward's agglomerative clustering to reduce per-document token count. Binary pooled embeddings are HNSW-indexed in memory for fast ANN retrieval; full bfloat16 embeddings stay paged on disk for exact second-phase reranking.

ColBertSearcher handles query embedding, binarisation, and nearestNeighbor query construction so users just send text.

Verified against pylate (model authors' reference): cos_sim > 0.999 per token vs ONNX INT8 reference, 100% bit-exact binarisation.

I confirm that this contribution is made under the terms of the license found in the root directory of this repository's source tree and that I have the authority necessary to make this contribution on behalf of its copyright owner.

PoolingColBertEmbedder runs ColBERT-Zero ONNX model inside Vespa and applies
Ward's agglomerative clustering to reduce per-document token count. Binary
pooled embeddings are HNSW-indexed in memory for fast ANN retrieval;
full bfloat16 embeddings stay paged on disk for exact second-phase reranking.

ColBertSearcher handles query embedding, binarisation, and nearestNeighbor
query construction so users just send text.

Verified against pylate (model authors' reference): cos_sim > 0.999 per token
vs ONNX INT8 reference, 100% bit-exact binarisation.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This isn't needed since the inputs are specified in the rank profile.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants