Skip to content

Running Bamba on vLLM #3

@ani300

Description

@ani300

This issue tracks progress on running Bamba on vLLM.

Success for this issue implies the following:

  • Running the model successfully from the HF checkpoint in vLLM (Add Bamba Model vllm-project/vllm#10909)
  • Ensuring chunked prefill and TP work in vLLM
  • Closing the performance gap in vLLM wrt Llama of similar sizes
  • Reporting the performance results in a blog post

cc @raghukiran1224 @fabianlim @AdnanHoque

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions