update
This commit is contained in:
parent
0705bc12ce
commit
cab9b3f952
@ -7,7 +7,7 @@ The Dolphin model employs a **Swin Encoder + MBart Decoder** architecture. In th
|
||||
its architectures field is specified as "VisionEncoderDecoderModel". Dolphin, Nougat, and Donut share the same model architecture. TensorRT-LLM has already supported the Nougat model.
|
||||
Following Nougat's conversion script, we have successfully implemented Dolphin on TensorRT-LLM.
|
||||
|
||||
**Note:** prompt_ids MUST be of **int32** type, otherwise TensorRT-LLM will produce incorrect results.
|
||||
**Note:** [prompt_ids](./dolphin_runner.py#L120) MUST be of **int32** type, otherwise TensorRT-LLM will produce incorrect results.
|
||||
|
||||
## 🛠️ Installation
|
||||
> We only test TensorRT-LLM 0.18.1 on Linux.
|
||||
|
Loading…
Reference in New Issue
Block a user