update doc
This commit is contained in:
parent
cab9b3f952
commit
f171021615
@ -4,7 +4,7 @@
|
|||||||
|
|
||||||
## ✅ Introduction
|
## ✅ Introduction
|
||||||
The Dolphin model employs a **Swin Encoder + MBart Decoder** architecture. In the HuggingFace Transformers [Config](https://huggingface.co/ByteDance/Dolphin/blob/main/config.json),
|
The Dolphin model employs a **Swin Encoder + MBart Decoder** architecture. In the HuggingFace Transformers [Config](https://huggingface.co/ByteDance/Dolphin/blob/main/config.json),
|
||||||
its architectures field is specified as "VisionEncoderDecoderModel". Dolphin, Nougat, and Donut share the same model architecture. TensorRT-LLM has already supported the Nougat model.
|
its architectures field is specified as "VisionEncoderDecoderModel". **Dolphin**, **[Nougat](https://huggingface.co/docs/transformers/model_doc/nougat)**, and **[Donut](https://huggingface.co/docs/transformers/model_doc/donut)** share the same model architecture. TensorRT-LLM has already supported the Nougat model.
|
||||||
Following Nougat's conversion script, we have successfully implemented Dolphin on TensorRT-LLM.
|
Following Nougat's conversion script, we have successfully implemented Dolphin on TensorRT-LLM.
|
||||||
|
|
||||||
**Note:** [prompt_ids](./dolphin_runner.py#L120) MUST be of **int32** type, otherwise TensorRT-LLM will produce incorrect results.
|
**Note:** [prompt_ids](./dolphin_runner.py#L120) MUST be of **int32** type, otherwise TensorRT-LLM will produce incorrect results.
|
||||||
|
Loading…
Reference in New Issue
Block a user