diff --git a/deployment/tensorrt_llm/ReadMe.md b/deployment/tensorrt_llm/ReadMe.md
index ffc8c42..eb51810 100644
--- a/deployment/tensorrt_llm/ReadMe.md
+++ b/deployment/tensorrt_llm/ReadMe.md
@@ -7,7 +7,7 @@ The Dolphin model employs a **Swin Encoder + MBart Decoder** architecture. In th
 its architectures field is specified as "VisionEncoderDecoderModel". Dolphin, Nougat, and Donut share the same model architecture. TensorRT-LLM has already supported the Nougat model.
 Following Nougat's conversion script, we have successfully implemented Dolphin on TensorRT-LLM.
 
-**Note:** prompt_ids MUST be of **int32** type, otherwise TensorRT-LLM will produce incorrect results.
+**Note:** [prompt_ids](./dolphin_runner.py#L120) MUST be of **int32** type, otherwise TensorRT-LLM will produce incorrect results.
 
 ## 🛠️ Installation
 > We only test TensorRT-LLM 0.18.1 on Linux.
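
The **Note** touched by this hunk requires `prompt_ids` to be **int32**. A minimal sketch of how a caller might enforce that before handing the ids to the runner is shown below. It assumes the ids are available as a NumPy-compatible integer array. The helper name `to_int32_prompt_ids` and the sample token ids are hypothetical and are not taken from `dolphin_runner.py`.

```python
import numpy as np


def to_int32_prompt_ids(prompt_ids):
    """Return prompt_ids as a contiguous int32 array, refusing lossy casts.

    Hypothetical helper for illustration; not part of dolphin_runner.py.
    """
    ids = np.asarray(prompt_ids)
    if not np.issubdtype(ids.dtype, np.integer):
        raise TypeError(f"prompt_ids must be integers, got {ids.dtype}")
    # Reject values that would silently wrap around when narrowed to int32.
    info = np.iinfo(np.int32)
    if ids.size and (ids.min() < info.min or ids.max() > info.max):
        raise ValueError("prompt_ids contain values outside the int32 range")
    return np.ascontiguousarray(ids, dtype=np.int32)


# Tokenizers commonly emit int64 ids; the values here are placeholders.
ids64 = np.array([[0, 2, 50265, 17]], dtype=np.int64)
ids32 = to_int32_prompt_ids(ids64)
print(ids32.dtype)
```

Checking the range before casting is a design choice in this sketch: a plain `astype(np.int32)` would also satisfy the dtype requirement, but it would wrap out-of-range values silently instead of raising.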