mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-09-11 12:24:53 +00:00
fix: remove redundant bullet
This commit is contained in:
parent
784df59928
commit
0d03620500
@ -10,7 +10,6 @@ Below are couple of common use cases for vision language models:
|
|||||||
|
|
||||||
- **Image Captioning**: Given an image, generate a caption that describes the image.
|
- **Image Captioning**: Given an image, generate a caption that describes the image.
|
||||||
- **Visual Question Answering (VQA)**: Given an image and a question about the image, generate an answer to the question.
|
- **Visual Question Answering (VQA)**: Given an image and a question about the image, generate an answer to the question.
|
||||||
- **Visual Dialog**: Given an image and a dialog history, generate a response to the dialog.
|
|
||||||
- **Mulimodal Dialog**: Generate response to multiple turns of images and conversations.
|
- **Mulimodal Dialog**: Generate response to multiple turns of images and conversations.
|
||||||
- **Image Information Retrieval**: Given an image, retrieve information from the image.
|
- **Image Information Retrieval**: Given an image, retrieve information from the image.
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user