UDOP: Document Question Answering
UDOP is a unified model for document classification, layout parsing, and visual question answering, developed by Microsoft.
Try it in the Widget Center
Click this URL to try this widget and copy the Pro Config template.
Usage
Input Parameters
Name | Type | Description | Default | Required |
---|---|---|---|---|
image | | Image URL for UDOP to understand. | | |
prompt | | The text prompt in natural language. Use `Question answering. xxxxxx` to ask a question about the input image. | Question answering. In which date is the report made? | |
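The prompt convention in the table can be sketched in a few lines of Python. This is only an illustration: `build_udop_inputs` is a hypothetical helper, not part of the widget, and the image URL is a placeholder. The two keys, `image` and `prompt`, and the `Question answering. ` prefix come from the table above. How the values are actually submitted to the widget is not covered here.

```python
def build_udop_inputs(image_url: str, question: str) -> dict:
    """Build the two input parameters listed in the table.

    Hypothetical helper for illustration only; the widget itself
    does not ship this function.
    """
    image_url = image_url.strip()
    question = question.strip()
    if not image_url:
        raise ValueError("image must be a non-empty URL")
    if not question:
        raise ValueError("question must not be empty")
    return {
        "image": image_url,
        # The documented convention: prefix the question with the task name.
        "prompt": f"Question answering. {question}",
    }


inputs = build_udop_inputs(
    "https://example.com/report.png",  # placeholder URL, replace with your own
    "In which date is the report made?",
)
print(inputs["prompt"])
```

Keeping the task prefix in one place avoids typos in the prompt, which matters if the model relies on the prefix to select the question answering task.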
Output Parameters
Name | Type | Description | File Type |
---|---|---|---|
response | | The answer given, according to the image and prompt. | |
Output Example
Detailed Guidelines