MyShell
  • About MyShell
    • What is MyShell
    • MyShell in a Nutshell
    • Quickstart
  • Explore AI Agents
    • Image Generation
    • Video Generation
    • Meme Generation
    • Role-Playing Game
    • Character
    • Utility
  • Create AI Agents
    • Classic Mode
      • Enhanced Prompt
      • Knowledge Base
      • Telegram Integration
    • Pro Config Mode
      • Core Concepts
      • Tutorial
        • Tutorial Structure
        • Hello World with Pro Config
        • Building Workflow
        • Transitions
        • Expressions and Variables
        • Integration with Any Widget
        • An Advanced Example
      • Basic
        • Common
        • Atomic State
        • Transition
        • Automata
        • Modules
      • Advanced
        • Cron Pusher
        • Neutral Language To SD Prompt
        • Advanced Input Validation
        • Advanced Memory Manager in Prompt Widget
      • Tools
        • AutoConfig Agent
        • Cache Mode
        • Knowledge Base Agent
        • Crawler Widget
      • Example
        • Homeless With You
        • Random Routing
        • Function Calling
      • API Reference
        • Atomic State
        • Transition
        • Automata
        • Context
        • Module
          • AnyWidget Module
            • Prompt Widget
            • LLM Widget
            • TTS Widget
            • Code Runner Widget
            • Melo TTS
            • Age Transformation
            • ChatImg
            • GIF Generation
            • Music Generation
          • LLM Module
          • LLM Function Module
          • TTS Module
          • Google Search Module
        • Widgets
          • Bark TTS
          • Champ
          • CoinGecko
          • ControlNet with Civitai
          • Crawler
          • Crypto News
          • Data Visualizer
          • Email Sender
          • Google Flight Search
          • Google Hotel Search
          • Google Image Search
          • Google Map Search
          • Google News Search
          • Google Scholar Search
          • Google Search
          • GroundedSAM
          • Image Text Fuser
          • Information Extractor - OpenAI Schema Generator
          • Information Extractor
          • Instagram Search
          • JSON to Table
          • LinkedIn
          • MS Word to Markdown
          • Markdown to MS Word
          • Markdown to PDF
          • Mindmap Generator
          • Notion Database
          • OCR
          • Pdf to Markdown
          • RMBG
          • Stabel-Video-Diffusion
          • Stable Diffusion Inpaint
          • Stable Diffusion Recommend
          • Stable Diffusion Transform
          • Stable Diffusion Upscale
          • Stable Diffusion with 6 fixed category
          • Stable Diffusion with Civitai
          • Storydiffusion
          • Suno Lyrics Generator
          • Suno Music Generator
          • Table to Markdown
          • TripAdvisor
          • Twitter Search
          • UDOP: Document Question Answering
          • Weather forecasting
          • Whisper large-v3
          • Wikipedia
          • Wolfram Alpha Search
          • Yelp Search
          • YouTube Downloader
          • YouTube Transcriber
          • Youtube Search
      • FAQs
      • Changelog
    • ShellAgent Mode
      • Download and Installation
      • App Builder
      • Workflow
      • Build Custom Widget
      • Publish to MyShell
      • Customized Pricing For Your Agent
      • Example
        • Child Book X Agent w/ DeepSeek
        • Kids Book NFT AI Agent w/ BNB Chain
        • DeFAI Agent w/ BNB Chain
  • Shell Launchpad
    • How to Launch a Token
    • Trade Agent Tokens
  • Tokenomics
    • $SHELL Basics
    • $SHELL Token Utility
    • How to Obtain $SHELL
    • Roadmap
  • Open-source AI Framework/SDK
    • ShellAgent
    • OpenVoice
    • MeloTTS
    • JetMoE
    • AIlice
  • Links
Powered by GitBook
On this page
  • Try it in the Widget Center
  • Usage
  • Detailed Guidelines
  1. Create AI Agents
  2. Pro Config Mode
  3. API Reference
  4. Widgets

OCR

Given a image containing text. return the text in the images

PreviousNotion DatabaseNextPdf to Markdown

Last updated 1 year ago

Try it in the Widget Center

Click this to try this widget and copy the Pro Config template.

Usage

<TODO: enter description here, and remove useless inputs>

Input Parameters

Name
Type
Description
Default
Required

url

string

URL of the jpg, this image contains text information that you want to use OCR to recognize

language

string

Main language of the content in the input image, specify the correct language will improve the ocr performance. Choose from ['EN', 'ZH', 'KR', 'JP', 'GE', 'FR']

return_image

boolean

Besides the returned list of text, return the annotated image as well

False

Output Parameters

Name
Type
Description
File Type

url

string

The annotated ocr image url

image

result

array

The result of ocr. A list, each element is a word.

Output Example

{  // input https://replicate.delivery/pbxt/KZczgVh1gAp7xfPP79GdZoGdKEoekcLPiqSqE6bEgM5pThGD/example_1.jpg
  "result": [
    "R\u0026D QUALITY IMPROVEMENT",
    "SUGGESTION/SOLUTION FORM",
    "Name/Phone Ext.:M.Hamann.P.Harper.P.Martinez",
    "Date:",
    "9/3/92",
    "Supervisor/Manager:L.S.Wigand",
    "R\u0026D Group_Licensee",
    "Suggestion:",
    "Discontinue coal retention analyses on licensee submitted",
    "product samples.",
    "Note Coal Retention testing is not",
    "performed by most licensees. Other B\u0026w physical",
    "measurements as ends stability and inspection for soft",
    "spots in cigarettes are thought to be sufficient measures",
    "to assure cigarette physical integrity.",
    "The proposed",
    "action will increase laboratory productivity.)",
    "Suggested Solutions: Delete coal retention from the list of standard",
    "analyses performed on licensee submitted",
    "product samples. Special requests for coal",
    "retention testing could still be submitted on",
    "an exception basis.",
    "Have you contacted your Manager/Supervisor?",
    "ves",
    "No",
    "Manager Comments: Nanager, please contact suggester and forward",
    "comments to the Quality Council.",
    "dmdyb",
    "597005708"
  ],
  "url": "https://image.myshell.ai/image/chat/embed_obj/40295/20240423/8fd1b3ede8be4c91ace2f67ecb783ccf.jpg" // if return_image is check
}
Http error

Detailed Guidelines

url