How can Google Cloud Vision be optimized to enhance OCR capabilities?

Question

How can Google Cloud Vision be optimized to enhance OCR capabilities?

Recently, I decided to put Google cloud vision's OCR to the test, only to be disappointed by the subpar results. Despite my documents being in French, the OCR seemed to struggle with detecting apostrophes and commas accurately. For instance, when using the following image as input https://i.sstatic.net/i1Ljv.jpg

The code used was as follows:

Request
        .post(`https://vision.googleapis.com/v1/images:annotate?key=AIzaSyAtArxxxxxxxxxxxxxxxxxpGrKrydU4`)
        .send({
          requests: [{
            image: { content: base64.replace('data:image/jpeg;base64,', '') },
            features: [{ type: 'DOCUMENT_TEXT_DETECTION' }],
            "imageContext": { "languageHints": [ "fr" ] }
          }]
        })

The result obtained (with errors highlighted in yellow) can be viewed here: https://i.sstatic.net/oKCyX.png

In contrast, Microsoft Azure OCR provided a flawless result without requiring me to specify the language, as seen here.

I'm curious if others have experienced similar accuracy issues with Google Cloud Vision's OCR.

javascript azure ocr google-cloud-vision

Answer 1

Answer №1

To specify the language, include "languageHints": ["fr"]

{
  "requests": [
    {
      "imageContext": {
        "languageHints": ["fr"]
      }
    }
  ]
}

Answer 2

To specify the language, include "languageHints": ["fr"]

{
  "requests": [
    {
      "imageContext": {
        "languageHints": ["fr"]
      }
    }
  ]
}

How can Google Cloud Vision be optimized to enhance OCR capabilities?

Answer №1

Similar questions

Examining the potential of a promise within a dynamic import feature in Angular

Tips on creating a transition in React to showcase fresh HTML content depending on the recent state changes

What is the process for invoking a server-side C# method from AJAX while transmitting parameters to the function using CommandArgument?

Generate a responsive list with a pop-up feature under each item (using Vue.js)

Preview and enlarge images similar to the way Firefox allows you to do

JavaScript functions may become unresponsive following ajax loading with tables

Encountering 404 errors on dynamic routes following deployment in Next.JS

What is the best way to reject input that includes white space characters?

How to Retrieve the Value of a Radio Button with JavaScript

Place the image on the canvas and adjust its shape to a circle

Upload an image converted to `toDataURL` to the server

What is the best way to deactivate div elements once an overlay has been applied to them?

Ways to create two separate references pointing to a single object

How can I add a group label to search options in "React Select"?

"Exploring the relationship between Javascript objects and the '

JQuery: Creating a visually striking screen flash

Passing parameters between various components in a React application

Different Ways Split Button Format Can Vary Based on Web Browser

The dispatch function in redux-thunk is not functioning as expected

How can I use React to create a dictionary that modifies several values depending on a single property?