Skip to content

Incorrect text extraction #86

@minrajnepali

Description

@minrajnepali

Current Behavior

Hi,

I was trying to use the library to parse an image file with a plot and got some incorrectly extracted text. For example, 'D' is detected as 'p' and 'j' as 'y'. My main goal here is to get data points from the plot and I was trying different tools. I came across Tesseract and wanted to give it a try. The first image is the output text and the second is the input image. I was using the library with Python 3.10.

Image
Image

Expected Behavior

No response

Suggested Fix

No response

tesseract -v

No response

Operating System

No response

Other Operating System

No response

uname -a

No response

Compiler

No response

CPU

No response

Virtualization / Containers

No response

Other Information

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions