Java自学者论坛 (Java Self-Learners Forum)


An OCR solution on Android (Tesseract)

    Posted 2021-5-3 12:21:25

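      There are roughly two kinds of OCR solutions for Android apps, and the most widely used is still Tesseract. Here I'm writing down the approach I worked out over the last two days. If you spot any flaws, feel free to point them out:

      There are two approaches. The first uses the Tesseract cloud service: the image is sent to the cloud, and the analysis result is returned. The second needs no network connection and analyzes the image locally on the device. I'll cover the second one here; for the first, I'll give a link at the end (it's a very good article).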

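      First, download the Tesseract native Android library. There are two sources, and either one works:

      a. svn checkout http://tesseract-android-tools.googlecode.com/svn/trunk/ tesseract-android-tools (if the checkout fails, just download it from the project page: http://code.google.com/p/tesseract-android-tools/)

      b. Some people run into problems compiling the first source, for example the jpeg library cannot be found and the build fails. That's why this project exists: git clone git://github.com/rmtheis/tess-two.git (this package contains a lot, but it saves you from downloading all those libraries separately)

      Starting with the first source: once the download succeeds, open the README file and make the following change: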

    git clone git://android.git.kernel.org/platform/external/jpeg.git libjpeg
    change it to:
    git clone https://android.googlesource.com/platform/external/jpeg libjpeg
    ndk-build   # run this build from inside the jni folder

      

      The second source has no README file, so the commands are as follows:

    cd <project-directory>/tess-two
    export TESSERACT_PATH=${PWD}/external/tesseract-3.01
    export LEPTONICA_PATH=${PWD}/external/leptonica-1.68
    export LIBJPEG_PATH=${PWD}/external/libjpeg
    ndk-build
    android update project --path .
    ant release

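      Either way you end up with what you need: the .so files under libs and the classes under src that wrap those .so files. These are what we develop against.

      Then create a new project with the following code: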

    import android.app.Activity;
    import android.graphics.Bitmap;
    import android.graphics.BitmapFactory;
    import android.graphics.Matrix;
    import android.media.ExifInterface;
    import android.os.Bundle;
    import android.os.Handler;
    import android.util.Log;
    import android.view.View;
    import android.widget.ImageView;
    import android.widget.TextView;

    import com.googlecode.tesseract.android.TessBaseAPI;

    import java.io.File;
    import java.io.IOException;

    public class MainActivity extends Activity {
        private static final String TAG = "MainActivity";

        // Tesseract looks for <TESSBASE_PATH>/tessdata/<language>.traineddata.
        private static final String TESSBASE_PATH = "/mnt/sdcard/tesseract/";
        private static final String DEFAULT_LANGUAGE = "eng";
        private static final String IMAGE_PATH = "/mnt/sdcard/test1.jpg";
        private static final String EXPECTED_FILE = TESSBASE_PATH + "tessdata/" + DEFAULT_LANGUAGE
                + ".traineddata";

        // Handler bound to the main thread; post() queues the work on that same thread.
        private final Handler mHandler = new Handler();

        @Override
        protected void onCreate(Bundle savedInstanceState) {
            super.onCreate(savedInstanceState);
            setContentView(R.layout.main);
            testOcr();
        }

        public void testOcr() {
            mHandler.post(new Runnable() {
                @Override
                public void run() {
                    Log.d(TAG, "begin>>>>>>>");
                    ocr();
                    //test();
                }
            });
        }

        /** Alternative test: recognizes a bundled drawable instead of a file on the SD card. */
        public void test() {
            // First, make sure the eng.traineddata file exists.
            if (!new File(EXPECTED_FILE).exists()) {
                Log.e(TAG, "Make sure that you've copied " + DEFAULT_LANGUAGE + ".traineddata to "
                        + EXPECTED_FILE);
                return;
            }
            final TessBaseAPI baseApi = new TessBaseAPI();
            baseApi.init(TESSBASE_PATH, DEFAULT_LANGUAGE);
            final Bitmap bmp = BitmapFactory.decodeResource(getResources(), R.drawable.test);
            ImageView img = (ImageView) findViewById(R.id.image);
            img.setImageBitmap(bmp);
            baseApi.setImage(bmp);

            String outputText = baseApi.getUTF8Text();
            Log.v(TAG, "OCR Result: " + outputText);
            baseApi.end();
            // The bitmap is not recycled here because the ImageView above still draws it.
        }

        protected void ocr() {
            BitmapFactory.Options options = new BitmapFactory.Options();
            options.inSampleSize = 2; // decode at half width and height to save memory
            Bitmap bitmap = BitmapFactory.decodeFile(IMAGE_PATH, options);
            if (bitmap == null) {
                Log.e(TAG, "Could not decode " + IMAGE_PATH);
                return;
            }

            try {
                // Read the EXIF orientation so that rotated photos are turned upright first.
                ExifInterface exif = new ExifInterface(IMAGE_PATH);
                int exifOrientation = exif.getAttributeInt(ExifInterface.TAG_ORIENTATION,
                        ExifInterface.ORIENTATION_NORMAL);
                Log.v(TAG, "Orient: " + exifOrientation);

                int rotate = 0;
                switch (exifOrientation) {
                    case ExifInterface.ORIENTATION_ROTATE_90:
                        rotate = 90;
                        break;
                    case ExifInterface.ORIENTATION_ROTATE_180:
                        rotate = 180;
                        break;
                    case ExifInterface.ORIENTATION_ROTATE_270:
                        rotate = 270;
                        break;
                }
                Log.v(TAG, "Rotation: " + rotate);

                if (rotate != 0) {
                    // Getting width & height of the given image.
                    int w = bitmap.getWidth();
                    int h = bitmap.getHeight();

                    // Setting pre rotate
                    Matrix mtx = new Matrix();
                    mtx.preRotate(rotate);

                    // Rotating Bitmap
                    bitmap = Bitmap.createBitmap(bitmap, 0, 0, w, h, mtx, false);
                }
            } catch (IOException e) {
                Log.e(TAG, "Rotate or conversion failed: " + e.toString());
            }

            // Tesseract requires ARGB_8888, so convert even when no rotation was needed.
            if (bitmap.getConfig() != Bitmap.Config.ARGB_8888) {
                bitmap = bitmap.copy(Bitmap.Config.ARGB_8888, true);
            }

            ImageView iv = (ImageView) findViewById(R.id.image);
            iv.setImageBitmap(bitmap);
            iv.setVisibility(View.VISIBLE);

            Log.v(TAG, "Before baseApi");

            TessBaseAPI baseApi = new TessBaseAPI();
            baseApi.setDebug(true);
            baseApi.init(TESSBASE_PATH, DEFAULT_LANGUAGE);
            baseApi.setImage(bitmap);
            String recognizedText = baseApi.getUTF8Text();
            baseApi.end();

            Log.v(TAG, "OCR Result: " + recognizedText);

            // Clean up and show: for English, collapse every run of non-alphanumerics to a space.
            if (DEFAULT_LANGUAGE.equalsIgnoreCase("eng")) {
                recognizedText = recognizedText.replaceAll("[^a-zA-Z0-9]+", " ");
            }
            if (recognizedText.length() != 0) {
                ((TextView) findViewById(R.id.field)).setText(recognizedText.trim());
            }
        }
    }
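The rotation handling in ocr() comes down to a small mapping from the EXIF orientation value to a clockwise angle. Here is a rough standalone sketch of just that mapping, so it can be tried outside an Android project. The class name ExifRotation and the method toDegrees are made up for illustration; the numeric values are the standard EXIF orientation codes, which android.media.ExifInterface exposes under the same constant names.

```java
/** Illustrative helper (not part of the original project): EXIF orientation to degrees. */
final class ExifRotation {
    // Standard EXIF Orientation tag values, mirrored here so the sketch runs without Android.
    static final int ORIENTATION_NORMAL = 1;
    static final int ORIENTATION_ROTATE_180 = 3;
    static final int ORIENTATION_ROTATE_90 = 6;
    static final int ORIENTATION_ROTATE_270 = 8;

    /** Returns how many degrees clockwise the bitmap must be rotated to appear upright. */
    static int toDegrees(int exifOrientation) {
        switch (exifOrientation) {
            case ORIENTATION_ROTATE_90:
                return 90;
            case ORIENTATION_ROTATE_180:
                return 180;
            case ORIENTATION_ROTATE_270:
                return 270;
            default:
                // Normal, undefined, or mirrored orientations: leave the bitmap as it is.
                return 0;
        }
    }

    public static void main(String[] args) {
        System.out.println(toDegrees(ORIENTATION_ROTATE_90)); // prints 90
    }
}
```

The returned angle is what gets passed to Matrix.preRotate() in the activity code.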


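      When you happily run the program, you'll find that things are not as simple as you imagined. The library needs a language data file; otherwise, what would it match against? It makes sense when you think about it: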

    adb shell mkdir /mnt/sdcard/tesseract
    adb shell mkdir /mnt/sdcard/tesseract/tessdata
    adb push eng.traineddata /mnt/sdcard/tesseract/tessdata/eng.traineddata
    adb shell ls -l /mnt/sdcard/tesseract/tessdata
    ls -l bin/tesseract-android-tools-test.apk
    adb install -r -s bin/tesseract-android-tools-test.apk
    adb shell am instrument -w -e class com.googlecode.tesseract.android.test.TessBaseAPITest com.googlecode.tesseract.android.test/android.test.InstrumentationTestRunner
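Since a missing language file is the first thing that goes wrong, it may be worth checking for it before calling init(). Below is a minimal sketch using only java.io.File, assuming the same layout as the adb commands above (<base>/tessdata/<language>.traineddata). The class TessDataCheck and its method names are hypothetical and not part of the Tesseract wrapper.

```java
import java.io.File;

/** Illustrative helper (not part of the Tesseract wrapper): checks for a language file. */
final class TessDataCheck {
    /** Builds the path <tessBasePath>/tessdata/<language>.traineddata. */
    static File expectedFile(String tessBasePath, String language) {
        return new File(new File(tessBasePath, "tessdata"), language + ".traineddata");
    }

    /** True if the language file exists as a regular, non-empty file. */
    static boolean isLanguageInstalled(String tessBasePath, String language) {
        File file = expectedFile(tessBasePath, language);
        return file.isFile() && file.length() > 0;
    }

    public static void main(String[] args) {
        String base = "/mnt/sdcard/tesseract/";
        if (!isLanguageInstalled(base, "eng")) {
            System.out.println("Missing: " + expectedFile(base, "eng").getPath());
        }
    }
}
```

In the activity, a check like this could run before creating the TessBaseAPI instance and log an error instead of continuing.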

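      As for the eng.traineddata file used above: you can search for it, it's available online. (Embarrassingly, I still don't know how to upload attachments here.)

      The final result is shown in the picture (the text actually recognized was: 44m><9. It just failed to recognize that one character):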

                                        
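To see what the cleanup step at the end of ocr() does to a raw result like the one above, here is a small sketch that applies the same regular expression in plain Java. The class name OcrTextCleanup and the method cleanEnglish are made up for illustration; the pattern is the one from the activity code.

```java
/** Illustrative helper: the same cleanup the activity applies to English OCR output. */
final class OcrTextCleanup {
    /** Replaces each run of non-alphanumeric characters with one space, then trims. */
    static String cleanEnglish(String recognizedText) {
        return recognizedText.replaceAll("[^a-zA-Z0-9]+", " ").trim();
    }

    public static void main(String[] args) {
        System.out.println(cleanEnglish("44m><9")); // prints "44m 9"
    }
}
```

So the raw text appears in the log as is, while the TextView shows the version with the unrecognized symbols replaced by a space.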


    References:

      http://wolfpaulus.com/journal/android-and-ocr

      http://labs.makemachine.net/2010/03/simple-android-photo-capture/
