OCR Test

May 11, 2014:
Fixed a bug in the saving of character blacklist/whitelist values.
Upgraded to use Tesseract v3.03 r1098.
Dec. 13, 2013:
Fixed a forward-compatibility bug that hid the menu on newer devices.
Added a preference setting to disable focus modes that don't work well on some devices.
Upgraded to use Tesseract v3.03.
Updated to retrieve the newest version of the Tesseract data files.
Added new OCR languages.
Added new translation languages.
Rating: 5 (0 reviews)
36 downloads, 8.32 MB
Version: V0.5.14

OCR Test app description:

Experimental app for optical character recognition (OCR).
Runs the Tesseract 3.03 open source OCR engine to find text in images captured by the device camera.
This app runs OCR on your device, without uploading your images to a server, and is suitable for recognizing individual words or short phrases of text. Translation (powered by Google/Microsoft) can be run after OCR.
The default single-shot capture runs OCR on a snapshot image that's captured when you click the shutter button, like a regular photo.
When the "continuous preview" checkbox is checked, the app shows a dynamic, real-time display of what the device is recognizing right beside the camera viewfinder. The continuous preview mode works best on a fast device.
USING THIS APP
- Point the device at a small region of text and touch the on-screen shutter button to start OCR.
- To copy text to the clipboard or share text, long-press on the text after pressing the shutter button.
- For recognizing individual Chinese/Japanese/Korean characters, set the page segmentation mode to "single character."
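In the app, the page segmentation mode and the character whitelist are set in the preferences. For readers who want to try the same settings with a desktop Tesseract install, here is a rough sketch that only builds the command line. It rests on some assumptions: the helper name `tesseract_args` and the file names are hypothetical, the mode numbers follow Tesseract's PageSegMode enum, and the `-psm` spelling is the one used by 3.x releases (newer releases expect `--psm`).

```python
# Page segmentation mode numbers from Tesseract's PageSegMode enum.
PSM_AUTO = 3          # fully automatic page segmentation (Tesseract's default)
PSM_SINGLE_WORD = 8   # treat the image as a single word
PSM_SINGLE_CHAR = 10  # treat the image as a single character


def tesseract_args(image, out_base, lang="eng", psm=PSM_AUTO,
                   whitelist=None, legacy_flag=True):
    """Build an argument list for the desktop `tesseract` command.

    This is an illustrative helper, not part of the app. With
    legacy_flag=True it emits `-psm` (3.x spelling); with False, `--psm`.
    """
    args = ["tesseract", image, out_base, "-l", lang,
            "-psm" if legacy_flag else "--psm", str(psm)]
    if whitelist:
        # tessedit_char_whitelist limits recognition to the listed characters.
        args += ["-c", "tessedit_char_whitelist=" + whitelist]
    return args


# Hypothetical usage: one Chinese character cropped into glyph.png.
print(tesseract_args("glyph.png", "out", lang="chi_sim", psm=PSM_SINGLE_CHAR))
```

The resulting list can be passed to `subprocess.run` on a machine where Tesseract and the matching language data are installed.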
RECOGNITION ACCURACY
- Various factors can cause the OCR to fail: uneven illumination, stylized text, or text without enough contrast from the background. Try to have good lighting.
- Hold the device steady, and be sure the picture is in focus.
- If you need to scan a large block of text or an entire document, consider using a flatbed scanner or a document scanning app such as TextFairy instead.
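As an illustration of the contrast point above, the sketch below scores a patch of grayscale pixels with the Michelson contrast, (max - min) / (max + min). This is not something the app is known to do; the function names and the 0.5 cut-off are assumptions chosen for the example.

```python
def michelson_contrast(gray_pixels):
    """Return (max - min) / (max + min) for grayscale values in 0..255."""
    lo, hi = min(gray_pixels), max(gray_pixels)
    if hi + lo == 0:
        return 0.0  # an all-black patch has no contrast
    return (hi - lo) / (hi + lo)


def looks_readable(gray_pixels, threshold=0.5):
    """Guess whether text stands out from its background.

    The threshold is an arbitrary illustrative value, not a Tesseract setting.
    """
    return michelson_contrast(gray_pixels) >= threshold


# Dark text on a light page versus grey text on a grey page.
print(looks_readable([20, 30, 240, 250]))
print(looks_readable([120, 130, 140]))
```

A check like this could run on a preview frame to hint that the lighting needs improving before the shutter button is pressed.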
LANGUAGES
- This app supports several languages not supported by Google Goggles/Google Translate.
- Supported languages for OCR: Afrikaans, Albanian, Arabic, Azeri, Basque, Belarusian, Bengali, Bulgarian, Catalan, Chinese (Simplified), Chinese (Traditional), Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Macedonian, Malay, Malayalam, Maltese, Polish, Portuguese, Romanian, Russian, Serbian (Latin), Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Telugu, Thai, Turkish, Ukrainian, and Vietnamese.
- Arabic OCR requires a large amount of RAM. If your device doesn't have enough RAM, the app will quit during OCR.
SAMSUNG DEVICE NOTES
- On Samsung Galaxy devices, you may need to long-press the menu button to set preferences.
- You may get better results if you un-check "Standard focus mode".
DEVELOPMENT NOTES
- This is an open source project. The source code is available at https://github.com/rmtheis/android-ocr.
- Since the release of this app, Google Goggles has added a "continuous mode" and Google Translate has added OCR-based translations. There is also one VC-funded startup that has used this app as a starting point.
- Thanks to the contributors: Spoorthi, Hunvil, Jingjing, Xuyuan, and Mandar.
My latest translation app:
https://www.google.com/url?q=https://play.google.com/store/apps/details?id=com.rmtheis.translator

Developer: Robert Theis
