CubeOCR is a Python-based screenshot OCR tool that converts scanned documents, PDF files, and images into editable text.
- User-Friendly Interface: Simple and intuitive interface for easy use.
- Text Editor with Built-in Toolbar: Edit the recognized text directly in the application, saving time and effort.
- Transparent Background: Suitable for side-by-side paragraphs and code editing.
- Advanced Image Processing: Enhance image quality with noise filtering features.
- Multi-Language Support: Supports text recognition in 107 languages.
- Output Formats: Supports various output formats: plain text, PDF, and Word documents.
- Offline Usage: Works without an account sign-in, keeping user data private.
- Hotkeys: Activate the OCR screen, then copy and paste the text into other applications.
Prerequisites:
- Operating System: Windows 10 or newer.
- Tesseract-OCR >= 5.0 [download]
- Python >= 3.11
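The prerequisites above can be checked before installing. The following is a minimal sketch, not part of CubeOCR itself: it assumes the `tesseract` executable is on the `PATH` (a Windows install may need this added manually), and the helper names `parse_tesseract_major` and `check_prerequisites` are hypothetical.

```python
import shutil
import subprocess
import sys
from typing import Optional

MIN_PYTHON = (3, 11)       # from the prerequisites list
MIN_TESSERACT_MAJOR = 5    # from the prerequisites list


def parse_tesseract_major(version_output: str) -> Optional[int]:
    """Extract the major version from the first line of `tesseract --version`."""
    text = version_output.strip()
    first_line = text.splitlines()[0] if text else ""
    # Typical first line: "tesseract 5.3.0" or "tesseract v5.3.0.20221214"
    parts = first_line.split()
    if len(parts) < 2:
        return None
    major = parts[1].lstrip("v").split(".")[0]
    return int(major) if major.isdigit() else None


def check_prerequisites() -> list[str]:
    """Return a list of human-readable problems; empty means all checks passed."""
    problems = []
    if sys.version_info < MIN_PYTHON:
        problems.append("Python 3.11 or newer is required")

    # Assumption: Tesseract is reachable through PATH.
    exe = shutil.which("tesseract")
    if exe is None:
        problems.append("tesseract executable not found on PATH")
    else:
        result = subprocess.run([exe, "--version"], capture_output=True, text=True)
        # Some builds print the version to stderr, so fall back to it.
        major = parse_tesseract_major(result.stdout or result.stderr)
        if major is None or major < MIN_TESSERACT_MAJOR:
            problems.append("Tesseract-OCR 5.0 or newer is required")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("Missing prerequisite:", problem)
```

Running the script prints one line per unmet prerequisite and nothing when all checks pass.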
To install and run this project:
- Clone the repository:
    git clone https://github.com/OCR-tech/CubeOCR.git
- Navigate to the project directory:
    cd CubeOCR
- Create a virtual environment:
    python -m venv .venv
- Activate the virtual environment:
    .\.venv\Scripts\Activate
- Install the dependencies:
    pip install -r requirements.txt
- Run the application:
    python app/main.py
CubeOCR is an easy-to-use OCR tool designed to extract text from scanned documents, PDF files, or images. Simply select a text area, perform OCR, and paste the result anywhere.
- Select a text area on the screen.
- Click the "OK" button after the OCR conversion.
- Paste the editable text into other applications.
- Use the built-in text editor to edit the recognized text.
- Use the system tray icon for quick access to the application.
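For readers curious about what the OCR step involves, the sketch below calls the Tesseract command-line tool directly on a saved image. It is an illustration under stated assumptions, not CubeOCR's own implementation: the function names and the `shot.png` file name are hypothetical, `tesseract` is assumed to be on the `PATH`, and the language packs named in `languages` are assumed to be installed.

```python
import subprocess


def build_tesseract_command(image_path, languages=("eng",), psm=6):
    """Build a Tesseract CLI call that prints recognized text to stdout."""
    if not languages:
        raise ValueError("at least one language code is required")
    return [
        "tesseract",
        str(image_path),
        "stdout",                    # print text instead of writing a file
        "-l", "+".join(languages),   # e.g. "eng+deu" for English and German
        "--psm", str(psm),           # 6 = assume a single uniform block of text
    ]


def recognize_text(image_path, languages=("eng",)):
    """Run Tesseract on an image file and return the recognized text."""
    cmd = build_tesseract_command(image_path, languages)
    # check=True raises CalledProcessError if Tesseract exits with an error.
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout.strip()


if __name__ == "__main__":
    # Hypothetical screenshot file; replace with a real image path.
    print(recognize_text("shot.png", languages=("eng",)))
```

Joining language codes with `+` lets a single pass recognize mixed-language text, which is how a multi-language selection could be passed to Tesseract.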
Note: CubeOCR is available for installation on Windows [download].
- See CONTRIBUTING for detailed guidelines.
- This project is licensed under the MIT License.
- This project depends on other software and packages:
  - Tesseract-OCR uses the Apache License.
  - Leptonica library uses the BSD 2-clause License.
- Email: ocrtech.mail@gmail.com
- Website: https://ocr-tech.github.io
- GitHub: https://github.com/OCR-tech

