This project is an AI-powered Optical Character Recognition (OCR) system that compares product details, such as price and quantity, from images of product packages. Users can upload images of the backs of two product packets, and the system extracts text from the images to output a comparison table of price and quantity. This tool leverages Tesseract OCR and OpenCV to pre-process and extract information from product images, offering a convenient solution for quick product comparison. This innovatie solution helps make informed purchasing decisions, saving time and money while ensuring they don't miss out on desired items.
-
Clone the Repository:
git clone https://github.com/Preksha2135/CartWise.git cd CartWise
-
Install Dependencies: Install the required libraries listed in
requirements.txt
:pip install -r requirements.txt
-
Install Tesseract OCR: Download and install Tesseract OCR. Add its path to your system’s environment variables.
- Windows: Download Tesseract.
- Linux/MacOS: Install Tesseract via your package manager (e.g.,
sudo apt install tesseract-ocr
for Ubuntu).
-
Test the Installation: Run the sample Jupyter notebook file to verify the setup:
jupyter notebook CartWise.ipynb
- Image Upload: Upload images of the backs of two product packets.
- Comparison: The system will preprocess images and use OCR to extract text data (price and quantity).
- Output: A table displays the extracted information, comparing the products' prices and quantities.
- Programming Language: Python
- Libraries:
- Tesseract OCR: Optical Character Recognition
- OpenCV: Image processing
- Pytesseract: Python wrapper for Tesseract OCR
- Pillow: Image handling and manipulation
- Jupyter Notebook: Interactive development and testing environment
- Project Lead: Preksha Jain (https://github.com/Preksha2135)