OpenReader WebUI is a sleek, web-based PDF reader with built-in Text-to-Speech capabilities, offering a seamless reading experience with synchronized audio narration.
- 🎯 TTS API Integration: Compatible with OpenAI API TTS and Kokoro FastAPI TTS, enabling high-quality voice narration
- 💾 Local-First Architecture: Secure document handling with IndexedDB browser storage - no server uploads required
- 🔍 Smart Text Processing: Real-time sentence detection and synchronized text highlighting during playback
- ⚡ Modern Tech Stack: Built with Next.js, React, and Tailwind CSS
- 🎨 Customizable Experience:
- Adjustable playback speed
- Multiple voice options
- Dark/light/system theme support
- Persistent user settings
- 📱 Cross-Platform: Responsive design works seamlessly across desktop and mobile devices
Screen.Recording.2025-02-04.at.3.33.16.AM.mov
docker run --name openreader-webui -p 3003:3003 richardr1126/openreader-webui:latest
Visit http://localhost:3003 to run the app.
Create or add to a docker-compose.yml
:
services:
openreader-webui:
container_name: openreader-webui
image: richardr1126/openreader-webui:latest
ports:
- "3003:3003"
restart: unless-stopped
- Node.js & npm (recommended: use nvm)
-
Clone the repository:
git clone https://github.com/richardr1126/OpenReader-WebUI.git cd OpenReader-WebUI
-
Install dependencies:
npm install
-
Configure the environment:
cp .env.template .env # Edit .env with your configuration settings
-
Start the development server:
npm run dev
or build and run the production server:
npm run build npm start
Visit http://localhost:3003 to run the app.
Dev server runs on port 3000 by default, while the production server runs on port 3003.
- linux/amd64 (x86_64)
- linux/arm64 (Apple Silicon)
- linux/arm/v7 (Raspberry Pi)
- Real-time sentence by sentence text-to-speech via OpenAI API TTS, Kokoro FastAPI TTS, or others compatible with the
/v1/audio/speech
endpoint - IndexedDB-powered local storage
- Synchronized text highlighting during playback (using string similarity for best match on the page)
- Configurable playback speed and voice options, which checks
/v1/audio/voices
for available voices - Click-to-skip on the same page for navigation
- Responsive design with light, dark, and system themes
- All configuration settings saved in IndexedDB
- Framework: Next.js (React)
- Storage: IndexedDB (in broswer db store)
- PDF Processing:
- UI Components:
- Headless UI
- Tailwind CSS
- TTS Integration: anything you want
src/
├── app/ // Next.js app router
├── components/ // Reusable UI components
├── contexts/ // State management contexts
└── services/ // Utility functions & integrations
Contributions are welcome! Fork the repository and submit a pull request with your changes. For significant alterations, please open an issue first.
This project is licensed under the MIT License.
- react-pdf
- Kokoro-82M for text-to-speech
- Kokoro-FastAPI for the API wrapper
If you encounter issues or have suggestions, please open an issue on GitHub.