HearPoint

A Windows accessibility toolkit that enables users with visual or cognitive disabilities to select screen regions and have the text read aloud using text-to-speech technology.

Features

Screen Region Selection: Capture text from any area on your screen using mouse selection
OCR (Optical Character Recognition): Extract text from images using advanced OCR engines
Text-to-Speech: High-quality voice synthesis with multiple TTS engines
Translation Support: Optional translation of captured text before speech synthesis
Global Shortcuts: Customizable keyboard shortcuts for quick access
Settings Management: Comprehensive configuration for all features
Multi-language Support: Localization and OCR language selection

Technology Stack

Backend: Rust with Tauri 2
Frontend: React 19 + TypeScript + Vite
UI Framework: Tailwind CSS + shadcn/ui components
Database: SQLite with SQLx
Package Manager: Bun
Platform: Windows only

Prerequisites

Before you begin, ensure you have the following installed:

Node.js (v18 or later) or Bun
Rust (latest stable)
Visual Studio Build Tools (for Windows development)

Installation

Clone the repository:

git clone https://github.com/your-username/hearpoint.git
cd hearpoint

Install frontend dependencies:
```
bun install
# or
npm install
```
Install Rust dependencies and build the project:
```
bun tauri build
# or
npm run tauri build
```

The built application will be available in src-tauri/target/release/.

Development

bun tauri dev

Architecture

Core Pipeline (v0.1.0)

Input Capture: Low-level mouse hooks detect region selection
Screen Capture: DXGI captures the selected screen area
OCR Processing: Text extraction using configurable OCR engines
Translation (optional): Text translation with caching
TTS Synthesis: Audio generation using various TTS engines
Audio Output: Playback through system audio

License

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details.

Roadmap

v0.1.0: Core screen capture → OCR → TTS pipeline ✅
v0.1.1: Translation caching and optimization ✅
v0.1.2: On-demand service downloads ✅
v0.2.0: Voice command recognition (ASR)
v0.3.0: Game template matching for automated capture
v1.0: Advanced voice commands and accessibility features

Name		Name	Last commit message	Last commit date
Latest commit History 161 Commits
public		public
resources		resources
src-tauri		src-tauri
src		src
.gitignore		.gitignore
.prettierignore		.prettierignore
.prettierrc		.prettierrc
LICENSE		LICENSE
README.md		README.md
bun.lock		bun.lock
components.json		components.json
eslint.config.js		eslint.config.js
index.html		index.html
package.json		package.json
tsconfig.app.json		tsconfig.app.json
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HearPoint

Features

Technology Stack

Prerequisites

Installation

Development

Architecture

Core Pipeline (v0.1.0)

License

Roadmap

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

HearPoint

Features

Technology Stack

Prerequisites

Installation

Development

Architecture

Core Pipeline (v0.1.0)

License

Roadmap

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages