transcribe.dev Documentation

Welcome to the transcribe.dev documentation. This directory contains all technical documentation for the transcribe.dev voice-to-text platform.

Product Overview

transcribe.dev is a system-wide voice-to-text dictation tool with AI enhancement:

Desktop Dictation App

Wispr Flow Competitor - System-wide voice-to-text dictation with AI cleanup

  • Direction: Speech → Text (STT + AI enhancement)
  • Platform: Desktop (macOS first, Windows/Linux planned)
  • Tech: Electron, Deepgram, Claude Haiku, Convex
  • Status: Active Development

PRD: transcribe-dev-bootstrap-PRD.md

Phase 2: Voice Synthesis API (FUTURE)

ElevenLabs Competitor - Developer-first text-to-speech API

  • Direction: Text → Speech (TTS)
  • Platform: Cloud API
  • Tech: Neural TTS, FastAPI, GPU infrastructure
  • Status: Planning (begins after Phase 1)

Future docs: future/voice-synthesis/

Documentation Structure

docs/
├── README.md                         # This file
├── transcribe-dev-bootstrap-PRD.md   # PRD (PRIORITY)
├── TENETS.md                         # Core development principles
├── architecture/
│   ├── PROJECT_STRUCTURE.md      # Monorepo structure
│   └── PHASE1_IMPLEMENTATION.md  # Phase 1 technical implementation
├── authentication/
│   └── README.md                 # Auth overview (WorkOS AuthKit)
├── development/
│   ├── GIT_PREFERENCES.md        # Git workflow
│   ├── TYPESCRIPT_BEST_PRACTICES.md
│   ├── SHADCN.md                 # UI component guide
│   └── UI_STANDARDS.md           # UI/UX guidelines
└── future/
    └── voice-synthesis/          # Phase 2 documentation

Quick Start

For Development

  1. Read the PRD - Full product specification
  2. Review PHASE1_IMPLEMENTATION.md - Technical implementation details
  3. Review TENETS.md - Development principles
  4. Check PROJECT_STRUCTURE.md - Monorepo layout
  5. Follow GIT_PREFERENCES.md - Git workflow

Key Technologies (Phase 1)

ComponentTechnologyPurpose
Desktop AppElectron + ReactCross-platform desktop UI
STT EngineDeepgram (cloud) / whisper.cpp (local)Speech-to-text
AI CleanupClaude HaikuGrammar, punctuation, tone
Text Injectionnut.jsType into active application
BackendConvexReal-time sync, user data
AuthWorkOS AuthKitUser authentication

Development Guidelines

Must-Read Documents

  1. TENETS.md - Core principles guiding all decisions
  2. GIT_PREFERENCES.md - Commit workflow (never push without permission)
  3. TYPESCRIPT_BEST_PRACTICES.md - TypeScript guidelines

Code Quality

  • TypeScript strict mode always
  • No any types without justification
  • Test compilation locally before deployment
  • Follow UI standards for consistent UX

Monorepo Structure

transcribe-dev/
├── apps/
│   ├── transcribe/     # Web app / Dashboard
│   ├── desktop/        # Electron dictation app
│   └── api/            # Hono API (Cloudflare Workers)
├── convex/             # Shared Convex backend
├── packages/
│   ├── ui/             # ShadCN UI components
│   ├── shared/         # Shared types and constants
│   └── ...
└── docs/               # This documentation

Contributing

When adding documentation:

  1. Phase 1 docs go in the appropriate existing folder
  2. Phase 2 docs go in future/voice-synthesis/
  3. Use clear, descriptive filenames (CAPS_WITH_UNDERSCORES.md)
  4. Link between related documents
  5. Keep docs close to the code they describe

Current Focus: Building the desktop dictation app. All development effort should prioritize Phase 1 features.

On this page