🤟 SignHero

A real-time American Sign Language (ASL) learning platform combining rhythm-based gaming with AI-powered hand sign recognition.

🌟 Overview

SignHero is a full-stack application that teaches ASL fingerspelling through interactive gameplay. The system uses a webcam to detect hand signs in real-time via a trained machine learning model, then challenges players to sign along to beatmaps synced with music—like Guitar Hero, but with sign language.

Component	Description
Frontend Game (`asl/`)	Next.js 15 web app with rhythm game modes, visual effects, and real-time scoring
ML Backend (`Base test/`)	PyTorch model (MobileNetV2) trained on ASL alphabet data
API Server (`api_server_http.py`)	FastAPI server for real-time sign prediction via webcam frames

🎮 Features

Game Modes

Song Game — Rhythm-based gameplay with scrolling note highway and combo scoring
Training Mode — Step-by-step practice with visual hand pose hints
Testing Mode — Timed challenges with accuracy tracking
Whack-A-Sign — Arcade-style reflex game

AI Sign Detection

Real-time webcam analysis using MediaPipe hand tracking
MobileNetV2 CNN for letter classification (A-Z)
~30-50ms inference latency on localhost

Visual & Audio

Synthwave aesthetic with neon effects
Particle bursts, screen flash, streak glow
Sound effects for hits/misses

🏗️ Architecture

┌─────────────────────────────────────────────────────────┐
│                    User's Browser                        │
│  ┌─────────────────────────────────────────────────────┐│
│  │ Next.js Game (asl/)                                 ││
│  │  • GameCanvas • NoteHighway • WebcamFeed            ││
│  │  • useGameLoop • useSignDetection • useWebcam       ││
│  └────────────────────────┬────────────────────────────┘│
└───────────────────────────┼─────────────────────────────┘
                            │ HTTP POST /predict_frame
                            │ (JPEG + timestamp)
┌───────────────────────────▼─────────────────────────────┐
│        FastAPI Server (api_server_http.py)              │
│  ┌────────────────────────────────────────────────────┐ │
│  │ ASLPredictor                                       │ │
│  │  1. Decode JPEG (cv2)                              │ │
│  │  2. Hand Detection (MediaPipe)                     │ │
│  │  3. Feature Extraction (landmark mask)             │ │
│  │  4. CNN Inference (PyTorch MobileNetV2)            │ │
│  │  5. Return {letter, confidence, handDetected}      │ │
│  └────────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────┘

See ARCHITECTURE.md for detailed system diagrams.

📁 Project Structure

ASL-Fun-Training/
│
├── asl/                              # 🎮 Next.js Frontend Application
│   ├── src/
│   │   ├── app/                      # App Router pages (game, training, testing)
│   │   ├── components/game/          # GameCanvas, NoteHighway, effects
│   │   ├── hooks/                    # useGameLoop, useSignDetection, useWebcam
│   │   └── lib/                      # Scoring, beatmaps, utilities
│   └── README.md                     # Frontend-specific docs
│
├── Base test/Sign-Language-Recognition/  # 🧠 ML Training & Model
│   ├── app/                          # API & frame extraction scripts
│   ├── model/                        # CNN architecture (MobileNetV2)
│   ├── train/                        # Training scripts
│   ├── data/weights/                 # Trained model weights (.pth)
│   └── utils/                        # Label mapping, model loading
│
├── api_server_http.py                # 🚀 FastAPI prediction server
├── api_server_mock.py                # Mock server for testing
├── start-servers.sh                  # One-click startup (Unix)
├── start-servers.bat                 # One-click startup (Windows)
│
├── models/                           # Additional model storage
├── training/                         # Training data/scripts
├── dataset/                          # Raw dataset
├── asset_generation/                 # Hand sign SVG assets
│
├── ARCHITECTURE.md                   # System architecture diagrams
├── INTEGRATION_GUIDE.md              # Full integration documentation
├── QUICKSTART_INTEGRATION.md         # Quick setup guide
└── SETUP.md                          # Environment setup

🚀 Quick Start

Prerequisites

Python 3.10+ with pip
Node.js 20+ with pnpm
MongoDB instance
Webcam for sign detection

1. Start the ML Server

# Navigate to project root
cd ASL-Fun-Training

# Create Python virtual environment
python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

# Install dependencies
pip install torch torchvision opencv-python mediapipe fastapi uvicorn python-multipart

# Start the API server
python api_server_http.py
# Server runs at http://localhost:8000

2. Start the Frontend

# In a new terminal
cd asl

# Install dependencies
pnpm install

# Set up environment
cp .env.example .env
# Edit .env with MongoDB URI

# Start dev server
pnpm dev
# App runs at http://localhost:3000

One-Click Startup

# Unix/Mac
./start-servers.sh

# Windows
start-servers.bat

🛠️ Tech Stack

Frontend (`asl/`)

Tech	Purpose
Next.js 15	React framework (App Router, Turbopack)
TypeScript	Type safety
Tailwind CSS 4	Styling
Framer Motion	Animations
tRPC	Type-safe API layer
Prisma + MongoDB	Database

Backend (`Base test/` + `api_server_http.py`)

Tech	Purpose
FastAPI	HTTP API server
PyTorch	Deep learning framework
MobileNetV2	CNN architecture for classification
MediaPipe	Hand landmark detection
OpenCV	Image processing

🧠 Model Training

The sign detection model is a MobileNetV2 trained on hand landmark features:

Input: 224×224 RGB image (hand feature mask)
  ↓
MobileNetV2 CNN
  ↓
Output: 26 classes (A-Z)

Training Pipeline:

Webcam captures hand images
MediaPipe extracts 21 hand landmarks
Landmarks drawn as feature mask on black background
Both original and mirrored images processed
Max confidence from both used for prediction

Model weights: Base test/Sign-Language-Recognition/data/weights/asl_crop_v4_1_mobilenet_weights.pth

📚 Documentation

Document	Description
ARCHITECTURE.md	System diagrams, data flow, timing model
INTEGRATION_GUIDE.md	Full integration documentation
QUICKSTART_INTEGRATION.md	5-minute setup guide
SETUP.md	Environment configuration
asl/README.md	Frontend-specific documentation

🤝 Contributing

Fork the repository
Create a feature branch (git checkout -b feature/amazing)
Commit changes (git commit -m 'Add amazing feature')
Push to branch (git push origin feature/amazing)
Open a Pull Request

📄 License

Educational project for ASL learning.

🙏 Acknowledgments

T3 Stack for the Next.js template
MediaPipe for hand tracking
ASL fingerspelling resources for training data

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤟 SignHero

🌟 Overview

🎮 Features

Game Modes

AI Sign Detection

Visual & Audio

🏗️ Architecture

📁 Project Structure

🚀 Quick Start

Prerequisites

1. Start the ML Server

2. Start the Frontend

One-Click Startup

🛠️ Tech Stack

Frontend (`asl/`)

Backend (`Base test/` + `api_server_http.py`)

🧠 Model Training

📚 Documentation

🤝 Contributing

📄 License

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 101 Commits
.claude		.claude
Base test		Base test
asl		asl
asset_generation		asset_generation
assets		assets
dataset		dataset
models		models
training		training
webscraper		webscraper
.DS_Store		.DS_Store
ARCHITECTURE.md		ARCHITECTURE.md
INTEGRATION_GUIDE.md		INTEGRATION_GUIDE.md
QUICKSTART_INTEGRATION.md		QUICKSTART_INTEGRATION.md
README.md		README.md
SETUP.md		SETUP.md
api_server.py		api_server.py
api_server_http.py		api_server_http.py
api_server_mock.py		api_server_mock.py
signhero.png		signhero.png
start-servers.bat		start-servers.bat
start-servers.sh		start-servers.sh

Folders and files

Latest commit

History

Repository files navigation

🤟 SignHero

🌟 Overview

🎮 Features

Game Modes

AI Sign Detection

Visual & Audio

🏗️ Architecture

📁 Project Structure

🚀 Quick Start

Prerequisites

1. Start the ML Server

2. Start the Frontend

One-Click Startup

🛠️ Tech Stack

Frontend (asl/)

Backend (Base test/ + api_server_http.py)

🧠 Model Training

📚 Documentation

🤝 Contributing

📄 License

🙏 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Frontend (`asl/`)

Backend (`Base test/` + `api_server_http.py`)

Packages