balaboom123/Sign-Language-Preprocessing: preprocessing for the How2Sign and YouTube-ASL datasets, including download and MediaPipe processing.

A config-driven, modular pipeline for preprocessing American Sign Language (ASL) datasets. Supports YouTube-ASL and How2Sign with two landmark extractors (MediaPipe Holistic and MMPose RTMPose3D) and two output modes (pose landmarks and video clips).

✨ Key Features

📝 Config-Driven

YAML configs with base inheritance and CLI overrides

🦴 Two Extractors

MediaPipe Holistic (553 keypoints) and MMPose RTMPose3D (133 keypoints)

🎬 Two Pipeline Modes

pose (landmarks) and video (clip extraction)

🧩 Registry Architecture

Add datasets, processors, and extractors via decorators

⚡ Parallel Processing

Multi-worker extraction, normalization, and clipping

📦 WebDataset Output

Sharded tar archives for efficient training data loading
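
The decorator-based registry mentioned under Registry Architecture can be pictured with a small sketch. This is only an illustration of the general pattern, not the project's actual code: the names `EXTRACTORS`, `register_extractor`, `build_extractor`, and `DummyExtractor` are all hypothetical.

```python
from typing import Callable, Dict, List, Tuple, Type

# Hypothetical registry: maps a string name to an extractor class.
EXTRACTORS: Dict[str, Type] = {}


def register_extractor(name: str) -> Callable[[Type], Type]:
    """Return a class decorator that stores the class under `name`."""
    def decorator(cls: Type) -> Type:
        if name in EXTRACTORS:
            raise KeyError(f"extractor {name!r} is already registered")
        EXTRACTORS[name] = cls
        return cls
    return decorator


@register_extractor("dummy")
class DummyExtractor:
    """Placeholder extractor that returns zero-valued keypoints."""

    def __init__(self, num_keypoints: int = 4) -> None:
        self.num_keypoints = num_keypoints

    def extract(self, num_frames: int) -> List[List[Tuple[float, float, float]]]:
        # One (x, y, z) triple per keypoint, repeated for every frame.
        return [[(0.0, 0.0, 0.0)] * self.num_keypoints for _ in range(num_frames)]


def build_extractor(name: str, **kwargs):
    """Look up a registered class by name and instantiate it."""
    if name not in EXTRACTORS:
        raise KeyError(f"unknown extractor {name!r}; known: {sorted(EXTRACTORS)}")
    return EXTRACTORS[name](**kwargs)
```

With a pattern like this, a config file only needs to carry a string such as `"dummy"`; adding a new extractor means defining a decorated class, with no change to the code that looks it up.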

📖 New? See the Installation Guide to get started.

Installation

git clone https://github.com/balaboom123/Sign-Language-Preprocessing.git
cd Sign-Language-Preprocessing
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Optional: MMPose (GPU required)

MediaPipe works on CPU out of the box. MMPose requires a CUDA-capable GPU and additional dependencies -- see the Installation Guide for full setup instructions.

Quick Start

# Download YouTube-ASL videos, extract MediaPipe landmarks, normalize, and package into WebDataset shards
python -m sign_prep configs/youtube_asl/pose_mediapipe.yaml

# Extract MMPose landmarks from pre-downloaded How2Sign data (CUDA required)
python -m sign_prep configs/how2sign/pose_mmpose.yaml

# Override any config value from the command line (e.g. more workers, stop after extraction)
python -m sign_prep configs/youtube_asl/pose_mediapipe.yaml \
  --override processing.max_workers=8 pipeline.stop_at=extract
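
To make the idea of base inheritance plus dotted `--override` values concrete, here is a minimal sketch of how such layering could work. It uses plain dictionaries in place of parsed YAML, and the helpers `deep_merge`, `parse_scalar`, and `apply_overrides` are hypothetical names chosen for illustration; the project's real loader may behave differently.

```python
import copy
from typing import Any, Dict, List


def deep_merge(base: Dict[str, Any], child: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of `base` with `child` layered on top, merging nested dicts."""
    merged = copy.deepcopy(base)
    for key, value in child.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = copy.deepcopy(value)
    return merged


def parse_scalar(text: str) -> Any:
    """Interpret an override value as bool, int, float, or plain string."""
    lowered = text.lower()
    if lowered in ("true", "false"):
        return lowered == "true"
    for cast in (int, float):
        try:
            return cast(text)
        except ValueError:
            pass
    return text


def apply_overrides(config: Dict[str, Any], overrides: List[str]) -> Dict[str, Any]:
    """Apply `section.key=value` strings to a copy of `config`."""
    result = copy.deepcopy(config)
    for item in overrides:
        dotted_key, sep, raw = item.partition("=")
        if not sep:
            raise ValueError(f"override {item!r} is not of the form key=value")
        *parents, leaf = dotted_key.split(".")
        node = result
        for part in parents:
            node = node.setdefault(part, {})  # create missing sections on the way down
        node[leaf] = parse_scalar(raw)
    return result


# Assumed example values; only the key names come from the command above.
base = {"processing": {"max_workers": 4}, "pipeline": {"stop_at": None}}
child = {"processing": {"max_workers": 2}}
config = apply_overrides(
    deep_merge(base, child),
    ["processing.max_workers=8", "pipeline.stop_at=extract"],
)
```

In this sketch the precedence is base config, then the inheriting config, then command-line overrides, so the last layer applied wins.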

Output

Both modes produce WebDataset tar shards for efficient training data loading. See Pipeline Stages for detailed output formats and data shapes.
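
As a rough picture of what a sharded tar output involves, the sketch below writes samples into numbered tar files using only the standard library. It follows the general WebDataset convention that all files belonging to one sample share a basename and differ by extension. The shard naming, the `json` and `txt` field names, and the `write_shards` helper are assumptions for illustration, not the project's actual output format.

```python
import io
import os
import tarfile
from typing import Dict, Iterable, List, Tuple


def _add_bytes(tar: tarfile.TarFile, name: str, payload: bytes) -> None:
    """Add an in-memory payload to the archive under `name`."""
    info = tarfile.TarInfo(name=name)
    info.size = len(payload)
    tar.addfile(info, io.BytesIO(payload))


def write_shards(
    samples: Iterable[Tuple[str, Dict[str, bytes]]],
    out_dir: str,
    samples_per_shard: int = 2,
) -> List[str]:
    """Write (key, {extension: bytes}) samples into numbered tar shards."""
    paths: List[str] = []
    tar = None
    for index, (key, fields) in enumerate(samples):
        if index % samples_per_shard == 0:
            # Start a new shard every `samples_per_shard` samples.
            if tar is not None:
                tar.close()
            shard_id = index // samples_per_shard
            path = os.path.join(out_dir, f"shard-{shard_id:06d}.tar")
            tar = tarfile.open(path, "w")
            paths.append(path)
        for ext, payload in fields.items():
            # Files of one sample share the key and differ only by extension.
            _add_bytes(tar, f"{key}.{ext}", payload)
    if tar is not None:
        tar.close()
    return paths
```

Keeping each sample's files adjacent inside a shard lets a loader stream the archive sequentially and group entries by key without random access.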

Supported Datasets

Dataset	Venue	Description	License
YouTube-ASL	NeurIPS 2023	11,000+ videos, 73,000+ segments -- open-domain ASL-English parallel corpus	Apache-2.0
How2Sign	CVPR 2021	80+ hours of instructional ASL in a controlled studio environment	CC BY-NC 4.0

For paper-aligned preprocessing methodology, see Research-Aligned Preprocessing.

Documentation

Installation Guide -- base setup and MMPose GPU dependencies
Architecture -- system design, registry, pipeline flow
Configuration -- full config reference, inheritance, CLI overrides
Pipeline Stages -- all 6 processing stages
Datasets -- YouTube-ASL vs How2Sign setup
Research-Aligned Preprocessing -- paper-aligned preprocessing notes

License

The MIT license in this repository applies to the code and documentation in this project. Use of external datasets, research artifacts, and upstream repos referenced above must comply with their original licenses and usage terms.

MIT -- see LICENSE.

