Smart text chunker for LLM preprocessing (sections → paragraphs → sentences → hard splits).
-
Updated
Dec 7, 2025 - Python
Smart text chunker for LLM preprocessing (sections → paragraphs → sentences → hard splits).
A modular Python tool that loads messy CSV/Excel files, cleans them automatically, generates analytical statistics, and produces a polished PDF report. Includes a CLI interface, advanced cleaning engine, and portfolio-grade architecture.
A fast PyQt‑based image annotation tool with customizable hotkeys, per‑folder labels, CSV export, and “next untagged” navigation — ideal for prepping ML training datasets.
Automatic caching for LLM API responses (OpenAI, Gemini, Anthropic) using a lightweight Python library.
Add a description, image, and links to the machine-learning-tools topic page so that developers can more easily learn about it.
To associate your repository with the machine-learning-tools topic, visit your repo's landing page and select "manage topics."