Pinned Loading
-
Agent-Memory-Playground
Agent-Memory-Playground PublicA playground for agent memory designed for comparing how AI agents remember and forget. Run two agents side by side, each using a different memory strategy: sequential, sliding window, summarizati…
Python
-
llm-mysql-eval
llm-mysql-eval PublicSystematically evaluate text-to-SQL systems for MySQL with automated validation and LLM-based semantic scoring.
Jupyter Notebook
-
blamechain
blamechain PublicBlamechain is a CLI tool that turns messy Git history into clear narrative. It summarizes commits, extracts TODOs, tracks ownership drift, and maps file evolution - so devs can understand not just …
JavaScript
-
RAG-hmo
RAG-hmo PublicA healthcare RAG app that builds an HMO profile, retrieves relevant medical documents, and generates personalized, bilingual answers to user questions
Python
-
LucyDetect
LucyDetect PublicLucyDetect is an LLM Drift Analyzer designed to help researchers track AI model responses over time and detect inconsistencies, including hallucinations. It stores past responses, calculates drift …
Python
-
If the problem persists, check the GitHub status page or contact support.