Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a full-text search/eDiscovery/information governance capable demonstration application.
-
Updated
May 28, 2024
Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a full-text search/eDiscovery/information governance capable demonstration application.
RAG-Ingest: A tool for converting PDFs to markdown and indexing them for enhanced Retrieval Augmented Generation (RAG) capabilities.
Self-hosted RAG engine for AI coding assistants. Ingests technical docs & code repositories locally with structure-aware chunking. Serves grounded context via MCP to prevent hallucinations in software development workflows.
A simple RAG toolkit.
Production-grade RAG backend for document ingestion and semantic retrieval using embeddings and Pinecone.
Production-grade RAG chatbot with a FastAPI + LangGraph backend (Pinecone vector search + Groq LLM + Tavily web fallback) and a Streamlit chat UI, secured via API key and observable in LangSmith.
An implementation of the GraphRAG pipeline (based on the 2024 paper "From Local to Global" by Edge et al.) for query-focused summarization of large text corpora.
Self-hosted RAG prototype to ingest PDFs/HTML and chat with them via a local UI
Async document watcher that keeps your RAG index hot. Automatically ingests new or changed documents into a live RAG pipeline with built-in observability.
AI-powered RAG assistant for parents to get instant, context-aware answers on Brainwonders’ career counseling programs, pricing, and services. Built with Streamlit, LangChain, ChromaDB, and Google Gemma LLM for fast, multi-document retrieval and conversational Q&A.
An AI Analytics Dashboard for research labs analytics, collaboration, and email workflow using React and FastAPI.
Store millions of text chunks inside ultra-compact MP4 files, index them with local embeddings, and retrieve answers instantly for fully offline RAG with any LLM.
Agentic RAG Chatbot using multi-agent architecture and Streamlit. Ingests PDFs, DOCX, PPTX, CSV, TXT, and Markdown files to provide contextually accurate answers with a persistent knowledge base. Supports multi-turn conversations, source citations, and dynamic document uploads.
📊 Streamline query-focused summarization by constructing knowledge graphs and extracting insights from document corpora with the GraphRAG pipeline.
🗂️ Build a knowledge graph for global query-focused summarization from document corpora using the GraphRAG pipeline, enhancing information synthesis.
ScriptumAI is an advanced Retrieval-Augmented Generation platform designed for document ingestion and query processing.
Add a description, image, and links to the document-ingestion topic page so that developers can more easily learn about it.
To associate your repository with the document-ingestion topic, visit your repo's landing page and select "manage topics."