Is epstein-search completely free to use?

Yes, it's MIT-licensed open-source with no costs for core local search. Cloud LLMs require API keys, but Ollama or LM Studio run zero-cost locally.

How does epstein-search handle local LLMs?

It proxies through LiteLLM to Ollama (e.g., `ollama/llama3`) or LM Studio on localhost:1234. Set `OPENAI_API_BASE` in .env—no keys for pure local inference.

What document types does epstein-search cover?

Filters include court_filing, deposition, fbi_report, flight_log, financial, and other. Use `--doc-type deposition` or `--source FBI` to narrow queries.

Can epstein-search output JSON for scripting?

Yes, `epstein-search search "query" --json-output` dumps structured results. Pair with `--top-k 5` for pipelines.

Does epstein-search require an internet connection after setup?

No for searches—index is local. Cloud LLMs or initial `setup` download need it.

epstein-search: The Best Local RAG Tools for Investigative Researchers in 2026

epstein-search indexes 100K+ Epstein document chunks into a local vector database for semantic search and RAG queries via CLI, supporting free local LLMs without API keys.

epstein-search: Local-First Killer of Manual Document Dumps

epstein-search nukes sifting through raw Epstein court filings, FBI reports, and flight logs by PDF. It pre-chunks and embeds 100K+ public documents into a local vector index for instant semantic queries. Ditch keyword grep or paid discovery tools—this handles sworn depositions and financial wires offline.

Under the Hood: Pre-Built Vector Index + LiteLLM RAG

epstein-search ships a pyproject.toml-based Python CLI that downloads a pre-computed FAISS-like vector store (~1-2 min setup) from document splits by type (court_filing, deposition, fbi_report, flight_log, financial). Queries hit cosine similarity on Gemini Flash or Ollama Llama3 embeddings via LiteLLM proxy, which routes to local servers (Ollama, LM Studio on port 1234) or cloud (OpenAI, Anthropic). RAG mode fetches top-K chunks, stuffs into LLM prompt; search mode dumps raw snippets with --json-output or --source filters.

The Good & The Bad

Pros:

Zero API for core search—pure local vectors after epstein-search setup.
Swappable LLMs mid-chat via /model anthropic/claude-haiku or .env defaults.
Granular filters like --doc-type flight_log or --source FBI prune 100K chunks fast.
Interactive REPL with /topk 5, /ask, /search toggles sources on/off.
JSON output for scripting: epstein-search search "testimony" --json-output.
Plays with free local setups: Ollama Mistral or LM Studio without keys.

Cons:

Locked to Epstein docs only—no general corpus ingestion.
1-2GB index download bloats disk on setup.
Cloud LLMs need keys (GEMINI_API_KEY); local Ollama chews RAM for bigger models.
No custom embedding fine-tuning or multi-user indexing.
CLI-only—no GUI or web demo for quick peeks.

Quickstart

pip install epstein-search
epstein-search setup  # downloads/indexes 100K+ chunks in 1-2 min
epstein-search chat    # launches REPL for queries

setup builds the local vector store from pre-chunked Epstein files (court docs to flight logs). chat drops you into an interactive prompt—type "who's on the flight logs?" for top-10 semantic matches. Hit /model ollama/llama3 for free local inference or /quit to bail.

Who Should Use This (and Who Shouldn't)

Use it if: You're a solo investigator cross-referencing FBI reports with depositions offline; scripting RAG pipelines on niche corpora; testing LiteLLM routing without infra hassle.

Skip it if: You need general-purpose search beyond Epstein files; handling massive custom datasets (no upload API); preferring GUI over CLI for non-devs; running on low-RAM devices (Ollama Llama3 demands 8GB+).

Alternatives & When to Switch

Pick Haystack if building custom RAG on your own PDFs—supports Elasticsearch backends over fixed indexes. Go RAGFlow for self-hosted web UI on diverse docs, skipping CLI purity. Use LanceDB raw if you want vector DB primitives without Epstein bloat or LLM glue.

epstein-search: The Best Local RAG Tools for Investigative Researchers in 2026

epstein-search: Local-First Killer of Manual Document Dumps

Under the Hood: Pre-Built Vector Index + LiteLLM RAG

The Good & The Bad

Quickstart

Who Should Use This (and Who Shouldn't)

Alternatives & When to Switch

Frequently Asked Questions

You Might Also Like

yapsnap: Best Local Audio Transcription CLI for Developers in 2026

Claude Design Free: Best AI UI/UX Framework for React Devs in 2026

webchat2api: Best AI API Proxies for Developers in 2026