Welcome to Lite Semantic Search

AI-Powered Semantic Search for Your Documents

What is Semantic Search?

Search your documents by meaning, not just keywords. Our local semantic search engine uses AI embeddings to understand context and find relevant information even when exact words don't match. Perfect for researchers, writers, developers, and anyone managing large document collections.

Unlike traditional keyword search, semantic search understands the meaning and context of your query, returning results that are conceptually related even if they don't contain the exact search terms.

Key Features

Semantic Search

Find documents by meaning using vector embeddings and AI, not just keyword matching. Search for "happy" and find results containing "joyful" or "delighted".

100% Private & Secure

All data stays in your browser using IndexedDB. No server, no uploads, complete privacy. Your documents never leave your device.

Local AI Processing

Uses Transformers.js for in-browser embeddings with the all-MiniLM-L12-v2 model (384 dimensions). No external API calls required.

Fast Vector Search

Uses HNSW indexing for fast approximate nearest neighbor search across your document chunks.

How to Use

1. Add Documents
Upload .txt, .md, or .pdf files, or paste text directly. Click the "Add Document" button to get started.
2. Automatic Processing
Documents are automatically chunked into smaller pieces and converted to vector embeddings using AI. This happens locally in your browser.
3. Semantic Search
Type natural language queries to find relevant chunks. Ask questions or describe what you're looking for - no need for exact keywords.
4. Hybrid Search
Our system combines semantic similarity with keyword matching for the best results, giving you both conceptual relevance and exact matches.
5. Manage Documents
View all your documents in the document list and delete ones you no longer need. Your data is always under your control.

Perfect For

Academic Research

Find related papers, notes, and concepts across your research library. Perfect for literature reviews and connecting ideas.

Technical Documentation

Search technical docs, API references, and code documentation to find solutions faster.

Personal Knowledge Base

Build a personal knowledge base with notes, articles, and bookmarks. Your second brain for information retrieval.

Technology Stack

Vector Database

PGlite - PostgreSQL running entirely in your browser with the pgvector extension for vector similarity search. Stores data in IndexedDB for persistence.

Embeddings Model

Xenova/all-MiniLM-L12-v2 via Transformers.js - Generates 384-dimensional vectors representing the semantic meaning of text. Runs completely in-browser using WebAssembly.

Search Algorithm

HNSW (Hierarchical Navigable Small World) indexing for approximate nearest neighbor search. Provides fast and accurate similarity search.
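To make clear what an approximate index is approximating, here is a minimal sketch of exact nearest neighbor search: score every stored vector against the query and keep the best k. This is a brute-force scan, not HNSW, and the names `StoredVector`, `dot`, and `topK` are illustrative rather than taken from the app. It assumes the vectors are already L2-normalized, in which case the dot product equals cosine similarity.

```typescript
interface StoredVector {
  id: string;
  vector: number[]; // assumed L2-normalized, so dot product equals cosine similarity
}

// Dot product of two equal-length vectors.
function dot(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  return a.reduce((sum, x, i) => sum + x * (b[i] ?? 0), 0);
}

// Exact search: score every stored vector, sort descending, keep the best k.
// An HNSW index returns (approximately) the same answer without a full scan.
function topK(
  queryVector: number[],
  items: StoredVector[],
  k: number
): { id: string; score: number }[] {
  return items
    .map((item) => ({ id: item.id, score: dot(queryVector, item.vector) }))
    .sort((a, b) => b.score - a.score)
    .slice(0, k);
}

// Toy 2-dimensional unit vectors standing in for 384-dimensional embeddings.
const storedVectors: StoredVector[] = [
  { id: "a", vector: [1, 0] },
  { id: "b", vector: [0, 1] },
  { id: "c", vector: [0.6, 0.8] },
];

console.log(topK([0.8, 0.6], storedVectors, 2).map((r) => r.id)); // ["c", "a"]
```

The full scan costs time proportional to the number of stored chunks, which is the cost an approximate index is designed to avoid.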

Hybrid Search

Combines cosine similarity (semantic vector search) with PostgreSQL full-text search (keyword matching) for comprehensive results.
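One common way to combine the two signals is a weighted sum, sketched below. This page does not state how the app merges the scores, so the formula, the 0.7 weight, and the names `ScoredChunk`, `hybridScore`, and `rankHybrid` are all assumptions made for illustration. Both scores are assumed to be normalized to the range 0 to 1 before they are combined.

```typescript
interface ScoredChunk {
  id: string;
  semantic: number; // cosine similarity from the vector search, assumed in [0, 1]
  keyword: number; // full-text rank, assumed already normalized to [0, 1]
}

// Weighted sum of the two signals. The default weight of 0.7 is an
// illustrative value, not a setting taken from the app.
function hybridScore(chunk: ScoredChunk, semanticWeight: number = 0.7): number {
  return semanticWeight * chunk.semantic + (1 - semanticWeight) * chunk.keyword;
}

// Returns a new array sorted by hybrid score, best match first.
function rankHybrid(chunks: ScoredChunk[], semanticWeight: number = 0.7): ScoredChunk[] {
  return [...chunks].sort(
    (a, b) => hybridScore(b, semanticWeight) - hybridScore(a, semanticWeight)
  );
}

const candidates: ScoredChunk[] = [
  { id: "exact-term", semantic: 0.4, keyword: 1.0 },
  { id: "paraphrase", semantic: 0.9, keyword: 0.0 },
  { id: "unrelated", semantic: 0.1, keyword: 0.1 },
];

console.log(rankHybrid(candidates).map((c) => c.id));
// ["paraphrase", "exact-term", "unrelated"]
```

Lowering `semanticWeight` shifts the ranking toward exact keyword hits. With a weight of 0.3, the "exact-term" chunk moves ahead of the paraphrase.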

Text Processing

Smart chunking with overlap for better context preservation. Documents are split into chunks of roughly 300 characters, with a 50-character overlap between neighboring chunks to maintain semantic continuity.
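The overlap idea can be sketched with a fixed-width sliding window. This is a simplification: the app's "smart" chunking may also respect sentence or paragraph boundaries, which this sketch ignores. The function name `chunkText` is hypothetical, and the defaults are the 300 and 50 figures quoted above.

```typescript
// Split text into fixed-size character windows that overlap their neighbors.
// Each new window starts (chunkSize - overlap) characters after the previous one.
function chunkText(text: string, chunkSize: number = 300, overlap: number = 50): string[] {
  if (chunkSize <= 0 || overlap < 0 || overlap >= chunkSize) {
    throw new Error("need chunkSize > 0 and 0 <= overlap < chunkSize");
  }
  const step = chunkSize - overlap;
  const chunks: string[] = [];
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // this window reached the end
  }
  return chunks;
}

const sampleText = "a".repeat(700);
console.log(chunkText(sampleText).map((piece) => piece.length)); // [300, 300, 200]
```

Because consecutive windows share 50 characters, a sentence that straddles a chunk boundary still appears with part of its context in both neighboring chunks.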

PDF Support

PDF.js for parsing and extracting text from PDF documents directly in the browser.

🔒 Your Privacy Matters

This is a fully client-side application. All document processing, embeddings generation, and search happen locally in your browser. Your documents never leave your device.

No data is sent to any server. Your documents are stored in IndexedDB and persist only in your browser. You have complete control over your data - you can export, delete, or clear everything at any time.

We don't track you, collect analytics, or store any information about your searches or documents. This tool is built with privacy as a core principle.

Frequently Asked Questions

How is this different from Ctrl+F search?

Traditional keyword search (Ctrl+F) only finds exact matches. Semantic search understands meaning - if you search for "automobile accidents", it will also find content about "car crashes" or "vehicle collisions" even without those exact words.

Is my data really private?

Yes, 100%. Everything runs in your browser. Your documents are stored in IndexedDB (browser storage) and never sent to any server. You can verify this by checking your browser's network tab - no document data is transmitted.

How much storage do I have?

Storage limits vary by browser and usually scale with the free disk space on your device, so there is no single fixed figure. IndexedDB can use up to whatever quota your browser grants this site. The app will notify you if you're running low on storage.
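If you want to check the quota yourself, browsers expose it through the standard `navigator.storage.estimate()` call, which resolves to an object with optional `usage` and `quota` fields in bytes. The sketch below shows one way a low-storage check could be built on that object. The helper names and the 0.9 threshold are hypothetical, and this is not necessarily how the app decides when to notify you.

```typescript
// Same shape as the object returned by navigator.storage.estimate().
interface StorageEstimateLike {
  usage?: number; // bytes currently used by this site
  quota?: number; // bytes the browser is willing to grant this site
}

// Fraction of the quota in use, or null if the browser did not report it.
function usedFraction(estimate: StorageEstimateLike): number | null {
  if (estimate.usage === undefined || estimate.quota === undefined || estimate.quota === 0) {
    return null;
  }
  return estimate.usage / estimate.quota;
}

// warnThreshold = 0.9 is an illustrative value, not taken from the app.
function isLowOnStorage(estimate: StorageEstimateLike, warnThreshold: number = 0.9): boolean {
  const fraction = usedFraction(estimate);
  return fraction !== null && fraction >= warnThreshold;
}

// In a browser you would obtain the estimate like this:
//   const estimate = await navigator.storage.estimate();
//   if (isLowOnStorage(estimate)) { /* show a warning */ }
console.log(isLowOnStorage({ usage: 950, quota: 1000 })); // true
console.log(isLowOnStorage({ usage: 100, quota: 1000 })); // false
```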

What file formats are supported?

Currently supports .txt (plain text), .md (Markdown), and .pdf (PDF) files. You can also paste text directly into the app.

Can I use this offline?

After the initial load and model download, the app works offline. Your documents and search functionality will be available without an internet connection.

How accurate is semantic search?

The all-MiniLM-L12-v2 model was fine-tuned on over a billion sentence pairs and captures semantic similarity well for most general-purpose text. Results combine semantic similarity and keyword matching to improve accuracy.

About Vector Search & RAG

This tool implements the retrieval step of Retrieval Augmented Generation (RAG), using vector embeddings and similarity search. PGlite with the pgvector extension enables efficient storage and retrieval of high-dimensional embeddings.

By converting text into numerical vectors, we can measure semantic similarity using cosine distance. This enables powerful features like finding similar documents, clustering related content, and building intelligent search systems.
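As a concrete illustration, cosine similarity is the dot product of two vectors divided by the product of their lengths, and cosine distance is one minus that value (this is also what pgvector's `<=>` operator computes). The sketch below uses tiny hand-made vectors in place of real 384-dimensional embeddings, and the function names are illustrative.

```typescript
// Cosine similarity: 1 for the same direction, 0 for orthogonal, -1 for opposite.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  let dotProduct = 0;
  let squaredNormA = 0;
  let squaredNormB = 0;
  a.forEach((x, i) => {
    const y = b[i] ?? 0;
    dotProduct += x * y;
    squaredNormA += x * x;
    squaredNormB += y * y;
  });
  // A zero vector has no direction; returning 0 here is a convention of this sketch.
  if (squaredNormA === 0 || squaredNormB === 0) return 0;
  return dotProduct / (Math.sqrt(squaredNormA) * Math.sqrt(squaredNormB));
}

// Cosine distance: smaller means more similar.
function cosineDistance(a: number[], b: number[]): number {
  return 1 - cosineSimilarity(a, b);
}

console.log(cosineSimilarity([1, 0], [0, 1])); // 0 (orthogonal)
console.log(cosineDistance([1, 0], [-1, 0])); // 2 (opposite directions)
console.log(cosineSimilarity([1, 2, 3], [2, 4, 6]).toFixed(3)); // "1.000" (same direction)
```

Only the direction of a vector matters here, not its length, which is why `[1, 2, 3]` and `[2, 4, 6]` count as identical in meaning under this measure.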

Whether you're building a knowledge base, document management system, or research tool, semantic search provides a more natural and effective way to find information compared to traditional keyword-based approaches.

Ready to Get Started?

Start searching your documents with AI-powered semantic search. Add your first document and experience the power of vector embeddings.

Start Searching Now
