Client-SideInteractive
Document Processing Pipeline
Watch documents flow through OCR, parsing, chunking, and embedding stages in real time. Click "Run Pipeline Demo" to see each processing stage animate with realistic timing.
Document Upload
Drop a document to start the pipeline
Supports PDF, DOCX, TXT, images (OCR)
sample_research_paper.pdf2.4 MB12 pages
Pipeline Stages
1/6
pendingIngestion
2/6
pendingOCR / Text Extraction
3/6
pendingParsing
4/6
pendingChunking
5/6
pendingEmbedding Generation
6/6
pendingVector Storage
1/6
pendingIngestion
2/6
pendingOCR / Text Extraction
3/6
pendingParsing
4/6
pendingChunking
5/6
pendingEmbedding Generation
6/6
pendingVector Storage
Processing Log
$ awaiting pipeline start...
Tesseract OCRLangChain SplittersSentence TransformersHNSW IndexVector StoreDocument AI