⭐handwriting = can extract handwritten text
⭐table = can extract tabular data into markdown table
⭐scanned = supports OCR to extract text from scanned image
⭐VLM = Just a Vision Language model, requires prompt
⭐handwriting = can extract handwritten text
⭐table = can extract tabular data into markdown table
⭐scanned = supports OCR to extract text from scanned image
⭐VLM = Just a Vision Language model, requires prompt
⭐equations = can detect and extract mathematical equations as LaTeX
⭐equations = can detect and extract mathematical equations as LaTeX
⭐opensource = can be self-hosted; does not rely on proprietary APIs or cloud services.
⭐images = can extract images embedded in the PDF and optionally include them in the markdown
⭐opensource = can be self-hosted; does not rely on proprietary APIs or cloud services.
⭐images = can extract images embedded in the PDF and optionally include them in the markdown