PyMuPDF
RepoPyMuPDF is an open-source Python binding to the MuPDF engine that allows developers to load, render, search, and extract text or layout information from PDF, XPS, and other document formats. It is widely used in data-extraction and AI/RAG pipelines that need fast, programmatic access to document content and geometry.