So I’m looking for a service I can run that can search the internal contents of multiple PDFs (multiple 1000+ page reference manuals) for a phrase or word, similar to Adobe Acrobat’s advanced search function.
Bonus points if I can control the scope of which documents it searches through some sort of interface.
For the Linux command line, there is pdfgrep. It is available, for example, in the official Debian repository.
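If you want to script it, here is a rough sketch of wrapping pdfgrep from Python. It assumes pdfgrep is installed and on your PATH; the function names and the `manuals` folder are made up for illustration. The search scope is simply whichever files or folders you pass in.

```python
import shutil
import subprocess


def build_pdfgrep_command(phrase, targets, ignore_case=True):
    """Build the argument list for a recursive pdfgrep search.

    `targets` is a list of PDF files and/or directories; it defines
    the scope of the search.
    """
    # --recursive descends into directories, --page-number prefixes
    # each match with the page it was found on.
    cmd = ["pdfgrep", "--recursive", "--page-number"]
    if ignore_case:
        cmd.append("--ignore-case")
    cmd.append(phrase)
    cmd.extend(str(t) for t in targets)
    return cmd


def search_pdfs(phrase, targets, ignore_case=True):
    """Run pdfgrep and return its output lines (empty list if nothing matched)."""
    if shutil.which("pdfgrep") is None:
        raise RuntimeError("pdfgrep is not installed or not on PATH")
    result = subprocess.run(
        build_pdfgrep_command(phrase, targets, ignore_case),
        capture_output=True,
        text=True,
        check=False,  # a non-zero exit is expected when there are no matches
    )
    return result.stdout.splitlines()


if __name__ == "__main__":
    # "manuals" is a hypothetical folder of reference manuals.
    for line in search_pdfs("torque spec", ["manuals"]):
        print(line)
```

To narrow the scope, pass only the specific PDFs or subfolders you care about instead of the whole folder.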
Look at the subreddit sidebar and find this: awesome-selfhosted, category document management, and more.
https://github.com/phiresky/ripgrep-all
Made by the same phiresky that’s been contributing incredible improvements to Lemmy.
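For anyone wanting to try it, a minimal sketch of calling ripgrep-all (the `rga` binary) from Python follows. It assumes `rga` is installed and on your PATH and relies on rga accepting the usual ripgrep flags; the function names and the `manuals` folder are hypothetical.

```python
import shutil
import subprocess


def build_rga_command(phrase, directory, only_pdfs=True):
    """Build the argument list for a case-insensitive rga search."""
    cmd = ["rga", "--ignore-case"]
    if only_pdfs:
        # --glob is a ripgrep flag; here it limits the search to PDF files.
        cmd.extend(["--glob", "*.pdf"])
    cmd.extend([phrase, str(directory)])
    return cmd


def search_with_rga(phrase, directory, only_pdfs=True):
    """Run rga and return its output lines (empty list if nothing matched)."""
    if shutil.which("rga") is None:
        raise RuntimeError("rga (ripgrep-all) is not installed or not on PATH")
    result = subprocess.run(
        build_rga_command(phrase, directory, only_pdfs),
        capture_output=True,
        text=True,
        check=False,  # a non-zero exit is expected when there are no matches
    )
    return result.stdout.splitlines()


if __name__ == "__main__":
    # "manuals" is a hypothetical folder of reference manuals.
    for line in search_with_rga("torque spec", "manuals"):
        print(line)
```

Pointing `directory` at a subfolder is one simple way to control which documents get searched.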
You should check out a free program called Seekfast.