You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm looking to extract the layout of a PDF document. By layout, I mean the name of each section, sub section, etc. There is a tool that acts as a PDF viewer called "Google Scholar PDF Reader" which displays the headers and subheaders of a PDF, and I am wondering if it is possible to extract using PyMuPDF. A screenshot of how the headers are extracted is attached.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I'm looking to extract the layout of a PDF document. By layout, I mean the name of each section, sub section, etc. There is a tool that acts as a PDF viewer called "Google Scholar PDF Reader" which displays the headers and subheaders of a PDF, and I am wondering if it is possible to extract using PyMuPDF. A screenshot of how the headers are extracted is attached.
Thanks in advance!
Beta Was this translation helpful? Give feedback.
All reactions