Hi, I am doing a very tedious manual analysis of PDF documents to list each text part with its corresponding font attributes like this:
La salutiamo cordialmente,
blablabla
variousnames
,[969,1269],Font: Univers-Medium (Embedded),Type: TrueType,Encoding: WinAnsiEncoding,Object Number: 18, Global Object ID: 0, Font Size: 10.0 pt, Horizontal Scaling: 100%, Baseline Offset: 0.0 pt
Orari di apertura di blablabla
,[1270,1305],Font: Univers-Bold (Embedded),Type: TrueType,Encoding: WinAnsiEncoding,Object Number: 19, Global Object ID: 0, Font Size: 9.0 pt, Horizontal Scaling: 100%, Baseline Offset: 0.0 pt
do you know a tool that can produce a similar output that is a list of text+styleattributes in any format?
Or can this be achieved with a plugin? Possibly adapting an existing one?
Any plugin to extract font styles from PDF text?
Moderators: Hacker, petermad, Stefan2, white
-
- Junior Member
- Posts: 91
- Joined: 2006-11-07, 16:36 UTC
- Location: Trieste, Italy
- Contact: