Help with viewing xml file
As I mentioned in another post, I miss the possibility of quickly reviewing the contents of OpenOffice files. The files themselves are archives, which I can open with Ctrl+PgDn, and the document contents always reside in a file in the archive called content.xml, which I can also view with Lister.
What I would like is (1) to combine the two steps into one, so that one can, for example, quickly page through the files in QuickView mode; and (2) to strip off most of the XML tags, so that I can review the text contents without having to search through the whole list of header tags and so on. In other words, something like viewing HTML files in Lister, only with slightly more information retained, since everything but the text disappears if I try that. Ideally, paragraphs should be separated, text defined as headings should be in bold, and some other style information might also be used (indented blockquotes, etc.), but apart from that, all I want to see is the text contents. (Since all the style information is there in the file, I assume it would be fairly straightforward to format the whole text quickly.)
The ieview plugin works, but since what it does is open an instance of the OpenOffice program, it is quite impractical - it is just as quick to open the file directly in the program itself.
I have only the vaguest experience in programming myself, but I would think that someone with such skills could quite easily make such a plugin, either from scratch or using some existing tool. I understand that there is an XML parser included with Windows XP - is that something one might use? Or some other parser?
I have read through the instructions for writing plugins, but I don't really understand much of them.
Forgive me if my wishes far exceed my capacity, but it is so vexing to see my MS Word files pop up in Lister in a split second and my OpenOffice files not. There must be more people than me who would find this a useful tool...
Is there anyone who would care to help?
Last edited by eyolf on 2005-08-28, 13:45 UTC, edited 1 time in total.
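For anyone who wants to experiment, the two steps described above (open the archive, then read content.xml and drop the tags) can be sketched in a few lines of Python. This is only a rough illustration, not a Lister plugin. The function name odt_paragraphs is made up for this example, and the sketch assumes an OpenDocument (.odt) file. Older OpenOffice 1.x .sxw files use a different XML namespace and would need a different TEXT_NS.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace of the OpenDocument "text" elements (assumes an .odt file;
# .sxw files from OpenOffice 1.x use a different namespace).
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"


def odt_paragraphs(source):
    """Return a list of (kind, text) tuples, one per heading or paragraph.

    `source` is a file path or a binary file-like object.
    `kind` is "h" for headings (text:h) and "p" for paragraphs (text:p).
    """
    # Step 1: the document is a zip archive; read content.xml from it.
    with zipfile.ZipFile(source) as archive:
        root = ET.fromstring(archive.read("content.xml"))

    # Step 2: walk the tree in document order and keep only the text.
    result = []
    for element in root.iter():
        if element.tag == f"{{{TEXT_NS}}}h":
            result.append(("h", "".join(element.itertext())))
        elif element.tag == f"{{{TEXT_NS}}}p":
            result.append(("p", "".join(element.itertext())))
    return result
```

Calling odt_paragraphs("test_doc_TC.odt") on a local copy of a document would give a list that a viewer could print with a blank line between entries, showing the "h" entries in bold. Known limitations of this sketch: tabs, line breaks and repeated spaces stored as separate elements are lost, and text inside a frame nested in a paragraph would appear twice.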
http://oestrem.com/test_doc_TC.odt
As I said, you will find the files content.xml and styles.xml within it.
I'm working on it. I've found some things that will help, but I don't understand them yet.
What I found uses regular expressions to convert content.xml from .odt files to HTML, and I'm allergic to regular expressions.
But I've cried for help and received some pointers. Be patient, and you might get something much better than you asked for.
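For comparison, here is a minimal sketch of the same kind of conversion done with an XML parser instead of regular expressions. It is not the tool mentioned above, just an illustration of the idea. The function name content_to_html is hypothetical, and the sketch assumes the OpenDocument text namespace and handles only headings and paragraphs.

```python
import html
import xml.etree.ElementTree as ET

# Namespace of the OpenDocument "text" elements (assumption: .odt input).
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"


def content_to_html(content_xml):
    """Convert the contents of content.xml into a bare-bones HTML fragment.

    Headings (text:h) become <h1>..<h6> according to their outline level;
    paragraphs (text:p) become <p>. All other markup is dropped.
    """
    root = ET.fromstring(content_xml)
    parts = []
    for element in root.iter():
        if element.tag == f"{{{TEXT_NS}}}h":
            # Clamp the level because HTML only defines h1 through h6.
            raw_level = element.get(f"{{{TEXT_NS}}}outline-level", "1")
            level = max(1, min(int(raw_level), 6))
            text = html.escape("".join(element.itertext()))
            parts.append(f"<h{level}>{text}</h{level}>")
        elif element.tag == f"{{{TEXT_NS}}}p":
            text = html.escape("".join(element.itertext()))
            parts.append(f"<p>{text}</p>")
    return "\n".join(parts)
```

The input would be the bytes read from content.xml inside the archive. Because the parser understands the nesting, there is no need to write patterns for every tag, and the text is escaped again before it goes into the HTML.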



Best regards,
Christian Tiberg
eyolf wrote: Hey! That's great! It figures that it would be a Swede who solves it!
LOL

Screamer (see sig) was also made by a Swede!
Get Radio? Sure! - the best Internet Radio Player ever! 

ctiberg wrote: Be patient, and you might get something much better than you asked for
Are you still working on it? I am just curious... and patient...