On Tue, 2004-09-28 at 14:25 +0100, Peter McEvoy wrote:
> Hi,
> I've got over 2000 word documents I need to convert to html.
...
> Oowriter from openoffice does a marvellous job when I open the doc
> and manually save it as html, but it would seem far from trivial to
> script.
In oowriter New->AutoPilot->Document Converter-> there is a wizard for
converting mass amounts of documents from .doc to the .sxw format, the
source for the wizard is in
Tools->Macros->Macro... soffice->ImportWizard,
perhaps with a bit of digging and hacking you could be able to find the
bit which is presumably hardcoded to ".sxw" or "OpenOffice File Format"
and change it to .html.
If not, googling or searching the openoffice.org website for stuff to do
with the ImportWizard might throw up a "I've done this already" post.
Certainly if all quick fixes fail you can script OOo externally from
python (and java/c++ etc) to do the job, the brave can have a look at a
python example cgi example to convert a single document to PDF or
something. http://www.skynet.ie/~caolan/Fragments/ooo-cgi.html
C.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
Url : http://mail.linux.ie/pipermail/ilug/attachments/20040928/26562993/attachment.pgp
Maintained by the ILUG website team. The aim of Linux.ie is to
support and help commercial and private users of Linux in Ireland. You can
display ILUG news in your own webpages, read backend
information to find out how. Networking services kindly provided by HEAnet, server kindly donated by
Dell. Linux is a trademark of Linus Torvalds,
used with permission. No penguins were harmed in the production or maintenance
of this highly praised website. Looking for the
Indian Linux Users' Group? Try here. If you've read all this and aren't a lawyer: you should be!