ServiceTech - Technology in Service to People

The difference between technology being "in service" vs being a hinderance is often a small amount of missing information.

My Photo
Name:
Location: Seattle, Washington, United States

I'm an entrepreneur and consultant working at the convergence of leadership development, human potential, emergence and technology. I've worked with small startups and Fortune 50 companies on cutting edge projects with people who are passionate about making a difference in their organizations, industries, and communities.

Tuesday, September 06, 2005

MS Word HTML Clean Up

Many folks have discovered the ability of Microsoft Word to export to HTML. Using this feature, one would think you could grab the HTML source code from your favorite Word document and plop it into a web page or discussion forum or other web form that accepts HTML. The result should be nicely formatted text, right? Not so.

In order to provide "round-trip" support (the ability to export a Word doc, edit it in another application, then open it in Word again), Word embeds many non-standard tags when it exports HTML. The result is that a Word-generated HTML document will wreak havoc in many online forums and will not display properly in browsers other than Internet Explorer.

There are several ways to clean up Word-generated HTML. Here are a few:


  • In Word 2003 you can use the built-in functionality:

    Save As -> Web Page, Filtered (*.htm, *.html)


  • For Word 2000, Microsoft has a plug-in that adds similar functionality.


  • For Word 2004 for Mac, try this:

    Save As -> Format: Web Page (HTML) -> Click the "Save only display information into HTML" option

    If that doesn't work for your needs, try one of the options below.


  • For other versions of Word, or if you would like a different solution, try uploading your Word-generated HTML file to the Textism Word Cleaner and it will give you cleaned-up HTML for cutting and pasting wherever your heart desires.


  • If you're technically inclined and would like to take things a step further, jTidy will clean up your Word HTML and then validate it so you can be sure it's standards-compliant (hint: It probably won't be on the first pass).

0 Comments:

Post a Comment

<< Home