Learn@UW - Content Topics as HTML Documents Created from Word Should be UTF-8 Encoded

If you create an HTML 'Web page' document from Microsoft Word and use it as course content, the HTML document should be saved as UTF-8 encoded.

This will ensure that correct characters display in the topic and prevent future saving of the topic with the HTML editor from corrupting it. We have had cases where editing and saving a topic with the HTML editor has corrupted the topic by adding 'garbage characters' in place of characters and formatting commands encoded with Microsoft-specific code that were used in the original Word document.

If you have a Content topic in your course that appears to have been originally created from Microsoft Word and becomes corrupted when saved with the HTML editor, do the following to create a topic that will not cause problems when later edited:

  1. Download a 'clean' copy of the .html file corresponding to the topic that has not been corrupted.
  2. Open your Word application (do not double click on the file).
  3. Within Word, open the file.
  4. Save the file as a Web Page .html document, setting the encoding to UTF-8. (The instructions for this vary with the version of Word, see below.)
  5. Create a new Content topic in your course by uploading the newly created file.
  6. Save the new topic.
  7. Delete the original topic from your course.

You should be able to safely edit the new topic later with the HTML editor.

 

Instructions for Saving a Word Document as a UTF-8 Encoded HTML File

From within Word 2003 or 2007 (PC)

  1. Choose File (2003) or the Office button (2007).
  2. Choose Save As.
  3. Choose Other Formats.
  4. Choose Web page (*.htm; *.html).
  5. At the bottom left of the dialog click on "Tools".
  6. Choose Web Options...
  7. Select Encoding tab.
  8. Save this document as Unicode (UTF-8).

From within Word 2010 or 2013 (PC)

  1. Choose File.
  2. Choose Save As.
  3. Select Web page (*.htm; *.html) from the drop-down by "Save as type:".
  4. Select Web Options... from the drop-down by "Tools".
  5. Select Encoding tab.
  6. Select Unicode (UTF-8) from the drop-down by "Save this document as:"
  7. Press OK and then Save to complete the saving process.

From within Word 2004 or 2008 (Mac)

  1. Save as Web page (format is Web Page (HTML) or Web Page (.htm), depending on version of Word for Mac)
  2. At the bottom left of the dialog click "Web Options".
  3. Select Encoding tab.
  4. Save this document as Unicode (UTF-8).
  5. Click OK.



Keywords:brightspace learn@uw learnuw d2l desire2learn content topic html editor document Microsoft Office Word UTF-8 unicode encode corrupt garbage character known issue   Doc ID:8982
Owner:Joan W.Group:Learn@UW Utility
Created:2009-02-03 19:00 CDTUpdated:2016-05-24 14:31 CDT
Sites:DoIT Help Desk, DoIT Tech Store, Learn@UW Madison, Learn@UW Utility
Feedback:  0   1