  1. #1
    Platinum Lounger
    Join Date
    Jan 2001
    Location
    Quedgeley, Gloucester, England
    Posts
    5,333
    Thanks
    0
    Thanked 1 Time in 1 Post

    Pasting Word 2002 data into FrontPage 2000 (2000)

    I copied one of my standard Web pages, deleted most of the data from it, then cut and pasted some text from Word 2002. From every line/sentence I had to delete two adjacent symbols, which I didn't recognise except that they looked a little like the Bookmark flag.

    The end result is that the new page is about twice the size of other, vaguely similar, pages, which were built just by typing into FrontPage 2000.

    Looking at the HTML indicates that there's a whole bunch of stuff there that I don't recognise from other pages (I'll post some of it if anyone wants).

    Is this something that has been encountered before? Can I easily remove the what-appears-to-be-extraneous additions?

    Thanks!
    John

    Ita, esto, quidcumque...

  2. #2
    Plutonium Lounger Leif
    Join Date
    Dec 2000
    Location
    U.K.
    Posts
    14,010
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Re: Pasting Word 2002 data into FrontPage 2000 (2000)

    Rule 1:
    Never paste directly from Word unless the trade-off from keeping whatever formatting you have in Word outweighs the additional bloat. Copy from Word and paste to (e.g.) Notepad; copy from Notepad and paste into FP.

    Rule 2:
    Always follow rule 1.

    Edited to add:

    Check out these two earlier posts: post 27461 and post 233837

  3. #3
    Platinum Lounger
    Join Date
    Jan 2001
    Location
    Quedgeley, Gloucester, England
    Posts
    5,333
    Thanks
    0
    Thanked 1 Time in 1 Post

    Re: Pasting Word 2002 data into FrontPage 2000 (2000)

    Oh woe, oh woe! O me miserium! [etc...]

    Is there a "debloater" you would recommend to apply to a thusly expanded web page, or is the best thing to start from scratch and follow your Rules 1&2 Infinite Loop?

    I remember (now!) reading the second of your references, mainly because of the cheerful rant of WebGenii (surely MsWebGenius?), but that thread didn't conclude whether the FrontPagePowerEdit was necessarily a Good Thing (in the "1066 And All That" sense)...
    John

    Ita, esto, quidcumque...

  4. #4
    Lounger
    Join Date
    Feb 2001
    Posts
    48
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Re: Pasting Word 2002 data into FrontPage 2000 (2000)

    John,

    I have also found that Word XP (2002) puts an excessive amount of overhead code into FrontPage 2000. What I do is copy from Word to the clipboard and then, in FrontPage 2000, use Edit, Paste Special and choose Normal Paragraphs with Line Breaks. This cuts out the 'junk' that Word added. Note that any special formatting you had in Word is lost with Paste Special. Anyway, play with it until you find a setting that suits you.

    LMD

  5. #5
    Super Moderator WebGenii
    Join Date
    Jan 2001
    Location
    Redcliff, Alberta, Canada
    Posts
    4,066
    Thanks
    2
    Thanked 5 Times in 5 Posts

    Re: Pasting Word 2002 data into FrontPage 2000 (2000)

    Surely, Flattery will get you everywhere!
    I've just done a series of tests comparing different methods of converting docs from Word 2002 to FrontPage 2000. And for everyone reading the thread - the versions are important. This is going to be lengthy. I replaced the angle brackets with curly brackets.
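
    For anyone who wants to try the samples below in a browser or editor, the curly brackets need turning back into angle brackets first. A minimal Python sketch of that substitution (the function name `curly_to_angle` is made up for illustration, and it assumes the sample contains no curly brackets of its own):

```python
def curly_to_angle(text):
    """Turn the {tag} notation used in this thread back into <tag> markup."""
    # Map "{" to "<" and "}" to ">" in a single pass.
    return text.translate(str.maketrans("{}", "<>"))


print(curly_to_angle("{p}Lorem ipsum {b}dolor sit amet,{/b}{/p}"))
```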

    Method 1 - Copy and paste from Word to FrontPage.
    Sample Code:
    <pre>{body}
    {h1}{span lang="EN-US" style="mso-ansi-language:EN-US"}Here is my Word Document{o:p}
    {/o:p}
    {/span}{/h1}
    {p class="MsoNormal"}{span lang="EN-US" style="mso-ansi-language:EN-US"}Lorem
    ipsum {b style="mso-bidi-font-weight:normal"}dolor sit amet,{/b} consectetuer
    Vestibulum vitae dui at magna viverra consequat. Sed blandit fermentum lorem.
    Morbi sit amet ipsum. In malesuada. {i style="mso-bidi-font-style:normal"}Nulla
    egestas {/i}{/span}{st1:City}
    {st1:place}
    </pre>

    Conclusion: looks like [censored].

    Method 2 - Save as a Web Page in Word, Insert HTML File in FrontPage.
    In FrontPage, Open an empty page and use the Insert, File menu to insert the HTML file into the page.
    Sample Code:
    <pre>{body}
    {div class="Section1"}
    {h1}{span lang="EN-US" style="mso-ansi-language:EN-US"}Here is my Word
    Document{o:p}
    {/o:p}
    {/span}{/h1}
    {p class="MsoNormal"}{span lang="EN-US" style="mso-ansi-language:
    EN-US" class="SpellE"}Lorem{/span}{span lang="EN-US"
    style="mso-ansi-language:EN-US"} {span class="SpellE"}ipsum{/span} {span
    class="GramE"}{b style="mso-bidi-font-weight:
    normal"}dolor sit{/b}{/span}{b style="mso-bidi-font-weight:normal"} {span
    class="SpellE"}amet{/span},{/b} {span class="SpellE"}consectetuer{/span} {span
    </pre>

    Conclusion: looks even more like [censored].

    Method 3 - Insert the Word Document in FrontPage.
    Sample Code:
    <pre>{body}
    {b}{font FACE="Arial" SIZE="5"}
    {p}Here is my Word Document{/p}
    {/font}{/b}
    {p}Lorem ipsum {b}dolor sit amet,{/b} consectetuer adipiscing elit. Donec
    suscipit ultrices ligula. In hac habitasse platea dictumst. Ut eget enim. Donec
    placerat sollicitudin risus.{/p}
    {b}{i}{font FACE="Arial" SIZE="4"}
    {p}Second level Heading{/p}
    {/font}{/i}{/b}
    </pre>

    Conclusion: looks better, but who wants those font tags?

    Method 4 - Save the Word Document as Filtered HTML, and Insert into FrontPage.
    Hidden in the Word Save As dialog box is the option to save as "Filtered" HTML.
    Sample Code:
    <pre>{body}
    {div class="Section1"}
    {h1}{span lang="EN-US"}Here is my Word Document{/span}{/h1}
    {p class="MsoNormal"}{span lang="EN-US"}Lorem ipsum {b}dolor sit amet,{/b}
    fermentum lorem. Morbi sit amet ipsum. In malesuada. {i}Nulla egestas {/i}{/span}{i}{span
    lang="EN-US"}massa{/span}{/i}{span lang="EN-US"} vel mi. {/span}{span
    lang="FR-CA"}Curabitur gravida. Vestibulum placerat sollicitudin risus.{/span}{/p}
    </pre>

    Conclusion: Gets rid of the font tags, but brings back the stupid span tags and the incredibly annoying class=MsoNormal paragraph style. And why is it swapping the language around anyway??
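
    If you are stuck with Method 4 output, the span tags and the MsoNormal class can be stripped with a couple of regular expressions. This is only a rough Python sketch, not part of the tests above: the function name `strip_word_markup` is hypothetical, and it assumes you are happy to lose the lang attributes along with the spans.

```python
import re


def strip_word_markup(html):
    """Remove span wrappers and Word's Mso* class attributes, keeping the text."""
    # Drop opening and closing span tags but keep whatever they wrapped.
    html = re.sub(r"</?span\b[^>]*>", "", html, flags=re.IGNORECASE)
    # Drop class attributes whose value starts with "Mso" (e.g. MsoNormal).
    html = re.sub(r'\s+class="?Mso\w*"?', "", html, flags=re.IGNORECASE)
    return html


sample = '<p class="MsoNormal"><span lang="EN-US">Lorem ipsum <b>dolor sit amet,</b></span></p>'
print(strip_word_markup(sample))
```

    Regular expressions are fine for tidy, machine-generated markup like this; for hand-edited pages a real HTML parser would be the safer choice.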


    Method 5 - Double Filter the Word Document and Insert into FrontPage.
    First save the Word document as "Filtered" HTML.
    Second, use the Office HTML Filter 2.0 (http://office.microsoft.com/download.../Msohtmf2.aspx) to refilter the file.
    Third, Insert into FrontPage.
    Sample Code:
    <pre>{body}

    {div}
    {h1}Here is my Word Document{/h1}
    {p}Lorem ipsum {b}dolor sit amet,{/b} consectetuer adipiscing elit. Donec
    magna viverra consequat. Sed blandit fermentum lorem. Morbi sit amet ipsum. In
    malesuada. {i}Nulla egestas massa{/i} vel mi. Curabitur gravida.
    Vestibulum placerat sollicitudin risus.{/p}
    {h2}Second level Heading{/h2}
    </pre>

    Conclusion: GREAT! The div tags will still need to be removed manually, but since there are only 2 per document I can live with that. What is also interesting about this is that the Office HTML Filter 2.0 doesn't work very well at all on "unfiltered" Word 2002 documents. It still leaves some proprietary MS code in place, regardless of your settings. It seems that the first filtering (in Word) removes a switch and then the Office filter can get on with the job.
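
    The leftover div tags could also be removed with a script rather than by hand. A minimal Python sketch, assuming the page contains no div tags that you want to keep (the function name `unwrap_divs` is made up for illustration):

```python
import re


def unwrap_divs(html):
    """Remove every opening and closing div tag, leaving the content in place."""
    # The trailing \s* also swallows the line break that follows each tag.
    return re.sub(r"</?div\b[^>]*>\s*", "", html, flags=re.IGNORECASE)


page = "<body>\n<div>\n<h1>Here is my Word Document</h1>\n</div>\n</body>"
print(unwrap_divs(page))
```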

    A few other notes.
    - All of these tests were done on a properly formatted Word document. In other words, I used the Word Heading styles.
    - I'd probably convert all the Bold and Italics to the HTML tags - STRONG and EM.
    - I looked at the options in Word under Tools, Options, Save, Embedding Smart Tags (switching this on/off seemed to make no difference at all). I also looked at Tools, Options, General and the Web Options button. I didn't find the options here to be especially useful either. I point them out for completeness.
    - The second filtering with the Office filter is annoying (one would assume that "Filtered" in Word, would mean exactly that), but an upside is that you can do large batches of documents at the same time. Just remember that they cannot be open in Word at the same time.
    - If you are using Word 2000, you may want to check out this page at MS, which talks about using the Office 2.0 filter with Word 2000.
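
    The Bold and Italics conversion mentioned in the notes above is easy to script. Here is a rough Python sketch, assuming the markup is already as clean as the Method 5 output; the names `to_semantic_tags` and `_RENAMES` are hypothetical.

```python
import re

# Presentational tag name -> semantic replacement.
_RENAMES = {"b": "strong", "i": "em"}


def to_semantic_tags(html):
    """Rename b tags to strong and i tags to em, keeping any attributes."""

    def rename(match):
        slash, name, rest = match.group(1), match.group(2), match.group(3)
        return "<{}{}{}>".format(slash, _RENAMES[name.lower()], rest)

    # \b stops the pattern from touching tags such as body, br or img.
    return re.sub(r"<(/?)(b|i)\b([^>]*)>", rename, html, flags=re.IGNORECASE)


print(to_semantic_tags("<p>Lorem ipsum <b>dolor sit amet,</b> <i>Nulla egestas</i></p>"))
```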

    Cheers
    Catharine Richardson (WebGenii)
    WebGenii Home Page
    Moderator: Spreadsheets, Other MS Apps, Presentation Apps, Visual Basic for Apps, Windows Mobile

  6. #6
    Super Moderator jscher2000
    Join Date
    Feb 2001
    Location
    Silicon Valley, USA
    Posts
    23,112
    Thanks
    5
    Thanked 93 Times in 89 Posts

    Re: Pasting Word 2002 data into FrontPage 2000 (2000)

    Search the Lounge for Tiny HTML. Not used it myself, but read the guy's page once and it sounded useful.

  7. #7
    Platinum Lounger
    Join Date
    Jan 2001
    Location
    Quedgeley, Gloucester, England
    Posts
    5,333
    Thanks
    0
    Thanked 1 Time in 1 Post

    Re: Pasting Word 2002 data into FrontPage 2000 (2000)

    After that post, I think my attribution of Catharine as MsWebGenius is entirely justified! Quick, find the colour printer!

    Thanks to everyone for their comments - I couldn't find any decent reference to Tiny HTML on Google, and I suspect that my best bet is probably to go the Leif way ("Way to go, Leif?"!), since I'm not pasting any great amount of text in from Word, and I can reformat it without days of editing. I hadn't come across the Office HTML Filter 2.0, but will acquire it and play with it.

    Thanks again!
    John

    Ita, esto, quidcumque...

  8. #8
    Super Moderator jscher2000
    Join Date
    Feb 2001
    Location
    Silicon Valley, USA
    Posts
    23,112
    Thanks
    5
    Thanked 93 Times in 89 Posts

    Re: Pasting Word 2002 data into FrontPage 2000 (2000)

    Sorry! Turns out it was Tidy, not Tiny.

    See: HTML Tidy Library Project

    And this article sounds interesting: Integrating HTML Tidy into Microsoft FrontPage
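
    For anyone who would rather run Tidy from a script than from inside FrontPage, here is a minimal Python sketch. It assumes a `tidy` executable is installed and on the PATH, and that the build supports the `word-2000` configuration option for stripping Word's extra markup; the function names `tidy_command` and `run_tidy` and the file names are made up for illustration.

```python
import shutil
import subprocess


def tidy_command(src, dest):
    """Build the argument list for cleaning a Word-generated HTML file."""
    # -q: quiet output, --word-2000 yes: strip Word's surplus markup,
    # -o: write the cleaned page to dest instead of stdout.
    return ["tidy", "-q", "--word-2000", "yes", "-o", dest, src]


def run_tidy(src, dest):
    """Run Tidy and return its exit code (0 = clean, 1 = warnings, 2 = errors)."""
    if shutil.which("tidy") is None:
        raise FileNotFoundError("tidy executable not found on PATH")
    # Warnings give a non-zero exit code, so check=True is deliberately not used.
    return subprocess.run(tidy_command(src, dest)).returncode


print(tidy_command("word_page.html", "clean_page.html"))
```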

  9. #9
    Super Moderator WebGenii
    Join Date
    Jan 2001
    Location
    Redcliff, Alberta, Canada
    Posts
    4,066
    Thanks
    2
    Thanked 5 Times in 5 Posts

    Re: Pasting Word 2002 data into FrontPage 2000 (2000)

    Nifty - I'll be taking a closer look.

    Cheers
    Catharine Richardson (WebGenii)
    WebGenii Home Page
    Moderator: Spreadsheets, Other MS Apps, Presentation Apps, Visual Basic for Apps, Windows Mobile
