  1. #1
    3 Star Lounger
    Join Date
    Mar 2003
    Location
    Elkins Park, Pennsylvania, USA
    Posts
    325
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Unicode by-product? (Word XP/2003)

    Hi, all. I hope I can make this concise.

    I've seen, and occasionally created, documents that contain characters I can't manipulate the way I can other characters. Specifically, when typing in Word, I or some other hapless victim will enter a certain character and format it in such a way that Word "converts" it, leaving me with something other than the regular ANSI character with a bit of font information applied. In fact, when I highlight the text and use Ctrl+Spacebar (to strip off formatting), I'm left with a non-printing character that looks like a period BELOW the baseline of the text and well to the left of the cursor. When I catch one of these (or when some innocent parsing routine passes over it), I can re-apply the Symbol font and things sort of change back round, with the "character" (which by now usually looks like a space!) falling predictably in-line and in-sequence with the rest of the characters.

    I know this may be hard to visualize, but hopefully someone has seen enough of this phenomenon to know what I'm talking about.
    That's what you do in a herd; you look out for each other! - Mike

  3. #2
    Super Moderator jscher2000
    Join Date
    Feb 2001
    Location
    Silicon Valley, USA
    Posts
    23,112
    Thanks
    5
    Thanked 93 Times in 89 Posts

    Re: Unicode by-product? (Word XP/2003)

    When a "symbol" (decorative font) is deliberately inserted in Word, usually it is protected from the ordinary formatting commands, so the characters should not be mangled by Ctrl+Spacebar. Perhaps the poorly protected characters were inserted using AutoComplete or AutoText??

  4. #3
    3 Star Lounger
    Join Date
    Mar 2003
    Location
    Elkins Park, Pennsylvania, USA
    Posts
    325
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Re: Unicode by-product? (Word XP/2003)

    I am at a loss as to what the heck this character may be, let alone how to search for or handle it. I am attaching a very small document that has an example of just such a character in the caption. Immediately after AUC0-infinity, there are TWO characters. The first one is the "mystery character," which, with "Show All" turned off, looks just like a space formatted with the Symbol font. The second character is a legitimate space and, with "Show All" turned on, displays the non-printing "dot" character. There is no non-printing placeholder for the mystery character, as there is for a space. But if you highlight it (probably easiest to do by using the arrow keys and the Shift key) and use Ctrl+Spacebar, and if you're zoomed in to at least 200%, you will then see a dot below the baseline underneath the infinity character.

    This is a problem because another process here at work parses the caption and halts when it hits this unknown character. If anyone has any clue as to what preventative or remedial actions I can take or write, I would be grateful!
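One possible remedial step, sketched here in Python purely for illustration: if the mystery character turns out to be a code point in Unicode's Private Use Area (U+E000 to U+F8FF, which is an assumption at this point), the caption text could be scanned and cleaned before the parser ever sees it. The function names and the sample caption are hypothetical and are not part of the actual process at work.

```python
# Bounds of the Unicode Private Use Area in the Basic Multilingual Plane.
PUA_START, PUA_END = 0xE000, 0xF8FF


def find_private_use(text):
    """Return (index, 'U+XXXX') for every Private Use Area character in text."""
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(text)
        if PUA_START <= ord(ch) <= PUA_END
    ]


def replace_private_use(text, replacement=" "):
    """Return text with every Private Use Area character swapped for replacement."""
    return "".join(
        replacement if PUA_START <= ord(ch) <= PUA_END else ch for ch in text
    )


# Hypothetical caption: an infinity sign followed by one suspect character.
caption = "AUC0-\u221e\uf020 values"
print(find_private_use(caption))     # [(6, 'U+F020')]
print(replace_private_use(caption))  # suspect character becomes a plain space
```

Reporting the positions first, rather than replacing blindly, makes it possible to check whether it really is always the same character before changing any documents.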
    Attached Files
    That's what you do in a herd; you look out for each other! - Mike

  5. #4
    Super Moderator jscher2000
    Join Date
    Feb 2001
    Location
    Silicon Valley, USA
    Posts
    23,112
    Thanks
    5
    Thanked 93 Times in 89 Posts

    Re: Unicode by-product? (Word XP/2003)

    If I select the mystery "space" and press Alt+X, I get the Greek character Phi followed by 020. If I first change the font to a regular font and then press Alt+X I get F020. Neither of these seems to be a valid character. However, if it is always the same invalid character, you could find and replace it.

    The Immediate window reports

    <code>?ascw(selection.Text)
    -4064 </code>
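The negative number is most likely just how VBA displays the value: AscW returns a signed 16-bit Integer, so any code point above 32767 wraps around to a negative. A minimal Python sketch of the arithmetic (the function names are made up for illustration) suggests that -4064 is the same F020 that Alt+X reports. That code point sits in Unicode's Private Use Area (U+E000 to U+F8FF), which would explain why it does not look like a "valid" character.

```python
def ascw_to_code_point(value):
    """Convert a signed 16-bit AscW result to an unsigned code point."""
    return value & 0xFFFF


def is_private_use(code_point):
    """True if the code point lies in the BMP Private Use Area."""
    return 0xE000 <= code_point <= 0xF8FF


code_point = ascw_to_code_point(-4064)
print(f"U+{code_point:04X}")       # U+F020
print(is_private_use(code_point))  # True
print(hex(code_point - 0xF000))    # 0x20, the character code of a space
```

Word is widely reported to store symbol-font characters at F000 plus the original character code. If that holds here, F020 would be a Symbol-font space, consistent with how the character looks on screen. The "Phi followed by 020" is probably the same F020 too, since the Symbol font draws the letter F as a Phi.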

    A quickie macro can be found here: Finding and replacing symbols @ mvps.org

  6. #5
    Platinum Lounger
    Join Date
    Feb 2001
    Location
    Yilgarn region of Toronto, Ontario
    Posts
    5,453
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Re: Unicode by-product? (Word XP/2003)

    >A quickie macro can be found here: Finding and replacing symbols @ mvps.org
    Thanks.
    (How do you anticipate my needs?!!)
