A Microsoft Office (Excel, Word) forum. OfficeFrustration

If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.

Go Back   Home » OfficeFrustration forum » Microsoft Word » General Discussion
Site Map Home Register Authors List Search Today's Posts Mark Forums Read  

search/find by unicode codepoint



 
 
Thread Tools Display Modes
  #1  
Old April 29th, 2010, 03:04 PM posted to microsoft.public.word.docmanagement
da9ve
external usenet poster
 
Posts: 10
Default search/find by unicode codepoint

I've been trying to use the search/Find and Replace tool to search by Unicode
codepoint. I know I'm using the right syntax/punctuation, because when I
search on ^100, it correctly finds all the 'd' characters. But when I try
searching on a codepoint in a "private use" codepage *, it just doesn't find
anything.

Specifically, there are numerous 0xF020 characters plopped into this
document alongside (badly used) Symbol font versions of ≤ ≥ characters, and
Word displays them sometimes as combining dots underneath these adjacent
characters, and sometimes as little question-mark boxes (ï€*ï€*) beside them.

(Hm, when I paste them in this browser window, they show up as the combining
dots, and mess up the cursor position on the line of text where they sit.)
But either way, when I toggle the Unicode, Word tells me they're 0xF020's,
and that hex translates to 61472 decimal - and a search on that always
returns "Word has finished searching the document. The search item was not
found."

(* Yeah, long story: I get documents from the EMEA - European Medicines
Agency - and some of them have,... strange characters and formatting. I know
these private use characters aren't SUPPOSED to be used, but for some reason
they're there, and I need to be able to programmatically find and replace
them.

Is this just a known loophole, that Word doesn't bother searching on
characters in "private use" or undefined codepages, or is there something
else that it's hiding from me?
  #2  
Old April 29th, 2010, 03:53 PM posted to microsoft.public.word.docmanagement
Klaus Linke
external usenet poster
 
Posts: 401
Default search/find by unicode codepoint

To search for (decimal) Unicode, you can use ^u61472 ...

But since you usually don't know the decimal code, it's easier to copy/paste
the character into the Find dialog (Ctrl+C/Ctrl+V), or to type the hex code
into "Find what" (F020) and the toggle it with Ctrl+X (ToggleCharacterCode):
That works in most dialogs, too.

Regards,
Klaus

 




Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump


All times are GMT +1. The time now is 05:36 PM.


Powered by vBulletin® Version 3.6.4
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright ©2004-2024 OfficeFrustration.
The comments are property of their posters.