Notepad bug.... Or quoted for truth?

b0ld · #1 (**permalink**) Jun 18, 06

open up notepad.exe and type in: "Bush hid the facts" without the quotes, and without hitting enter.. then save it as a .txt, close notepad, and reopen the .txt and see what it says.

project.one · #2 (**permalink**) Jun 18, 06

i get little boxes!
are you susposed to save the file as something in particular?

NinjaBoy · #3 (**permalink**) Jun 18, 06

畂桳栠摩琠敨映捡獴

You have to have all the language packs installed for it to work.

http://blog.wired.com/27BStroke6/ind...try_id=1502576

mapleleaf4ever · #4 (**permalink**) Jun 18, 06

Quote:

Originally Posted by projectone

i get little boxes!

Me too.

e_BoY · #5 (**permalink**) Jun 18, 06

just tried it. a bunch of chinese/korean/jap letters came up. i let my dad readt it but he said it wasnt chinese. So i tried bable fish to translate chinese simplified - english and it only transalated 2 words out of the bunch, same results for chinese tradtional - english. tried japanese - english and it transalted 1 of the words. Last i tried korean - english and it didnt transalate anything. very cruious to what it says

LastWerd · #6 (**permalink**) Jun 18, 06

Tell Us What It Says!!!!

project.one · #7 (**permalink**) Jun 18, 06

Yeah... spill it joe!

trance4life · #8 (**permalink**) Jun 18, 06

lol neat

project.one · #9 (**permalink**) Jun 18, 06

Somebody just post it!

thebobman · #10 (**permalink**) Jun 18, 06

Quote:

Originally Posted by Aftermarket Pipes

But we can't even blame Notepad: it's a limitation of Windows itself, specifically the Windows function that Notepad uses to figure out if a text file is Unicode or not.

You see, text files containing Unicode (more correctly, UTF-16-encoded Unicode) are supposed to start with a "Byte-Order Mark" (BOM), which is a two-byte flag that tells a reader how the following UTF-16 data is encoded. Given that these two bytes are exceedingly unlikely to occur at the beginning of an ASCII text file, it's commonly used to tell whether a text file is encoded in UTF-16.

But plenty of applications don't bother writing this marker at the beginning of a UTF-16-encoded file. So what's an app like Notepad to do?

Windows helpfully provides a function called IsTextUnicode()--you pass it some data, and it tells you whether it's UTF-16-encoded or not.

It actually runs a couple of heuristics over the first 256 bytes of the data and provides its best guess. As it turns out, these tests aren't terribly reliable for very short ASCII strings that contain an even number of lower-case letters, like "this app can break", or more appropriately, "this api can break".

The documentation for IsTextUnicode says:

These tests are not foolproof. The statistical tests assume certain amounts of variation between low and high bytes in a string, and some ASCII strings can slip through. For example, if lpBuffer points to the ASCII string 0x41, 0x0A, 0x0D, 0x1D (A\n\r^Z), the string passes the IS_TEXT_UNICODE_STATISTICS test, though failure would be preferable.

Problem Explained

project.one · #11 (**permalink**) Jun 18, 06

Ok, so now we know why

But now, what actually happens when we enter what B0ld told us to?