... Pause, right-click in Firefox, go to View Page Info, and look at Encoding: what does it say? Do you care? I am sure you don't, because you know English.
But the West is not the whole world; there is sky outside it too.
We, at RXS, are in the process of capturing that. I chose to begin with SniperSpy for the conversion, and discovered: "PHP represents strings “abc…” as bytes, not as individual characters; that is, 'a' takes 1 byte and 'b' another 1 byte. But a Unicode character can take several bytes: up to 4 in UTF-8 (up to 6 in the original UTF-8 design), so 捲 takes 3 bytes in UTF-8, and even 'a' takes 2 bytes in UCS-2, hence the ? after each letter if you try to print a Unicode character. All internal string functions in PHP count bytes; that is, strlen of 'abc' is normally 3, but for 'abc' stored as UCS-2 it comes to 6! VB6 also doesn't support Unicode in its internal controls, BUT its strings are stored as Unicode (UCS-2), and we have script support (VBScript) that can write files in Unicode, plus an API to convert ASCII to Unicode, so the module part is not as difficult to switch to multilingual data capture.
Added to this, the module stores files in Unicode (UCS-2 format), and MySQL needs UTF-8! There is again an 8-bit versus 16-bit difference, and again conversions are needed; in my tests, a UCS-2 to UTF-8 conversion turned Russian letters into something like Japanese!"
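To make the byte counting concrete, here is a small sketch in Python rather than PHP (my choice for illustration only; it is not the SniperSpy code). The function names are made up for this post, and Python's "utf-16-le" codec stands in for UCS-2, which is the same thing for characters in the Basic Multilingual Plane. The last function shows one possible way Russian text can come out looking East Asian, namely reading the 16-bit units with the wrong byte order; I am not claiming this is the exact bug I ran into.

```python
def byte_lengths(text):
    """Return (characters, UTF-8 bytes, UCS-2 bytes) for a string.

    Assumption: 'utf-16-le' is used as a stand-in for UCS-2, which is
    identical for characters in the Basic Multilingual Plane.
    """
    return (len(text),
            len(text.encode("utf-8")),
            len(text.encode("utf-16-le")))


def ucs2_to_utf8(raw):
    """Convert little-endian UCS-2 bytes to UTF-8 bytes."""
    return raw.decode("utf-16-le").encode("utf-8")


def misread_byte_order(raw):
    """Decode little-endian UCS-2 bytes as if they were big-endian.

    This is a deliberate mistake, shown only to illustrate how
    Cyrillic letters can turn into CJK-looking characters.
    """
    return raw.decode("utf-16-be")


if __name__ == "__main__":
    print(byte_lengths("abc"))   # (3, 3, 6)
    print(byte_lengths("捲"))    # (1, 3, 2)

    russian = "привет"
    stored = russian.encode("utf-16-le")          # what the module would write
    print(ucs2_to_utf8(stored).decode("utf-8"))   # round-trips to the original
    print(misread_byte_order(stored))             # CJK-looking garbage
```

The point of the sketch is that a byte-counting function, like PHP's strlen, sees the second and third numbers, never the first, and that the conversion to UTF-8 only works if the 16-bit units are read in the byte order they were written in.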
PHP 6 is coming, and it is finally going to make progress on that front; till then, here is a good solution:
I will also continue here as a working demo gets introduced.