Help.
We have an application that will generate PDF documents from input data. We have been sending this program Latin-1 (which requires turning the upper 128 bit values of a byte into UTF-8), but we now need to send it katakana data in UTF-8 representation.
We've looked around and see the Unicode values going from 30A0 through 30FF, but we understand that UTF-8 Katakana takes 3 bytes, not 2.
Can anyone provide the specific byte values (in order) for a number of katakana characters that are true UTF-8? I think the ideal format would be a snipped from an HTML file so that we can view these, but we really need to understand how many bytes UTF-8 katakana should be and to get some sample values (if not the entire character set) showing both the values and the character. We're going to have to create a replacement table to convert our source file into this.
Thanks,
We have an application that will generate PDF documents from input data. We have been sending this program Latin-1 (which requires turning the upper 128 bit values of a byte into UTF-8), but we now need to send it katakana data in UTF-8 representation.
We've looked around and see the Unicode values going from 30A0 through 30FF, but we understand that UTF-8 Katakana takes 3 bytes, not 2.
Can anyone provide the specific byte values (in order) for a number of katakana characters that are true UTF-8? I think the ideal format would be a snipped from an HTML file so that we can view these, but we really need to understand how many bytes UTF-8 katakana should be and to get some sample values (if not the entire character set) showing both the values and the character. We're going to have to create a replacement table to convert our source file into this.
Thanks,