The compressed XML document can be viewed as the original XML document with its tags and element/ attribute values replaced by their corresponding encodings. 可以将被压缩的XML文档视为原始XML文档,相应的编码取代其标记和元素/属性值。
The caveat here is that not all Web clients can handle the encodings that they advertise. 这里需切记的是:并不是所有的Web客户机都可以处理它们宣称的编码。
Unicode defines character encodings in three distinct sizes UTF-8, UTF-16, and UTF-32 while the traditional character type is8 bits. Unicode用三种不同的大小定义字符编码UTF-8、UTF-16和UTF-32而传统的字符类型是8位的。
This is necessary because in UTF-8, it is often possible to represent the same character with multiple encodings. 这是必要的,因为在UTF-8中,常常可能用多个编码来代表同一个字符。
Developers often overlook the fact that encodings do not limit the set of characters that XML supports. 开发人员常常忽视了一个问题,编码没有限制XML支持的字符集。
Several encodings are used for Unicode: the two most popular are UTF-8 and UTF-16. 有几种编码可用于Unicode:最为常用的两个是UTF-8和UTF-16。
Characters in the ASCII range occupy only half the space in UTF-8 that they do in some other encodings of Unicode, particularly UTF-16. 与其他Unicode编码特别是UTF-16相比,在UTF-8中ASCII字符占用的空间只有一半。
This made data exchange problematic and required application software to be programmed for differences between these encodings. 这使得数据交换出现了很多问题,并且需要为这些编码之间的差异编写应用软件。
The command locale-m provides a list with the names of all installed character encodings. 命令locale-m提供一张已安装的所有字符编码名称的列表。
UTF-8 is less likely than UTF-16 or other Unicode encodings to cause problems for systems that are unaware of Unicode and XML. 与UTF-16或其他Unicode编码相比,对于不支持Unicode和XML的系统,UTF-8更不容易造成问题。
Acceptable names for encodings compatible with W3C specification contain Latin characters only. 与W3C规范兼容的编码可接受的名称只包括拉丁字符。
Remember what the world looked like before Unicode encodings became established practice. 还记得采用Unicode编码成为惯例之前世界是什么样吗?
It defines a model of mathematical objects and various encodings of that model, the most important one now being XML encoding. 它定义数学对象的模型以及该模型的各种编码,现在最重要的一个模型是XML编码的。
More serious problems can arise from the use of encodings. 编码的使用可能带来更严重的问题。
Unicode encodings and algorithms are used for internal and external communications in the AIX system. AIX系统中的内部和外部通信使用Unicode编码和算法。
It does have some notable weaknesses, including primitive data typing and lack of support for character encodings ( an astonishing flaw given its use of XML). 它确实有些明显的不足,包括原始的数据类型化和缺乏字符编码支持(这对XML使用来说是个令人吃惊的缺陷)。
If the properties file contains non-ASCII characters, such as the product name or about text, you need to convert the file from native encodings to ASCII with escaped Unicode. 如果属性文件包含了非ASCII字符(产品名或者about文本),需要使用转义Unicode把文件从原生编码转换成ASCII。
To clear up this confusion, I will extract the essence of the concept of encodings. 为了消除这种误解,我将总结编码概念的精髓。
You should now have a good understanding of how to serialize XML data into different types of output streams with different encodings. 现在,您应该很好地理解了如何将XML数据序列化成具有不同编码的不同类型的输出流。
If, on the other hand, you store the binary data as is, then you might end up with illegal sequences in UTF-8 and other encodings. 另一方面,如果按原样存储二进制数据,最终可能得到非法的UTF-8序列和其他编码。
Internationalization has had another impact on programs: character encodings. 国际化对程序还有另外一方面的影响:字符编码。
Encodings for base class, sub-class, and programming interface are provided in Appendix D. 附录D提供了基类、子类和程序接口的具体编码。
All localizations should have at least a UTF-8 version, but may optionally add versions in other character encodings popular for that locale. 所有的本地化翻译应该至少有一个UTF-8版本,但是可能会随意地在本地化翻译受欢迎的其它字符编码中添加版本。
For example, many user agents use a heuristic to distinguish the various encodings used for Japanese text. 例如,很多用户代理都会用启发的方式来决定日文文本的多种编码方式。
A single UTF document can be in various languages with no need to switch encodings halfway through. 一个单一的UTF文件可以有不同的语言,但是你不需要在这些语言之间切换代码。
The te header, which specifies the transfer encodings that are acceptable for the response. te标头,指定响应可接受的传输编码方式。
Code conversion to multibyte encodings. 代码转换成多字节编码。
An introduction to these classes is provided in understanding encodings. 了解编码对这些类进行了介绍。
As a consequence of this change in philosophy, pretty much all code that uses Unicode, encodings or binary data most likely has to change. 基于这个哲学来修改的结果是,只使用Unicode的代码清爽了许多,编码和二进制数据都需要为此做出修改。