CEN Guide to the Use of Character Sets in Europe (TC 304)

8-Bit Character Sets


This part of the Guide to the Use of Character Sets in Europe provides more detailed information about 8-bit character set standards than is found in the main body of the Guide. Another part of the Guide deals in more detail with the Universal Multi-octet Coded Character Set (UCS) specified in ISO/IEC 10646-1.

The need to represent characters by bit combinations (binary numbers) is central to the storage and processing of data by computer systems and the interchange of data between such systems. This is a guide to the many standards and other specifications that have been developed to address the issues arising from this need, up to the advent of the Multi-Octet code structure embodied in ISO/IEC 10646-1:1993. You may select the major sections of this guide directly from the following table of contents, or read on below the contents for further information.

Table of Contents

More about this guide

The requirement for compatibility between newer and older equipment has led to present-day standards containing legacies from decisions taken many years ago. The reasons behind those decisions are often no longer relevant, and their present-day legacies may appear merely as unnecessary oddities and complexities. This guide provides some historical background, but reading it is not necessary for an understanding of the remainder of the guide.

As work on character sets has developed, there has been a gradual refinement of the concepts involved. This has led to character set standards and other literature making use of technical terms that can be a barrier to the reader. It may be helpful to read the section on concepts and terminology before exploring the remaining sections of the guide in detail.

This gradual evolution of character set standards has led to technical innovations designed to increase the capabilities of coded character sets while remaining backward compatible with what has gone before. Within this evolved framework it is now possible to support a wide range of languages. The wider the range that must be supported simultaneously, however, the more complex the technical innovation required. For further information see the section on language support.
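As a small illustration of why simultaneous support for several languages needs additional mechanisms, the sketch below interprets one and the same 8-bit combination under three different parts of ISO/IEC 8859. The helper name `decode_byte` is hypothetical and is not taken from this guide; the charset names are simply the codec names that Python's standard library happens to use.

```python
# Minimal sketch: one bit combination, three 8-bit coded character sets.
# decode_byte is a hypothetical helper introduced for illustration only.

def decode_byte(value: int, charset: str) -> str:
    """Interpret a single 8-bit combination under the named character set."""
    return bytes([value]).decode(charset)


if __name__ == "__main__":
    # The same byte stands for a different character in each set, so the
    # data alone cannot say which repertoire (Latin, Cyrillic, Greek) is meant.
    for charset in ("iso-8859-1", "iso-8859-5", "iso-8859-7"):
        ch = decode_byte(0xE1, charset)
        print(f"{charset}: 0xE1 -> U+{ord(ch):04X} {ch}")
```

Because the bit combination alone does not identify the character set, mixing such repertoires in a single piece of data requires some further convention for announcing or switching the set in use.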

Not all the technical innovations are compatible with all the ways that character data may be used by applications. The section on application environments provides guidance on these limitations.

Other sections of this guide provide greater detail on particular issues. They may be selected directly from the table of contents, but they are also cross-referenced from the sections of general guidance. An index is also provided that enables direct access to tutorial information on individual standards.

Limitations of this guide

This guide does not cover, or only briefly covers, the following topics:

