Glossary of Unicode Terms
Sources:
http://www.ibm.com/developerworks/library/glossaries/unicode.html
http://www.stevenblack.com/intl%20glossary.html
http://osulibrary.oregonstate.edu/archives/handbook/definitions/
Key Definitions:
Unicode : A character set that encompasses all of the world's living languages. Unicode is the basis of most modern software internationalization.
Script : A set of characters used to write a particular set of languages. For example, the Latin (or Roman) script is used to write English, French, Spanish, and most other European languages; the Cyrillic script is used to write Russian and Serbian.
Transcript : 1. A copy or reproduction, in so far as the resources of script and/or typography allow, of an original document.
2. In legal proceedings, an exact copy of a text.
3. A verbatim written, typed, or printed version of the spoken word, e.g. proceedings in a court of law or an oral history interview. (SAA)
4. The official copy of a student's educational record.
Coding or Character Set : Assigning numerical code to each character in a collection of characters so that it can be represented on a computer. Most traditional character sets contain characters from only one or two scripts.
Transcoding : Conversion of character data from one character set to another.
Translation : The conversion of text from one human language to another.
Transliteration : Transformation of text from one script to another, usually based on phonetic equivalencies. For example, Greek text might be transliterated into the Latin script so that it can be pronounced by English speakers.
Input Method Editor (IME) : A program that performs the conversion between keystrokes and ideographs or other characters, usually by user-guided dictionary lookup.
http://www.ibm.com/developerworks/library/glossaries/unicode.html
http://www.stevenblack.com/intl%20glossary.html
http://osulibrary.oregonstate.edu/archives/handbook/definitions/
Key Definitions:
Unicode : A character set that encompasses all of the world's living languages. Unicode is the basis of most modern software internationalization.
Script : A set of characters used to write a particular set of languages. For example, the Latin (or Roman) script is used to write English, French, Spanish, and most other European languages; the Cyrillic script is used to write Russian and Serbian.
Transcript : 1. A copy or reproduction, in so far as the resources of script and/or typography allow, of an original document.
2. In legal proceedings, an exact copy of a text.
3. A verbatim written, typed, or printed version of the spoken word, e.g. proceedings in a court of law or an oral history interview. (SAA)
4. The official copy of a student's educational record.
Coding or Character Set : Assigning numerical code to each character in a collection of characters so that it can be represented on a computer. Most traditional character sets contain characters from only one or two scripts.
Transcoding : Conversion of character data from one character set to another.
Translation : The conversion of text from one human language to another.
Transliteration : Transformation of text from one script to another, usually based on phonetic equivalencies. For example, Greek text might be transliterated into the Latin script so that it can be pronounced by English speakers.
Input Method Editor (IME) : A program that performs the conversion between keystrokes and ideographs or other characters, usually by user-guided dictionary lookup.
Comments