半形及全形字元 (Unicode區段)
半形及全形字元是 Unicode 基本多文種平面內的最後第二個區段,範圍包括 U+FF00..U+FFEF。本區段包含全形和半形字元以便可以無失真從舊編碼系統轉換至 Unicode。該區段於 Unicode 1.0 的舊稱為 半形及全形變體(Halfwidth and Fullwidth Variants)。[4]
半形及全形字元 Halfwidth and Fullwidth Forms | |
---|---|
範圍 | U+FF00..U+FFEF (240個碼位) |
平面 | 基本多文種平面(BMP) |
文字 | 諺文(52 個字元) 片假名(55 個字元) 拉丁文字(52 個字元) 普通(66 char.) |
符號系列 | 異體寬度字元 |
已分配 | 225個碼位 |
未分配 | 15個保留碼位 |
統一碼版本歷史 | |
1.0.0 | 216 (+216) |
1.0.1 | 223 (+7) |
3.2 | 225 (+2) |
註釋:[1][2][3] |
U+FF01..U+FF5E 編碼了 ASCII 內 21 至 7E 的全形版本。U+FF00 未對應 ASCII 20 (空格)字元,該字元已經由 U+3000 IDEOGRAPHIC SPACE 表意空格取代。
U+FF65..U+FF9F 編碼了JIS X 0201 內的半形片假名——參見半形假名。
U+FFA0..U+FFDC 編碼了諺文KS C 5601 1974年版的諺文相容字母。其對應的字元用於 IBM 的韓文編碼頁 933,其中包括 Shift In 和 Shift Out 的字元以對應雙位元組編碼。[5]基於雙位元組編碼可能包括相容字母,本區段包含了部分半形變體以便可以無失真從 KS C 5601 轉碼至 Unicode。[6][7]
U+FFE0..U+FFEE 編碼了全形及半形符號。
碼表
編輯半形及全形字元 Halfwidth and Fullwidth Forms[1][2] Unicode 聯盟官方碼表(PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+FF0x | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / | |
U+FF1x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
U+FF2x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
U+FF3x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
U+FF4x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
U+FF5x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | ⦅ |
U+FF6x | ⦆ | 。 | 「 | 」 | 、 | ・ | ヲ | ァ | ィ | ゥ | ェ | ォ | ャ | ュ | ョ | ッ |
U+FF7x | ー | ア | イ | ウ | エ | オ | カ | キ | ク | ケ | コ | サ | シ | ス | セ | ソ |
U+FF8x | タ | チ | ツ | テ | ト | ナ | ニ | ヌ | ネ | ノ | ハ | ヒ | フ | ヘ | ホ | マ |
U+FF9x | ミ | ム | メ | モ | ヤ | ユ | ヨ | ラ | リ | ル | レ | ロ | ワ | ン | ゙ | ゚ |
U+FFAx | HW HF |
ᄀ | ᄁ | ᆪ | ᄂ | ᆬ | ᆭ | ᄃ | ᄄ | ᄅ | ᆰ | ᆱ | ᆲ | ᆳ | ᆴ | ᆵ |
U+FFBx | ᄚ | ᄆ | ᄇ | ᄈ | ᄡ | ᄉ | ᄊ | ᄋ | ᄌ | ᄍ | ᄎ | ᄏ | ᄐ | ᄑ | ᄒ | |
U+FFCx | ᅡ | ᅢ | ᅣ | ᅤ | ᅥ | ᅦ | ᅧ | ᅨ | ᅩ | ᅪ | ᅫ | ᅬ | ||||
U+FFDx | ᅭ | ᅮ | ᅯ | ᅰ | ᅱ | ᅲ | ᅳ | ᅴ | ᅵ | |||||||
U+FFEx | ¢ | £ | ¬ |  ̄ | ¦ | ¥ | ₩ | │ | ← | ↑ | → | ↓ | ■ | ○ | ||
註解 |
異體字選擇器
編輯本區段包含部分標點位置的異體字選擇器,[8][9] 其中使用了 U+FE00 ︀ VARIATION SELECTOR-1 (VS01) 及 U+FE01 ︁ VARIATION SELECTOR-2 (VS02):
U+ | FF01 | FF0C | FF0E | FF1A | FF1B | FF1F | 位置 |
基本字元 | ! | , | . | : | ; | ? | |
基本字元 + VS01 | !︀ | ,︀ | .︀ | :︀ | ;︀ | ?︀ | 左下角 |
基本字元 + VS02 | !︁ | ,︁ | .︁ | :︁ | ;︁ | ?︁ | 置中 |
歷史
編輯以下的Unicode相關檔案記錄了本區域中出現之字元的提議及定稿。
Unicode 版本 |
最終碼位[a] | 碼位數 | L2 ID | WG2 ID | 文件 |
---|---|---|---|---|---|
1.0.0 | U+FF01..FF5E, FF61..FFBE, FFC2..FFC7, FFCA..FFCF, FFD2..FFD7, FFDA..FFDC, FFE0..FFE6 | 216 | (to be determined) | ||
N4403 (pdf, doc (頁面存檔備份,存於互聯網檔案館)) | Umamaheswaran, V. S., Resolution M61.01, Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14, 2014-01-28 | ||||
L2/17-056 | Lunde, Ken, Proposal to add standardized variation sequences, 2017-02-13 | ||||
L2/17-436 | Lunde, Ken, Proposal to add standardized variation sequences for fullwidth East Asian punctuation, 2018-01-21 | ||||
L2/18-039 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai; Cook, Richard, 24. Fullwidth East Asian Punctuation, Recommendations to UTC #154 January 2018 on Script Proposals, 2018-01-19 | ||||
L2/17-362 | Moore, Lisa, B.4.1 New Proposal to add standardized variation sequence for U+FF10 FULL WIDTH DIGIT ZERO, UTC #153 Minutes, 2018-02-02 | ||||
L2/18-115 | Moore, Lisa, Consensus 154-C17, UTC #155 Minutes, 2018-05-09, Add 16 standardized variation sequences based on L2/17-436R, for Unicode 12.0. | ||||
L2/19-055 | Iancu, Laurențiu, Proposed Changes in the Segmentation Property Values for Fullwidth Digits, 2019-01-14 | ||||
L2/19-008 | Moore, Lisa, B.11.11.1.2 Proposed changes in the segmentation property values for fullwidth digits, UTC #158 Minutes, 2019-02-08 | ||||
1.0.1 | U+FFE8..FFEE | 7 | (to be determined) | ||
3.2 | U+FF5F..FF60 | 2 | L2/99-052 | Freytag, Asmus, The math pieces from the symbol font, 1999-02-05 | |
L2/01-033 | Karlsson, Kent; Freytag, Asmus, Disunify braces/brackets for math, computing science, and Z notation from similar-looking CJK braces/brackets, 2001-01-16 | ||||
L2/01-159 | N2344 (頁面存檔備份,存於互聯網檔案館) | Ad-hoc report on Mathematical Symbols, 2001-04-03 | |||
L2/01-157 | N2345R (頁面存檔備份,存於互聯網檔案館) | Karlsson, Kent, Proposal to disunify certain fencing CJK punctuation marks from similar-looking Math fences, 2001-04-04 | |||
L2/01-168 | Whistler, Ken, Bracket Disunification & Normalization Hell, 2001-04-10 | ||||
L2/01-012R | Moore, Lisa, Disunifying Braces and Brackets, Minutes UTC #86 in Mountain View, Jan 2001, 2001-05-21 | ||||
L2/01-223 | Suignard, Michel, Discussion of Issues Regarding Bracket Disunification, 2001-05-23 | ||||
L2/01-184R | Moore, Lisa, Motion 87-M21, Minutes from the UTC/L2 meeting, 2001-06-18, Reverse the decision made in motion 86-M6 not to disunify brackets. | ||||
L2/01-317 | Suignard, Michel, Bracket Disunification & Normalization, 2001-08-14 | ||||
L2/01-295R | Moore, Lisa, Bracket Disunification and Normalization, Minutes from the UTC/L2 meeting #88, 2001-11-06 | ||||
L2/02-154 | N2403 (頁面存檔備份,存於互聯網檔案館) | Umamaheswaran, V. S., Resolution M41.1, Draft minutes of WG 2 meeting 41, Hotel Phoenix, Singapore, 2001-10-15/19, 2002-04-22 | |||
|
另見
編輯- 全形和半形
- 中日韓符號和標點
- 諺文字母
- 片假名 (Unicode區段)
- Unicode中的拉丁字母
- 圍繞字母和數字 (Unicode區段) —— 部分字元可能在不同字型內顯示為全形(例如 ⒈,⓵,⑴,⒜,ⓐ)
參見
編輯- ^ Unicode 1.0.1 Addendum (PDF). The Unicode Standard. 1992-11-03 [2016-07-09]. (原始內容存檔 (PDF)於2016-07-02).
- ^ Unicode character database. The Unicode Standard. [2016-07-09]. (原始內容存檔於2016-07-10).
- ^ Enumerated Versions of The Unicode Standard. The Unicode Standard. [2016-07-09]. (原始內容存檔於2016-06-29).
- ^ 3.8: Block-by-Block Charts (PDF). The Unicode Standard. version 1.0. Unicode Consortium. [2021-02-04]. (原始內容存檔 (PDF)於2021-02-11).
- ^ ICU Demonstration - Converter Explorer. demo.icu-project.org. [7 May 2018].
- ^ 存档副本. [2021-02-04]. (原始內容存檔於2021-02-16).
- ^ 存档副本. [2021-02-04]. (原始內容存檔於2021-01-16).
- ^ Lunde, Ken. L2/17-436: Proposal to add standardized variation sequences for fullwidth East Asian punctuation (PDF). 2018-01-21 [2021-02-04]. (原始內容存檔 (PDF)於2020-01-31).
- ^ 9.0 9.1 Unicode Character Database: Standardized Variation Sequences. The Unicode Consortium. [2021-02-04]. (原始內容存檔於2021-02-09).
- ^ Beeton, Barbara; Freytag, Asmus; Iancu, Laurențiu; Sargent, Murray. L2/15-268: Proposal to Represent the Slashed Zero Variant of Empty Set (PDF). 2015-10-30 [2021-02-04]. (原始內容存檔 (PDF)於2016-10-21).