孟加拉文 (Unicode区段)
孟加拉文是一个位于基本多文种平面的Unicode区块,收录了用于书写孟加拉语,阿萨姆语, 比什努普里亚-曼尼普尔语,达夫拉语,加罗语,卡西语,米佐语,蒙达里语,那加语,桑塔利语和特里普拉语的孟加拉文字符。在最初的版本中,码位U+0981..U+09CD是对1988年ISCII标准中孟加拉文字符A1-ED的直接复制,两个阿萨姆语字符U+09F0与U+09F1亦在其中。另外,天城文,古木基文,古吉拉特文,卡纳达文,马拉雅拉姆文,奥里亚文,泰米尔文和泰卢固文等Unicode区块也都是基于ISCII编码的。
孟加拉文 Bengali | |
---|---|
范围 | U+0980..U+09FF (128个码位) |
平面 | 基本多文种平面(BMP) |
文字 | 孟加拉文 |
应用 | 孟加拉语 阿萨姆语 比什努普里亚-曼尼普尔语 |
已分配 | 96个码位 |
未分配 | 32个保留码位 |
来源标准 | ISCII |
统一码版本历史 | |
1.0.0 | 89 (+89) |
4.0 | 90 (+1) |
4.1 | 91 (+1) |
5.2 | 92 (+1) |
7.0 | 93 (+1) |
10.0 | 95 (+2) |
11.0 | 96 (+1) |
码表 | |
点击此处 | |
注释:[1][2] |
区块
编辑孟加拉文 Bengali[1][2] Unicode Consortium 官方码表 (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+098x | ঀ | ঁ | ং | ঃ | অ | আ | ই | ঈ | উ | ঊ | ঋ | ঌ | এ | |||
U+099x | ঐ | ও | ঔ | ক | খ | গ | ঘ | ঙ | চ | ছ | জ | ঝ | ঞ | ট | ||
U+09Ax | ঠ | ড | ঢ | ণ | ত | থ | দ | ধ | ন | প | ফ | ব | ভ | ম | য | |
U+09Bx | র | ল | শ | ষ | স | হ | ় | ঽ | া | ি | ||||||
U+09Cx | ী | ু | ূ | ৃ | ৄ | ে | ৈ | ো | ৌ | ্ | ৎ | |||||
U+09Dx | ৗ | ড় | ঢ় | য় | ||||||||||||
U+09Ex | ৠ | ৡ | ৢ | ৣ | ০ | ১ | ২ | ৩ | ৪ | ৫ | ৬ | ৭ | ৮ | ৯ | ||
U+09Fx | ৰ | ৱ | ৲ | ৳ | ৴ | ৵ | ৶ | ৷ | ৸ | ৹ | ৺ | ৻ | ৼ | ৽ | ৾ | |
注释 |
历史
编辑以下Unicode文档记录了与这一区块相关的提案:
版本 | 最终码位[a] | 码位数 | UTC ID | L2 ID | WG2 ID | 文档 |
---|---|---|---|---|---|---|
1.0.0 | U+0981..0983, 0985..098C, 098F..0990, 0993..09A8, 09AA..09B0, 09B2, 09B6..09B9, 09BC, 09BE..09C4, 09C7..09C8, 09CB..09CD, 09D7, 09DC..09DD, 09DF..09E3, 09E6..09FA | 89 | UTC/1991-056 | Whistler, Ken, Indic Charts: Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, Malayalam | ||
UTC/1991-057 | Whistler, Ken, Indic names list | |||||
UTC/1991-048B | Whistler, Ken, III. L. Walk In proposals, Draft Minutes from the UTC meeting #46 day 2, 3/27 at Apple, 1991-03-27 | |||||
L2/01-303 | Vikas, Om, Letter from the Government from India on "Draft for Unicode Standard for Indian Scripts", 2001-07-26 | |||||
L2/01-304 | Feedback on Unicode Standard 3.0, 2001-08-02 | |||||
L2/01-305 | McGowan, Rick, Draft UTC Response to L2/01-304, "Feedback on Unicode Standard 3.0", 2001-08-08 | |||||
L2/01-430R | McGowan, Rick, UTC Response to L2/01-304, "Feedback on Unicode Standard 3.0", 2001-11-20 | |||||
L2/03-113 | Everson, Michael, Conjuncts: making sure we are right, 2003-03-05 | |||||
L2/08-288 | Whistler, Ken, Public Review Issue #123: Bengali Currency Numerator Values, 2008-08-04 | |||||
L2/08-361 | Moore, Lisa, Bengali Currency Numerator Values (B.11.1), UTC #117 Minutes, 2008-12-02 | |||||
L2/09-225R | Moore, Lisa, E.1.2, UTC #120 / L2 #217 Minutes, 2009-08-17 | |||||
L2/20-055 | Pournader, Roozbeh, Proposed sequences for composition exclusions, 2020-01-16 | |||||
L2/20-015 | Moore, Lisa, B.13.1.1 Proposed sequences for composition exclusions, Draft Minutes of UTC Meeting 162, 2020-01-23 | |||||
4.0 | U+09BD | 1 | L2/01-431R[b] | McGowan, Rick, Actions for UTC and Editorial Committee in response to L2/01-430R, 2001-11-08 | ||
L2/01-405R | Moore, Lisa, Consensus 89-C19, Minutes from the UTC/L2 meeting in Mountain View, November 6-9, 2001, 2001-12-12, Accept the twelve Indic characters with names and coding positions as documented in L2/01-431R | |||||
L2/02-117 | N2425 (页面存档备份,存于互联网档案馆) | McGowan, Rick, Additional Characters for Indic Scripts, 2002-03-21 | ||||
L2/03-084 | Jain, Manoj, Proposed changes in the Unicode Standards for Indic Scripts - Bengali, 2003-03-03 | |||||
L2/03-102 | Vikas, Om, Unicode Standard for Indic Scripts, 2003-03-04 | |||||
L2/03-101.1 | Proposed Changes in Indic Scripts [Bengali document], 2003-03-04 | |||||
L2/03-104 | Jain, Manoj, Sample Text for Bengali Sign Avagraha, 2003-03-04 | |||||
L2/04-102 | Pavanaja, U. B., Bug in Kannada collation, 2004-02-10 | |||||
L2/04-432 | Wissink, Cathy, Indic collation: action items 99-20 and 99-29, 2004-12-31 | |||||
4.1 | U+09CE | 1 | L2/00-303 | N2261 (页面存档备份,存于互联网档案馆) | Incorporation of Bangla (Bengali) Coded Character in ISO/IEC 10646-1, 2000-08-23 | |
L2/00-304 | N2261-1 | Proposal Summary Form for character U+09BA, KHANDATA, 2000-08-23 | ||||
L2/01-050 | N2253 (页面存档备份,存于互联网档案馆) | Umamaheswaran, V. S., 7.12 Proposal to synchronize Bengali standard with 10646, Minutes of the SC2/WG2 meeting in Athens, September 2000, 2001-01-21 | ||||
L2/03-084 | Jain, Manoj, Proposed changes in the Unicode Standards for Indic Scripts - Bengali, 2003-03-03 | |||||
L2/03-102 | Vikas, Om, Unicode Standard for Indic Scripts, 2003-03-04 | |||||
L2/03-101.1 | Proposed Changes in Indic Scripts [Bengali document], 2003-03-04 | |||||
L2/04-060 | Sengupta, Gautam, Encoding Bangla Khanda-Ta With Ta+Virama, 2004-02-01 | |||||
L2/04-062 | Constable, Peter, Encoding Bangla Khanda-Ta With Ta+Virama, 2004-02-01 | |||||
L2/04-102 | Pavanaja, U. B., Bug in Kannada collation, 2004-02-10 | |||||
L2/04-262 | N2810 (页面存档备份,存于互联网档案馆) | Constable, Peter, Encoding of Bengali Khanda Ta in Unicode (PRI #30 document), 2004-02-17 | ||||
L2/04-192 | N2811 (页面存档备份,存于互联网档案馆) | Sengupta, Gautam, Feedback on PR-30: Encoding of Bangla Khanda Ta in Unicode, 2004-06-07 | ||||
L2/04-233 | N2812 (页面存档备份,存于互联网档案馆) | Vikas, Om, Letter to Mark Davis re Bengali Khanda Ta, 2004-06-10 | ||||
L2/04-252 | N2813 (页面存档备份,存于互联网档案馆) | Constable, Peter, Review of Bengali Khanda Ta and PRI-30 Feedback, 2004-06-15 | ||||
L2/04-264 | N2809 (页面存档备份,存于互联网档案馆) | Constable, Peter, Proposal to encode Bengali Khanda Ta in the UCS, 2004-06-17 | ||||
L2/04-432 | Wissink, Cathy, Indic collation: action items 99-20 and 99-29, 2004-12-31 | |||||
5.2 | U+09FB | 1 | N3353 (pdf, doc (页面存档备份,存于互联网档案馆)) | Umamaheswaran, V. S., M51.18, Unconfirmed minutes of WG 2 meeting 51 Hanzhou, China; 2007-04-24/27, 2007-10-10 | ||
L2/07-192 | N3311 (页面存档备份,存于互联网档案馆) | Pandey, Anshuman, Proposal to Encode the Ganda Currency Mark for Bengali in the BMP of the UCS, 2007-05-21 | ||||
L2/07-225 | Moore, Lisa, Bengali, UTC #112 Minutes, 2007-08-21 | |||||
7.0 | U+0980 | 1 | L2/11-359 | Pandey, Anshuman, Proposal to Encode the Sign Anji for Bengali, 2011-10-21 | ||
L2/11-403 | Anderson, Deborah; McGowan, Rick; Whistler, Ken, IV. BENGALI, Review of Indic-related L2 documents and Recommendations to the UTC, 2011-10-26 | |||||
L2/11-408 | Lata, Swaran, Letter from Swaran Lata, Gov't of India, re proposals, 2011-10-27 | |||||
L2/12-079 | Lata, Swaran, Inputs of Govt. of India on various documents, 2012-02-07 | |||||
L2/12-121 | N4157 (页面存档备份,存于互联网档案馆) | Pandey, Anshuman, Proposal to Encode the Sign ANJI for Bengali, 2012-04-23 | ||||
L2/12-147 | Anderson, Deborah; McGowan, Rick; Whistler, Ken, VII. BENGALI, Review of Indic-related L2 documents and Recommendations to the UTC, 2012-04-25 | |||||
L2/12-184 | Lata, Swaran, GOI Feedback on the various Indic related documents, 2012-05-07 | |||||
L2/12-277 | Lata, Swaran, GOI Feedback on the various Indic related document submitted to UTC, 2012-07-26 | |||||
10.0 | U+09FC | 1 | L2/15-204 | Anderson, Deborah; et al, 3. Bengali, Recommendations to UTC #144 July 2015 on Script Proposals, 2015-07-25 | ||
L2/15-161 | Sharma, Shriramana, Proposal to encode 09CF BENGALI LETTER VEDIC ANUSVARA, 2015-07-31 | |||||
L2/15-187 | Moore, Lisa, D.6.1, UTC #144 Minutes, 2015-08-11 | |||||
N4739 (页面存档备份,存于互联网档案馆) | M64.06, Unconfirmed minutes of WG 2 meeting 64, 2016-08-31 | |||||
U+09FD | 1 | L2/15-172R | A, Srinidhi; A, Sridatta, Proposal to Encode an Abbreviation Sign for Bengali, 2015-07-09 | |||
L2/15-204 | Anderson, Deborah; et al, 3. Bengali, Recommendations to UTC #144 July 2015 on Script Proposals, 2015-07-25 | |||||
L2/15-187 | Moore, Lisa, D.6.2, UTC #144 Minutes, 2015-08-11 | |||||
N4739 (页面存档备份,存于互联网档案馆) | M64.06, Unconfirmed minutes of WG 2 meeting 64, 2016-08-31 | |||||
11.0 | U+09FE | 1 | L2/16-322 | N4808 (页面存档备份,存于互联网档案馆) | A, Srinidhi; A, Sridatta, Proposal to encode the SANDHI MARK for Bengali, 2016-11-01 | |
L2/17-037 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu; Moore, Lisa; Liang, Hai; Ishida, Richard; Misra, Karan; McGowan, Rick, 4. Bengali, Recommendations to UTC #150 January 2017 on Script Proposals, 2017-01-21 | |||||
L2/17-016 | Moore, Lisa, D.3.2, UTC #150 Minutes, 2017-02-08 | |||||
L2/17-130 | Anderson, Deborah, Comments on L2/16-322 and L2/16-383, Sandhi marks for Bengali and Newa, 2017-04-19 | |||||
L2/17-153 | Anderson, Deborah, 4. Bengali and Newa, Recommendations to UTC #151 May 2017 on Script Proposals, 2017-05-17 | |||||
L2/17-103 | Moore, Lisa, D.3 Sandhi Mark, UTC #151 Minutes, 2017-05-18 | |||||
参考资料
编辑- ^ Unicode character database. The Unicode Standard. [2016-07-09]. (原始内容存档于2017-09-25).
- ^ Enumerated Versions of The Unicode Standard. The Unicode Standard. [2016-07-09]. (原始内容存档于2016-06-29).