諺文音節

諺文音節是一個位於基本多文種平面的Unicode區塊,包含了現代標準語中使用的預組諺文音節。這些音節可以通過算法直接映射到諺文字母區塊中的兩個或三個字符的序列:

  • 位於U+1100..U+1112的字符:19個現代諺文的初聲輔音字母;
  • 位於U+1161..U+1175的字符:21個現代諺文的元音字母;
  • 無,或位於U+11A8..U+11C2的字符:27個現代諺文的終聲輔音字母。
諺文音節
Hangul Syllables
範圍U+AC00..U+D7AF
(11,184個碼位)
平面基本多文種平面()
文字諺文
應用朝鮮語
已分配11,172個碼位
未分配12個保留碼位
來源標準KS X 1001
Unicode版本歷史
2.011,172 (+11,172)
官方碼表
點擊此處
註釋[1][2]
在1.1版本中,6656個字符編碼於U+3400U+4DFF。但他們在2.0版本時被移至現在的碼位,並額外增加了4516個字符。

本區塊是根據2個或3個諺文字母(上述每個諺文字母子範圍中的一個)组成的每一個音节的規範順序編碼的。

區塊

諺文音節
Hangul Syllables[1][2]
Unicode Consortium 官方碼表 (PDF)
 0123456789ABCDEF
U+AC0x
U+AC1x
U+AC2x
U+AC3x 갿
U+AC4x
U+AC5x
U+AC6x
U+AC7x 걿
U+AC8x
U+AC9x
U+ACAx
U+ACBx 겿
U+ACCx
U+ACDx
U+ACEx
U+ACFx 곿
U+AD0x
U+AD1x
U+AD2x
U+AD3x 괿
U+AD4x
U+AD5x
U+AD6x
U+AD7x 굿
U+AD8x
U+AD9x
U+ADAx
U+ADBx 궿
U+ADCx
U+ADDx
U+ADEx
U+ADFx 귿
U+AE0x
U+AE1x
U+AE2x
U+AE3x 긿
U+AE4x
U+AE5x
U+AE6x
U+AE7x 깿
U+AE8x
U+AE9x
U+AEAx
U+AEBx 꺿
U+AECx
U+AEDx
U+AEEx
U+AEFx 껿
U+AF0x
U+AF1x
U+AF2x
U+AF3x 꼿
U+AF4x
U+AF5x
U+AF6x
U+AF7x 꽿
U+AF8x
U+AF9x
U+AFAx
U+AFBx 꾿
U+AFCx
U+AFDx
U+AFEx
U+AFFx 꿿
U+B00x 뀀
U+B01x
U+B02x
U+B03x
U+B04x
U+B05x
U+B06x
U+B07x
U+B08x
U+B09x
U+B0Ax
U+B0Bx
U+B0Cx
U+B0Dx
U+B0Ex
U+B0Fx
U+B10x
U+B11x
U+B12x
U+B13x
U+B14x
U+B15x
U+B16x
U+B17x
U+B18x
U+B19x
U+B1Ax
U+B1Bx
U+B1Cx
U+B1Dx
U+B1Ex
U+B1Fx
U+B20x
U+B21x
U+B22x
U+B23x
U+B24x
U+B25x
U+B26x
U+B27x
U+B28x
U+B29x
U+B2Ax
U+B2Bx
U+B2Cx
U+B2Dx
U+B2Ex
U+B2Fx
U+B30x
U+B31x
U+B32x
U+B33x
U+B34x
U+B35x
U+B36x
U+B37x
U+B38x
U+B39x
U+B3Ax
U+B3Bx
U+B3Cx
U+B3Dx
U+B3Ex
U+B3Fx
U+B40x
U+B41x
U+B42x
U+B43x
U+B44x
U+B45x
U+B46x
U+B47x
U+B48x
U+B49x
U+B4Ax
U+B4Bx
U+B4Cx
U+B4Dx
U+B4Ex
U+B4Fx
U+B50x
U+B51x
U+B52x
U+B53x
U+B54x
U+B55x
U+B56x
U+B57x
U+B58x
U+B59x
U+B5Ax
U+B5Bx
U+B5Cx
U+B5Dx
U+B5Ex
U+B5Fx
U+B60x
U+B61x
U+B62x
U+B63x
U+B64x
U+B65x
U+B66x
U+B67x
U+B68x
U+B69x
U+B6Ax
U+B6Bx
U+B6Cx
U+B6Dx
U+B6Ex
U+B6Fx
U+B70x
U+B71x
U+B72x
U+B73x
U+B74x
U+B75x
U+B76x
U+B77x
U+B78x
U+B79x
U+B7Ax
U+B7Bx
U+B7Cx
U+B7Dx
U+B7Ex
U+B7Fx
U+B80x
U+B81x
U+B82x
U+B83x
U+B84x
U+B85x
U+B86x
U+B87x
U+B88x
U+B89x
U+B8Ax
U+B8Bx
U+B8Cx
U+B8Dx
U+B8Ex
U+B8Fx
U+B90x
U+B91x
U+B92x
U+B93x 뤿
U+B94x
U+B95x
U+B96x
U+B97x 륿
U+B98x
U+B99x
U+B9Ax
U+B9Bx 릿
U+B9Cx
U+B9Dx
U+B9Ex
U+B9Fx 맿
U+BA0x
U+BA1x
U+BA2x
U+BA3x 먿
U+BA4x
U+BA5x
U+BA6x
U+BA7x 멿
U+BA8x
U+BA9x
U+BAAx
U+BABx 몿
U+BACx
U+BADx
U+BAEx
U+BAFx 뫿
U+BB0x
U+BB1x
U+BB2x
U+BB3x 묿
U+BB4x
U+BB5x
U+BB6x
U+BB7x 뭿
U+BB8x
U+BB9x
U+BBAx
U+BBBx 뮿
U+BBCx
U+BBDx
U+BBEx
U+BBFx 믿
U+BC0x
U+BC1x
U+BC2x
U+BC3x 밿
U+BC4x
U+BC5x
U+BC6x
U+BC7x 뱿
U+BC8x
U+BC9x
U+BCAx
U+BCBx 벿
U+BCCx
U+BCDx
U+BCEx
U+BCFx 볿
U+BD0x
U+BD1x
U+BD2x
U+BD3x 봿
U+BD4x
U+BD5x
U+BD6x
U+BD7x 뵿
U+BD8x
U+BD9x
U+BDAx
U+BDBx 붿
U+BDCx
U+BDDx
U+BDEx
U+BDFx 뷿
U+BE0x
U+BE1x
U+BE2x
U+BE3x 븿
U+BE4x
U+BE5x
U+BE6x
U+BE7x 빿
U+BE8x
U+BE9x
U+BEAx
U+BEBx 뺿
U+BECx
U+BEDx
U+BEEx
U+BEFx 뻿
U+BF0x
U+BF1x
U+BF2x
U+BF3x 뼿
U+BF4x
U+BF5x
U+BF6x
U+BF7x 뽿
U+BF8x
U+BF9x
U+BFAx
U+BFBx 뾿
U+BFCx
U+BFDx
U+BFEx
U+BFFx 뿿
U+C00x 쀀
U+C01x
U+C02x
U+C03x
U+C04x
U+C05x
U+C06x
U+C07x
U+C08x
U+C09x
U+C0Ax
U+C0Bx
U+C0Cx
U+C0Dx
U+C0Ex
U+C0Fx
U+C10x
U+C11x
U+C12x
U+C13x
U+C14x
U+C15x
U+C16x
U+C17x
U+C18x
U+C19x
U+C1Ax
U+C1Bx
U+C1Cx
U+C1Dx
U+C1Ex
U+C1Fx
U+C20x
U+C21x
U+C22x
U+C23x
U+C24x
U+C25x
U+C26x
U+C27x
U+C28x
U+C29x
U+C2Ax
U+C2Bx
U+C2Cx
U+C2Dx
U+C2Ex
U+C2Fx
U+C30x
U+C31x
U+C32x
U+C33x
U+C34x
U+C35x
U+C36x
U+C37x
U+C38x
U+C39x
U+C3Ax
U+C3Bx
U+C3Cx
U+C3Dx
U+C3Ex
U+C3Fx
U+C40x
U+C41x
U+C42x
U+C43x
U+C44x
U+C45x
U+C46x
U+C47x
U+C48x
U+C49x
U+C4Ax
U+C4Bx
U+C4Cx
U+C4Dx
U+C4Ex
U+C4Fx
U+C50x
U+C51x
U+C52x
U+C53x
U+C54x
U+C55x
U+C56x
U+C57x
U+C58x
U+C59x
U+C5Ax
U+C5Bx
U+C5Cx
U+C5Dx
U+C5Ex
U+C5Fx
U+C60x
U+C61x
U+C62x
U+C63x
U+C64x
U+C65x
U+C66x
U+C67x
U+C68x
U+C69x
U+C6Ax
U+C6Bx
U+C6Cx
U+C6Dx
U+C6Ex
U+C6Fx
U+C70x
U+C71x
U+C72x
U+C73x
U+C74x
U+C75x
U+C76x
U+C77x
U+C78x
U+C79x
U+C7Ax
U+C7Bx
U+C7Cx
U+C7Dx
U+C7Ex
U+C7Fx
U+C80x
U+C81x
U+C82x
U+C83x
U+C84x
U+C85x
U+C86x
U+C87x
U+C88x
U+C89x
U+C8Ax
U+C8Bx
U+C8Cx
U+C8Dx
U+C8Ex
U+C8Fx
U+C90x
U+C91x
U+C92x
U+C93x 줿
U+C94x
U+C95x
U+C96x
U+C97x 쥿
U+C98x
U+C99x
U+C9Ax
U+C9Bx 즿
U+C9Cx
U+C9Dx
U+C9Ex
U+C9Fx 짿
U+CA0x
U+CA1x
U+CA2x
U+CA3x 쨿
U+CA4x
U+CA5x
U+CA6x
U+CA7x 쩿
U+CA8x
U+CA9x
U+CAAx
U+CABx 쪿
U+CACx
U+CADx
U+CAEx
U+CAFx 쫿
U+CB0x
U+CB1x
U+CB2x
U+CB3x 쬿
U+CB4x
U+CB5x
U+CB6x
U+CB7x 쭿
U+CB8x
U+CB9x
U+CBAx
U+CBBx 쮿
U+CBCx
U+CBDx
U+CBEx
U+CBFx 쯿
U+CC0x
U+CC1x
U+CC2x
U+CC3x 찿
U+CC4x
U+CC5x
U+CC6x
U+CC7x 챿
U+CC8x
U+CC9x
U+CCAx
U+CCBx 첿
U+CCCx
U+CCDx
U+CCEx
U+CCFx 쳿
U+CD0x
U+CD1x
U+CD2x
U+CD3x 촿
U+CD4x
U+CD5x
U+CD6x
U+CD7x 쵿
U+CD8x
U+CD9x
U+CDAx
U+CDBx 춿
U+CDCx
U+CDDx
U+CDEx
U+CDFx 췿
U+CE0x
U+CE1x
U+CE2x
U+CE3x 츿
U+CE4x
U+CE5x
U+CE6x
U+CE7x 칿
U+CE8x
U+CE9x
U+CEAx
U+CEBx 캿
U+CECx
U+CEDx
U+CEEx
U+CEFx 컿
U+CF0x
U+CF1x
U+CF2x
U+CF3x 켿
U+CF4x
U+CF5x
U+CF6x
U+CF7x 콿
U+CF8x
U+CF9x
U+CFAx
U+CFBx 쾿
U+CFCx
U+CFDx
U+CFEx
U+CFFx 쿿
U+D00x 퀀
U+D01x
U+D02x
U+D03x
U+D04x
U+D05x
U+D06x
U+D07x
U+D08x
U+D09x
U+D0Ax
U+D0Bx
U+D0Cx
U+D0Dx
U+D0Ex
U+D0Fx
U+D10x
U+D11x
U+D12x
U+D13x
U+D14x
U+D15x
U+D16x
U+D17x
U+D18x
U+D19x
U+D1Ax
U+D1Bx
U+D1Cx
U+D1Dx
U+D1Ex
U+D1Fx
U+D20x
U+D21x
U+D22x
U+D23x
U+D24x
U+D25x
U+D26x
U+D27x
U+D28x
U+D29x
U+D2Ax
U+D2Bx
U+D2Cx
U+D2Dx
U+D2Ex
U+D2Fx
U+D30x
U+D31x
U+D32x
U+D33x
U+D34x
U+D35x
U+D36x
U+D37x
U+D38x
U+D39x
U+D3Ax
U+D3Bx
U+D3Cx
U+D3Dx
U+D3Ex
U+D3Fx
U+D40x
U+D41x
U+D42x
U+D43x
U+D44x
U+D45x
U+D46x
U+D47x
U+D48x
U+D49x
U+D4Ax
U+D4Bx
U+D4Cx
U+D4Dx
U+D4Ex
U+D4Fx
U+D50x
U+D51x
U+D52x
U+D53x
U+D54x
U+D55x
U+D56x
U+D57x
U+D58x
U+D59x
U+D5Ax
U+D5Bx
U+D5Cx
U+D5Dx
U+D5Ex
U+D5Fx
U+D60x
U+D61x
U+D62x
U+D63x
U+D64x
U+D65x
U+D66x
U+D67x
U+D68x
U+D69x
U+D6Ax
U+D6Bx
U+D6Cx
U+D6Dx
U+D6Ex
U+D6Fx
U+D70x
U+D71x
U+D72x
U+D73x
U+D74x
U+D75x
U+D76x
U+D77x
U+D78x
U+D79x
U+D7Ax
註釋
1.^ 依據 Unicode 14.0
2.^ 灰色區域表示未分配的碼位

歷史

碼位的移動使在Unicode中編碼諺文音節一事變得複雜了起來:

  • 版本1.0.0時,在U+3400..U+3D2D編碼了2350個來自KS C 5601-1987的現代諺文音節。而碼位U+3400..U+3D2D現在是中日韓統一表意文字擴展區A的一部分。
  • 版本1.1時,在U+3D2E..U+44B7編碼了額外1930個來自KS C 5657-1991的現代諺文音節,在U+44B8..U+44BD編碼了6個來自GB 12052-89的現代諺文音節,在U+44BE..U+4DFF編碼了2370個不在以上標準的的現代諺文音節。這些字符的碼位範圍涵蓋了中日韓統一表意文字擴展區A的剩餘部分及易經六爻符號的全部。
    • 另外,版本1.1尚有3處錯誤:[3]
      • U+384E:삤在Unicode字符數據庫中,但삣在ISO/IEC 10646-1:1993碼表和源標準的映射中。
      • U+40BC:삣在Unicode字符數據庫中,但삤在ISO/IEC 10646-1:1993碼表和源標準的映射中。
      • U+436C:콫在Unicode字符數據庫中,但콪在ISO/IEC 10646-1:1993碼表和源標準的映射中。
  • 版本2.0增加了KS C 5601-1992中剩餘的4516個可能的音節,並將所有已編碼的音節重新排列[4][5]到目前的碼位範圍U+AC00..U+D7AF內,這樣就可以通過算法將諺文音節分解為單個的諺文字母。

RFC 2279解釋說,這一重大的不兼容變化是在假設不存在使用Unicode的諺文數據或軟件的情況下做出的:

“允許這種不兼容變化的官方理由是,不存在包含諺文的應用和數據,這種說法很可能是真的,但仍然無法證實。 這一事件被稱爲‘Korean mess’,相關委員會已承諾永遠不會再做出這種不兼容的變化。” — RFC 2279
"The official justification for allowing such an incompatible change was that no implementations and no data containing Hangul existed, a statement that is likely to be true but remains unprovable. The incident has been dubbed the 'Korean mess', and the relevant committees have pledged to never, ever again make such an incompatible change." — RFC 2279

隨後,Unicode採用了一項編碼穩定性政策,即“一旦一个字符被編碼,它將不會被移動或刪除”。[6]

以下Unicode文檔記錄了定義本區塊中特定字符的目的與過程:

版本最終碼位[lower-alpha 1]碼位数UTC IDL2 IDWG2 ID文檔
2.0U+AC00..D7A311,172N767 页面存档备份,存于Ksar, Mike, , 1991-11-25
X3L2/93-078N848, 1992-07-02
UTC/1994-xxx, , 1994-09-30
X3L2/95-031N1158, 1995-03-08
N1170, 1995-03-10
UTC/1995-021BAliprand, Joan, , 1995-03-10
N1198, 1995-04-05
X3L2/95-053.1N1199, 1995-04-30
N1203 页面存档备份,存于Umamaheswaran, V. S.; Ksar, Mike, , , 1995-05-03
X3L2/95-053N1209, 1995-05-09
UTC/1995-xxx, , 1995-06-02
X3L2/95-090N1253 (doc, txt 页面存档备份,存于)Umamaheswaran, V. S.; Ksar, Mike, , , 1995-09-09
N1285, 1995-11-08
N1303 (html, doc 页面存档备份,存于)Umamaheswaran, V. S.; Ksar, Mike, , , 1996-01-26
N1331Paterson, Bruce, , 1996-03-14
N1332Paterson, Bruce, , 1996-03-14
N1391Paterson, Bruce, , 1996-05-18
N1353 页面存档备份,存于Umamaheswaran, V. S.; Ksar, Mike, , , 1996-06-25
N1537, 1997-01-29
L2/97-125N1561Paterson, Bruce, , 1997-05-27
N1570Paterson, Bruce, , 1997-06-23
L2/97-288N1603 页面存档备份,存于Umamaheswaran, V. S., , , 1997-10-24
N1806 (pdf, doc 页面存档备份,存于)Kim, Kyongsok; Paterson, Bruce, , 1998-07-08
L2/99-022N1942 页面存档备份,存于Paterson, Bruce, , 1998-12-08
L2/99-010N1903 (pdf, html 页面存档备份,存于, doc 页面存档备份,存于)Umamaheswaran, V. S., , , 1998-12-30
L2/99-114N2018 页面存档备份,存于Paterson, Bruce, , 1999-03-31
L2/99-232N2003 页面存档备份,存于Umamaheswaran, V. S., , , 1999-08-03
L2/99-297N2119 页面存档备份,存于, 1999-09-20
L2/99-298N2120 页面存档备份,存于Paterson, Bruce, , 1999-09-21
L2/03-100Edberg, Peter, , 2002-11-05
L2/02-463N2564 页面存档备份,存于Kim, Kyongsok, , 2002-11-30
L2/04-392West, Andrew, , 2004-10-21
L2/04-361Moore, Lisa, , , 2004-11-23
L2/17-080Chung, Jaemin, , 2017-03-29
  1. 建議的碼位和字符名稱可能與最終的結果不同。

另見

參考資料

  1. . The Unicode Standard. [2016-07-09]. (原始内容存档于2017-09-25).
  2. . The Unicode Standard. [2016-07-09]. (原始内容存档于2016-06-29).
  3. Chung, Jaemin. (PDF). 2017-03-29 [2022-05-31]. (原始内容 (PDF)存档于2022-04-20).
  4. Chang, K. D.; Choi, In Sook; Kim, Jung Ho. . 1995-10-04 [2022-05-31]. (原始内容存档于2022-04-20).
  5. . 2005-10-13 [2022-05-31]. (原始内容存档于2021-10-06).
  6. . Unicode Consortium. 2016-11-14 [2022-05-31]. (原始内容存档于2022-07-14).
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.