pbootcms网站模板|日韩1区2区|织梦模板||网站源码|日韩1区2区|jquery建站特效-html5模板网

  1. <i id='ADd1r'><tr id='ADd1r'><dt id='ADd1r'><q id='ADd1r'><span id='ADd1r'><b id='ADd1r'><form id='ADd1r'><ins id='ADd1r'></ins><ul id='ADd1r'></ul><sub id='ADd1r'></sub></form><legend id='ADd1r'></legend><bdo id='ADd1r'><pre id='ADd1r'><center id='ADd1r'></center></pre></bdo></b><th id='ADd1r'></th></span></q></dt></tr></i><div class="g0im2ao" id='ADd1r'><tfoot id='ADd1r'></tfoot><dl id='ADd1r'><fieldset id='ADd1r'></fieldset></dl></div>
    <tfoot id='ADd1r'></tfoot>
  2. <legend id='ADd1r'><style id='ADd1r'><dir id='ADd1r'><q id='ADd1r'></q></dir></style></legend>
  3. <small id='ADd1r'></small><noframes id='ADd1r'>

      <bdo id='ADd1r'></bdo><ul id='ADd1r'></ul>

    1. 什么是標準化的 UTF-8?

      What is normalized UTF-8 all about?(什么是標準化的 UTF-8?)

      <small id='yW1Aq'></small><noframes id='yW1Aq'>

      1. <legend id='yW1Aq'><style id='yW1Aq'><dir id='yW1Aq'><q id='yW1Aq'></q></dir></style></legend>
        <tfoot id='yW1Aq'></tfoot>
        • <i id='yW1Aq'><tr id='yW1Aq'><dt id='yW1Aq'><q id='yW1Aq'><span id='yW1Aq'><b id='yW1Aq'><form id='yW1Aq'><ins id='yW1Aq'></ins><ul id='yW1Aq'></ul><sub id='yW1Aq'></sub></form><legend id='yW1Aq'></legend><bdo id='yW1Aq'><pre id='yW1Aq'><center id='yW1Aq'></center></pre></bdo></b><th id='yW1Aq'></th></span></q></dt></tr></i><div class="q2u0wgc" id='yW1Aq'><tfoot id='yW1Aq'></tfoot><dl id='yW1Aq'><fieldset id='yW1Aq'></fieldset></dl></div>
              <tbody id='yW1Aq'></tbody>
                <bdo id='yW1Aq'></bdo><ul id='yW1Aq'></ul>
              • 本文介紹了什么是標準化的 UTF-8?的處理方法,對大家解決問題具有一定的參考價值,需要的朋友們下面隨著小編來一起學習吧!

                問題描述

                ICU 項目(現在也有一個 PHP 庫) 包含幫助規范化 UTF-8 字符串所需的類,以便在搜索時更容易比較值.

                The ICU project (which also now has a PHP library) contains the classes needed to help normalize UTF-8 strings to make it easier to compare values when searching.

                但是,我正在嘗試弄清楚這對應用程序意味著什么.例如,在哪些情況下我需要規范等效"而不是兼容性等效",或者反之亦然?

                However, I'm trying to figure out what this means for applications. For example, in which cases do I want "Canonical Equivalence" instead of "Compatibility equivalence", or vis-versa?

                推薦答案

                關于 Unicode 規范化你從未想知道的一切

                規范歸一化

                Unicode 包括多種對某些字符進行編碼的方法,尤其是重音字符.規范歸一化將代碼點更改為規范編碼形式.除字體或渲染引擎中的任何錯誤外,生成的代碼點應與原始代碼點相同.

                Everything You Never Wanted to Know about Unicode Normalization

                Canonical Normalization

                Unicode includes multiple ways to encode some characters, most notably accented characters. Canonical normalization changes the code points into a canonical encoding form. The resulting code points should appear identical to the original ones barring any bugs in the fonts or rendering engine.

                因為結果看起來相同,所以在存儲或顯示字符串之前對字符串應用規范化始終是安全的,只要您能容忍結果與輸入不完全相同.

                Because the results appear identical, it is always safe to apply canonical normalization to a string before storing or displaying it, as long as you can tolerate the result not being bit for bit identical to the input.

                規范標準化有兩種形式:NFD 和 NFC.從可以在這兩種形式之間轉換而不會丟失的意義上說,這兩者是等效的.在 NFC 下比較兩個字符串將始終給出與在 NFD 下比較它們相同的結果.

                Canonical normalization comes in 2 forms: NFD and NFC. The two are equivalent in the sense that one can convert between these two forms without loss. Comparing two strings under NFC will always give the same result as comparing them under NFD.

                NFD 將字符完全展開.這是計算速度更快的歸一化形式,但會產生更多代碼點(即使用更多空間).

                NFD has the characters fully expanded out. This is the faster normalization form to calculate, but the results in more code points (i.e. uses more space).

                如果您只想比較兩個尚未標準化的字符串,這是首選的標準化形式,除非您知道需要兼容性標準化.

                If you just want to compare two strings that are not already normalized, this is the preferred normalization form unless you know you need compatibility normalization.

                NFC 會在運行 NFD 算法后盡可能重新組合代碼點.這需要更長的時間,但會產生更短的字符串.

                NFC recombines code points when possible after running the NFD algorithm. This takes a little longer, but results in shorter strings.

                Unicode 還包括許多真正不屬于的字符,但在遺留字符集中使用.Unicode 添加這些是為了允許將這些字符集中的文本作為 Unicode 處理,然后無損地轉換回來.

                Unicode also includes many characters that really do not belong, but were used in legacy character sets. Unicode added these to allow text in those character sets to be processed as Unicode, and then be converted back without loss.

                兼容性規范化將這些轉換為相應的真實"序列.字符,并且還執行規范歸一化.兼容性規范化的結果可能與原始結果不一致.

                Compatibility normalization converts these to the corresponding sequence of "real" characters, and also performs canonical normalization. The results of compatibility normalization may not appear identical to the originals.

                包含格式信息的字符將替換為不包含的字符.例如,字符 ? 被轉換為 9.其他不涉及格式差異.例如羅馬數字字符被轉換為常規字母IX.

                Characters that include formatting information are replaced with ones that do not. For example the character ? gets converted to 9. Others don't involve formatting differences. For example the roman numeral character is converted to the regular letters IX.

                顯然,一旦執行了這種轉換,就不可能再無損地轉換回原始字符集.

                Obviously, once this transformation has been performed, it is no longer possible to losslessly convert back to the original character set.

                Unicode Consortium 建議將兼容性規范化考慮為 ToUpperCase 轉換.它在某些情況下可能有用,但您不應隨意應用.

                The Unicode Consortium suggests thinking of compatibility normalization like a ToUpperCase transform. It is something that may be useful in some circumstances, but you should not just apply it willy-nilly.

                一個很好的用例是搜索引擎,因為您可能希望搜索 9 以匹配 ?.

                An excellent use case would be a search engine since you would probably want a search for 9 to match ?.

                您可能不應該做的一件事是向用戶顯示應用兼容性規范化的結果.

                One thing you should probably not do is display the result of applying compatibility normalization to the user.

                兼容性規范化形式有NFKD和NFKC兩種形式.它們與 NFD 和 C 之間的關系相同.

                Compatibility normalization form comes in two forms NFKD and NFKC. They have the same relationship as between NFD and C.

                NFKC 中的任何字符串本質上也在 NFC 中,對于 NFKD 和 NFD 也是如此.因此NFKD(x)=NFD(NFKC(x)),和NFKC(x)=NFC(NFKD(x)),等等

                Any string in NFKC is inherently also in NFC, and the same for the NFKD and NFD. Thus NFKD(x)=NFD(NFKC(x)), and NFKC(x)=NFC(NFKD(x)), etc.

                如果有疑問,請使用規范標準化.根據適用的空間/速度權衡選擇 NFC 或 NFD,或根據您要與之互操作的事物的要求選擇 NFC 或 NFD.

                If in doubt, go with canonical normalization. Choose NFC or NFD based on the space/speed trade-off applicable, or based on what is required by something you are inter-operating with.

                這篇關于什么是標準化的 UTF-8?的文章就介紹到這了,希望我們推薦的答案對大家有所幫助,也希望大家多多支持html5模板網!

                【網站聲明】本站部分內容來源于互聯網,旨在幫助大家更快的解決問題,如果有圖片或者內容侵犯了您的權益,請聯系我們刪除處理,感謝您的支持!

                相關文檔推薦

                MySQLi prepared statement amp; foreach loop(MySQLi準備好的語句amp;foreach 循環)
                Is mysqli_insert_id() gets record from whole server or from same user?(mysqli_insert_id() 是從整個服務器還是從同一用戶獲取記錄?)
                PHP MySQLi doesn#39;t recognize login info(PHP MySQLi 無法識別登錄信息)
                mysqli_select_db() expects exactly 2 parameters(mysqli_select_db() 需要 2 個參數)
                Php mysql pdo query: fill up variable with query result(Php mysql pdo 查詢:用查詢結果填充變量)
                MySQLI 28000/1045 Access denied for user #39;root#39;@#39;localhost#39;(MySQLI 28000/1045 用戶“root@“localhost的訪問被拒絕)

                <legend id='uRUDZ'><style id='uRUDZ'><dir id='uRUDZ'><q id='uRUDZ'></q></dir></style></legend>

                <small id='uRUDZ'></small><noframes id='uRUDZ'>

                  <bdo id='uRUDZ'></bdo><ul id='uRUDZ'></ul>

                  • <i id='uRUDZ'><tr id='uRUDZ'><dt id='uRUDZ'><q id='uRUDZ'><span id='uRUDZ'><b id='uRUDZ'><form id='uRUDZ'><ins id='uRUDZ'></ins><ul id='uRUDZ'></ul><sub id='uRUDZ'></sub></form><legend id='uRUDZ'></legend><bdo id='uRUDZ'><pre id='uRUDZ'><center id='uRUDZ'></center></pre></bdo></b><th id='uRUDZ'></th></span></q></dt></tr></i><div class="0kueuy0" id='uRUDZ'><tfoot id='uRUDZ'></tfoot><dl id='uRUDZ'><fieldset id='uRUDZ'></fieldset></dl></div>
                    <tfoot id='uRUDZ'></tfoot>
                      <tbody id='uRUDZ'></tbody>

                          主站蜘蛛池模板: 不干胶标签,不干胶标签纸_厂家-山东同力胶粘制品 | Dataforth隔离信号调理模块-信号放大模块-加速度振动传感器-北京康泰电子有限公司 | 色谱柱-淋洗液罐-巴罗克试剂槽-巴氏吸管-5ml样品瓶-SBS液氮冻存管-上海希言科学仪器有限公司 | 桁架机器人_桁架机械手_上下料机械手_数控车床机械手-苏州清智科技装备制造有限公司 | 智能垃圾箱|垃圾房|垃圾分类亭|垃圾分类箱专业生产厂家定做-宿迁市传宇环保设备有限公司 | 云南标线|昆明划线|道路标线|交通标线-就选云南云路施工公司-云南云路科技有限公司 | 品牌设计_VI设计_电影海报设计_包装设计_LOGO设计-Bacross新越品牌顾问 | 平面钻,法兰钻,三维钻-山东兴田阳光智能装备股份有限公司 | 无菌实验室规划装修设计-一体化实验室承包-北京洁净净化工程建设施工-北京航天科恩实验室装备工程技术有限公司 | 造价工程师网,考试时间查询,报名入口信息-网站首页 | 长江船运_国内海运_内贸船运_大件海运|运输_船舶运输价格_钢材船运_内河运输_风电甲板船_游艇运输_航运货代电话_上海交航船运 | 液氮罐_液氮容器_自增压液氮罐-北京君方科仪科技发展有限公司 | 茶叶百科网-茶叶知识与茶文化探讨分享平台 | 上海租奔驰_上海租商务车_上海租车网-矢昂汽车服务公司 | 定制异形重型钢格栅板/钢格板_定做踏步板/排水沟盖板_钢格栅板批发厂家-河北圣墨金属制品有限公司 | 水质监测站_水质在线分析仪_水质自动监测系统_多参数水质在线监测仪_水质传感器-山东万象环境科技有限公司 | 标准品网_标准品信息网_【中检计量】 | 沧州友城管业有限公司-内外涂塑钢管-大口径螺旋钢管-涂塑螺旋管-保温钢管生产厂家 | 电地暖-电采暖-发热膜-石墨烯电热膜品牌加盟-暖季地暖厂家 | 手持式浮游菌采样器-全排二级生物安全柜-浙江孚夏医疗科技有限公司 | 苏州同创电子有限公司 - 四探针测试仪源头厂家 | 合肥网带炉_安徽箱式炉_钟罩炉-合肥品炙装备科技有限公司 | 旗杆生产厂家_不锈钢锥形旗杆价格_铝合金电动旗杆-上海锥升金属科技有限公司 | 卫生人才网-中国专业的医疗卫生医学人才网招聘网站! | 红外光谱仪维修_二手红外光谱仪_红外压片机_红外附件-天津博精仪器 | 诗词大全-古诗名句 - 古诗词赏析 | 化妆品加工厂-化妆品加工-化妆品代加工-面膜加工-广东欧泉生化科技有限公司 | 5L旋转蒸发器-20L-50L旋转蒸发器-上海越众仪器设备有限公司 | 厂房出售_厂房仓库出租_写字楼招租_土地出售-中苣招商网-中苣招商网 | 碳化硅,氮化硅,冰晶石,绢云母,氟化铝,白刚玉,棕刚玉,石墨,铝粉,铁粉,金属硅粉,金属铝粉,氧化铝粉,硅微粉,蓝晶石,红柱石,莫来石,粉煤灰,三聚磷酸钠,六偏磷酸钠,硫酸镁-皓泉新材料 | 上海办公室装修,办公楼装修设计,办公空间设计,企业展厅设计_写艺装饰公司 | 高扬程排污泵_隔膜泵_磁力泵_节能自吸离心水泵厂家-【上海博洋】 | 酒店厨房设计_中央厨房设计_北京商用厨房设计公司-奇能商厨 | 上海电子秤厂家,电子秤厂家价格,上海吊秤厂家,吊秤供应价格-上海佳宜电子科技有限公司 | 阳光模拟试验箱_高低温试验箱_高低温冲击试验箱_快速温变试验箱|东莞市赛思检测设备有限公司 | 上海软件开发-上海软件公司-软件外包-企业软件定制开发公司-咏熠科技 | 上海皓越真空设备有限公司官网-真空炉-真空热压烧结炉-sps放电等离子烧结炉 | 长信科技产业园官网_西安厂房_陕西标准工业厂房 | 沧州友城管业有限公司-内外涂塑钢管-大口径螺旋钢管-涂塑螺旋管-保温钢管生产厂家 | 电脑知识|软件|系统|数据库|服务器|编程开发|网络运营|知识问答|技术教程文章 - 好吧啦网 | 早报网|