MySQL REGEXP: problem matching Chinese characters?

    c4f36e5766583218 · 2019-03-27 11:03:35 +08:00 · 3453 views
    SELECT
    	'测试好友' REGEXP '[123456789 一二三]',
    	'测试好友' REGEXP '[123456789 一二三四]',
    	'测试好友' REGEXP '[123456789 一二三五]' 
    FROM
    	DUAL;
    

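    The output is 0, 1, 0. Why is that? Are Chinese characters compared by the first letter of their pinyin?

    The REGEXP patterns I actually ran contain no spaces; the spaces above were inserted automatically when the post was saved. The result is the same with or without them.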

    3 replies · last reply 2019-03-27 17:31:22 +08:00

    #1 · c4f36e5766583218 (OP) · 2019-03-27 17:14:41 +08:00
    https://mariadb.com/kb/en/library/pcre/
    ```text
    Also, REGEXP/RLIKE, and the new functions, now work correctly with all multi-byte character sets supported by MariaDB, including East-Asian character sets (big5, gb2313, gbk, eucjp, eucjpms, cp932, ujis, euckr), and Unicode character sets (utf8, utf8mb4, ucs2, utf16, utf16le, utf32). In earlier versions of MariaDB (and all MySQL versions) REGEXP/RLIKE works correctly only with 8-bit character sets.
    ```

    #2 · c4f36e5766583218 (OP) · 2019-03-27 17:17:13 +08:00
    https://dev.mysql.com/doc/refman/8.0/en/regexp.html
    ```text
    MySQL implements regular expression support using International Components for Unicode (ICU), which provides full Unicode support and is multibyte safe. (Prior to MySQL 8.0.4, MySQL used Henry Spencer's implementation of regular expressions, which operates in byte-wise fashion and is not multibyte safe. For information about ways in which applications that use regular expressions may be affected by the implementation change, see Regular Expression Compatibility Considerations.)
    ```
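
To make the "byte-wise" point concrete, here is a minimal Python sketch that imitates how a byte-oriented engine would evaluate the three bracket expressions from the question. It assumes the strings are encoded as UTF-8 (the thread does not state the connection character set), and `bytewise_class_match` is a hypothetical helper written for illustration, not a MySQL function. Under that assumption the 0, 1, 0 result is reproduced: 四 encodes to `e5 9b 9b`, and its lead byte `e5` also starts 好 (`e5 a5 bd`) and 友 (`e5 8f 8b`), so the second pattern matches on a single shared byte. Pinyin plays no role.

```python
def bytewise_class_match(subject: str, members: str, encoding: str = "utf-8") -> int:
    """Imitate a byte-oriented engine evaluating subject REGEXP '[members]'.

    Hypothetical helper: the bracket expression is treated as a set of
    single bytes, so the match succeeds as soon as any byte of the
    subject appears in that set. Only literal members are handled.
    """
    class_bytes = set(members.encode(encoding))
    return int(any(b in class_bytes for b in subject.encode(encoding)))


subject = "测试好友"
extras = ("一二三", "一二三四", "一二三五")
results = [bytewise_class_match(subject, "123456789" + extra) for extra in extras]
print(results)  # [0, 1, 0]

# The byte that causes the unexpected hit: 四 and 好 share the lead byte e5.
print("四".encode("utf-8").hex(), "好".encode("utf-8").hex())  # e59b9b e5a5bd
```

With a different character set (for example gbk) the byte values differ, so the set of accidental matches would differ as well.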

    #3 · c4f36e5766583218 (OP) · 2019-03-27 17:31:22 +08:00
    Starting with MariaDB 10.0.5, MariaDB switched to the PCRE regular expression library for enhanced regular expressions.
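
For comparison, a character-aware engine treats each Chinese character in the bracket expression as one member, so none of the three patterns should match '测试好友'. The sketch below uses Python's built-in `re` module purely as a stand-in for that behaviour; it is neither PCRE nor ICU, and the expectation that MariaDB 10.0.5+ or MySQL 8.0.4+ would return 0, 0, 0 for the original query is an inference from the documentation quoted above, not something tested in this thread.

```python
import re

subject = "测试好友"
patterns = [
    "[123456789一二三]",
    "[123456789一二三四]",
    "[123456789一二三五]",
]

# Python's re works on code points for str input, so a bracket
# expression is a set of whole characters rather than bytes.
char_results = [int(re.search(p, subject) is not None) for p in patterns]
print(char_results)  # [0, 0, 0]

# Positive control: a class that really contains one of the subject's characters.
print(int(re.search("[测一二三]", subject) is not None))  # 1
```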