mysql进阶
1.数据类型
mysql的数据类型众多,但是从大的分类上,可以分为下面几种类型
- 整数,tinyint、smallint、mediumint、int、bigint
- 定点数,decimal(m,n)
- 浮点数,float(m,d)、real(m,d)、DOUBLE PRECISION(M,D)
- bit
- 字符串:CHAR、VARCHAR、BINARY、VARBINARY、BLOB、TEXT、ENUM和SET
- 日期,date、time、datetime、timestamp、year
在使用字段类型时,我们一般的原则都是用最小满足原则,比如能用tinyint就不用int
2.mysql存储引擎
show storage engines;
+--------------------+---------+----------------------------------------------------------------+--------------+------+------------+
| Engine             | Support | Comment                                                        | Transactions | XA   | Savepoints |
+--------------------+---------+----------------------------------------------------------------+--------------+------+------------+
| FEDERATED          | NO      | Federated MySQL storage engine                                 | NULL         | NULL | NULL       |
| MRG_MYISAM         | YES     | Collection of identical MyISAM tables                          | NO           | NO   | NO         |
| MyISAM             | YES     | MyISAM storage engine                                          | NO           | NO   | NO         |
| BLACKHOLE          | YES     | /dev/null storage engine (anything you write to it disappears) | NO           | NO   | NO         |
| CSV                | YES     | CSV storage engine                                             | NO           | NO   | NO         |
| MEMORY             | YES     | Hash based, stored in memory, useful for temporary tables      | NO           | NO   | NO         |
| ARCHIVE            | YES     | Archive storage engine                                         | NO           | NO   | NO         |
| PERFORMANCE_SCHEMA | YES     | Performance Schema                                             | NO           | NO   | NO         |
| InnoDB             | DEFAULT | Supports transactions, row-level locking, and foreign keys     | YES          | YES  | YES        |
+--------------------+---------+----------------------------------------------------------------+--------------+------+------------+
在这么多的存引擎中,我们常用的有两种,一种是InnoDB,另外一种是MyISAM;其中InnoDB支持事务,大多数情况下我们都使用该引擎;如果数据仅仅是查询,变更的很少,不需要事务的支持,那么可以采用MyISAM,查询性能会更高
3.mysql执行计划
执行计划,简单的来说,是SQL在数据库中执行时的表现情况,通常用于SQL性能分析,优化等场景
我们直接看一个sql的执行计划
mysql> explain select a.* from member a
    -> inner join memberinfo b on a.mid = b.mid
    -> where
    ->     a.mid in (select mid from product where pid>10000);
+----+--------------------+---------+----------------+---------------------------+-------------------+---------+-------------------+--------+--------------------------+
| id | select_type        | table   | type           | possible_keys             | key               | key_len | ref               | rows   | Extra                    |
+----+--------------------+---------+----------------+---------------------------+-------------------+---------+-------------------+--------+--------------------------+
|  1 | PRIMARY            | b       | index          | PRIMARY,PK_memberinfo_mid | PK_memberinfo_mid | 4       | NULL              | 186174 | Using where; Using index |
|  1 | PRIMARY            | a       | eq_ref         | PRIMARY                   | PRIMARY           | 4       | ucunion_ovs.b.mid |      1 |                          |
|  2 | DEPENDENT SUBQUERY | product | index_subquery | PRIMARY,key_mid           | key_mid           | 4       | func              | 105562 | Using index; Using where |
+----+--------------------+---------+----------------+---------------------------+-------------------+---------+-------------------+--------+--------------------------+
在上面的表格中,怎么解读呢
- id:表示查询中select操作表的顺序,按顺序从大到小依次执行,比如上面的sql中,in的子查询会先执行
- select_type:表示选择类型,常见可选择有:SIMPLE(简单的), PRIMARY(最外层) ,SUBQUERY(子查询)
- type:表示访问类型,常见有:ALL(全表扫描),index(所以扫描),range(范围扫描),ref(非唯一索引扫描)
- table,数据所在的表
- possible_keys,可能使用的索引
- key:实际使用的索引
- key_len:索引列所用的字节数
- ref:连接匹配条件,如果走主键索引的话,该值为: const, 全表扫描的话,为null值
- row,扫描的行数,行数越少,查询效率就会越高,我们的优化大部分都是为了降低该值
- extra:这个属性非常重要,该属性中包括执行SQL时的真实情况信息,常用的有"Using temporary",使用临时表;"using filesort": 使用文件排序
4.mysql的索引
关于mysql索引的细节非常多,不打算展开,可以参考下面的内容
mysql索引
5.mysql类型转换
在java中,我们都知道int与Integer的自动转化关系,同理在mysql中也有相应的类型转换,程序总是相通的...
比如mysql中,int与varchar会隐形转换
除了使用mysql的隐形转换,还可以CAST函数进行显示的类型转换,支持的数据类型如下
+----------------------+
| Database             |
+----------------------+
|     date             |
|     datetime         |
|     time             |
|     decimal          |
|     char             |
|     nchar            |
|     signed           |
|     unsigned         |
|     binary           |
|     json             |
+----------------------+
mysql> select cast('2018-12-12' as date);
+----------------------------+
| cast('2018-12-12' as date) |
+----------------------------+
| 2018-12-12                 |
+----------------------------+
6.mysql常见的坑
- 创建表时,数据类型错误,比如id使用了uuid
- 主外键字段类型不一致,导致join的性能低
- 没有给字段、表添加合适的注释
- 创建表时没有添加索引,项目初始时,由于数据小,感觉不到性能问题,但是当项目运行一段时间后,数据规模如果增长过快,但是没能及时添加索引,后续查询性能问题严重,但是添加索引的成本非常高,比如在2000W的表中,要做表的变更会非常麻烦
- 索引加错列,使用下面的命令,可以快速统计某个列的数据离散程度
mysql> select count(distinct(mid))/count(1) from member;
+-------------------------------+
| count(distinct(mid))/count(1) |
+-------------------------------+
|                        1.0000 |
+-------------------------------+
1 row in set (0.51 sec)
mysql> select count(distinct(mgid))/count(1) from member;
+--------------------------------+
| count(distinct(mgid))/count(1) |
+--------------------------------+
|                         0.0001 |
+--------------------------------+
在这两个列中,我们发现mid的离散程度是100%,也就是该列适合做为索引,而mgid列的离散程度却非常低,不适合作为索引
- 不该加索引的列添加了索引,索引是很需要占用存储空间的,并且索引越多,插入和update、delete的性能也会受到影响
7.10个使用的mysql命令
- show databases
+----------------------+
| Database             |
+----------------------+
| information_schema   |
| 100.84.72.153        |
| autotest             |
| data_generate        |
| dps_infobright       |
+----------------------+
- use database_name
- show tables
+---------------------------------------------------+
| Tables_in_dps_stat                                |
+---------------------------------------------------+
| dps_report_publisher_pubid_pid_country_subpub_day |
| dps_scheduler_x_dataset                           |
| dps_scheduler_x_dataset_instance                  |
| dps_scheduler_x_job                               |
| dps_scheduler_x_job_instance                      |
+---------------------------------------------------+
- show full columns from table_name
+---------------------------------------------------------+
| Tables_in_ucunion_ovs                                   |
+---------------------------------------------------------+
| 0601_memberinfo                                         |
| 0601_product                                            |
| 0601_publisher                                          |
| 0601_publisher_info                                     |
| 1_bak_config_keyvalue                                   |
| INS_DATA_STATUS                                         |
| STAT_UCWEB_GJ_USR_INC_HIVE_NEW_FR_VER_COU_CH            |
+---------------------------------------------------------+
- select version()
- select current_user()
- show table status like "table_name"
+--------+--------+---------+------------+--------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+-------------+------------+-----------------+----------+----------------+-----------+
| Name   | Engine | Version | Row_format | Rows   | Avg_row_length | Data_length | Max_data_length | Index_length | Data_free | Auto_increment | Create_time         | Update_time | Check_time | Collation       | Checksum | Create_options | Comment   |
+--------+--------+---------+------------+--------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+-------------+------------+-----------------+----------+----------------+-----------+
| member | InnoDB |      10 | Compact    | 192938 |            198 |    38338560 |               0 |     17891328 |   7340032 |         197256 | 2018-07-16 10:29:50 | NULL        | NULL       | utf8_general_ci |     NULL |                | 用户表    |
+--------+--------+---------+------------+--------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+-------------+------------+-----------------+----------+----------------+-----------+
- show processlist
- show index from table_name
+--------+------------+-------------------+--------------+-------------+-----------+-------------+----------+--------+------+------------+---------+---------------+
| Table  | Non_unique | Key_name          | Seq_in_index | Column_name | Collation | Cardinality | Sub_part | Packed | Null | Index_type | Comment | Index_comment |
+--------+------------+-------------------+--------------+-------------+-----------+-------------+----------+--------+------+------------+---------+---------------+
| member |          0 | PRIMARY           |            1 | mid         | A         |      192646 |     NULL | NULL   |      | BTREE      |         |               |
| member |          0 | PK_member_account |            1 | account     | A         |      192646 |     NULL | NULL   |      | BTREE      |         |               |
| member |          1 | PK_member_mgid    |            1 | mgid        | A         |           6 |     NULL | NULL   |      | BTREE      |         |               |
| member |          1 | index_amid        |            1 | amid        | A         |           6 |     NULL | NULL   |      | BTREE      |         |               |
| member |          1 | index_amgid       |            1 | amgid       | A         |           6 |     NULL | NULL   |      | BTREE      |         |               |
+--------+------------+-------------------+--------------+-------------+-----------+-------------+----------+--------+------+------------+---------+---------------+
查看表的索引,其中有个很重要的概念,叫做:Cardinality
这个值越大,那么该列的索引效果越好;比如在上面的索引中,amid,mgid,amgid都是不适合作为索引列
- explain select * from lh_user where created_at>"2017-12-09"
+----+-------------+-------+-------+---------------------------+---------+---------+-------+------+-------------+
| id | select_type | table | type  | possible_keys             | key     | key_len | ref   | rows | Extra       |
+----+-------------+-------+-------+---------------------------+---------+---------+-------+------+-------------+
|  1 | SIMPLE      | a     | const | PRIMARY                   | PRIMARY | 4       | const |    1 |             |
|  1 | SIMPLE      | b     | const | PRIMARY,PK_memberinfo_mid | PRIMARY | 4       | const |    1 | Using index |
+----+-------------+-------+-------+---------------------------+---------+---------+-------+------+-------------+