在查询的优化中为什么要永远小表驱动大表？-营销专业-111升学论坛

吉祥素食 发表于 2021-2-22 21:40:40

在查询的优化中为什么要永远小表驱动大表？

1.为什么要小表驱动大表呢

类似循环嵌套
for(int i=5;.......)
{
for(int j=1000;......)
{}
}如果小的循环在外层，对于数据库连接来说就只连接5次，进行5000次操作，如果1000在外，则需要进行1000次数据库连接，从而浪费资源，增加消耗。这就是为什么要小表驱动大表。
比如：我们在tb_dept_bigdata表中插入100条数据，在tb_emp_bigdata表中插入5000条数据。

注：100个部门，5000个员工。tb_dept_bigdata（小表），tb_emp_bigdata（大表）。
①当B表的数据集小于A表数据集时，用in由于exists。
select *from tb_emp_bigdata A where A.deptno in (select B.deptno from tb_dept_bigdata B)B表为tb_dept_bigdata：100条数据，A表tb_emp_bigdata：5000条数据。
用in的查询时间为：

经对比可看到，在B表数据集小于A表的时候，用in要由于exists，当前的数据集并不大，所以查询时间相差并不多。
②当A表的数据集小于B表的数据集时，用exists由于in。
select *from tb_dept_bigdata A where A.deptno in(select B.deptno from tb_emp_bigdata B);用in的查询时间为：

将上面sql转换成exists：
select *from tb_dept_bigdata A where exists(select 1 from tb_emp_bigdata B where B.deptno=A.deptno);用exists的查询时间：

由于数据量并不是很大，因此对比并不是难么的强烈。
附上结论截图：

.总结

下面结论都是针对in或exists的。
in后面跟的是小表，exists后面跟的是大表。
简记：in小，exists大。
对于exists
select .....from table where exists(subquery);
可以理解为：将主查询的数据放入子查询中做条件验证，根据验证结果（true或false）来决定主查询的数据是否得以保留。

微凉小妹 发表于 2021-2-22 21:44:57

开头的连接五次，，，你从哪里资料得来的。

五四一九四九 发表于 2021-2-22 21:49:14

这个没绝对，还是要看有没有索引的。如果大表没索引，小表有，肯定大表快

mmm小超子 发表于 2021-2-22 21:53:31

转发了

Ne_home新_ 发表于 2021-2-22 21:57:48

转发了

流浪sombody 发表于 2021-2-22 22:02:05

转发了

hyally 发表于 2021-2-22 22:06:22

转发了

吴韵之风 发表于 2021-2-22 22:10:39

转发了

结木※弥耶 发表于 2021-2-22 22:14:56

转发了

凤凰号 发表于 2021-2-22 22:19:13

转发了

页: [1] 2

111升学论坛's Archiver

在查询的优化中为什么要永远小表驱动大表？