PostgreSQL 9種索引的原理和應用場景

標簽

PostgreSQL , btree , hash , gin , gist , sp-gist , brin , bloom , rum , zombodb , bitmap

背景

PostgreSQL 擁有眾多開放特性，例如

1、開放的數據類型接口，使得PG支持超級豐富的數據類型，除了傳統數據庫支持的類型，還支持GIS，JSON，RANGE，IP，ISBN，圖像特征值，化學，DNA等等擴展的類型，用戶還可以根據實際業務擴展更多的類型。

2、開放的操作符接口，使得PG不僅僅支持常見的類型操作符，還支持擴展的操作符，例如距離符，邏輯並、交、差符號，圖像相似符號，幾何計算符號等等擴展的符號，用戶還可以根據實際業務擴展更多的操作符。

3、開放的外部數據源接口，使得PG支持豐富的外部數據源，例如可以通過FDW讀寫MySQL, redis, mongo, oracle, sqlserver, hive, www, hbase, ldap, 等等隻要你能想到的數據源都可以通過FDW接口讀寫。

4、開放的語言接口，使得PG支持幾乎地球上所有的編程語言作為數據庫的函數、存儲過程語言，例如plpython , plperl , pljava , plR , plCUDA , plshell等等。用戶可以通過language handler擴展PG的語言支持。

5、開放的索引接口，使得PG支持非常豐富的索引方法，例如btree , hash , gin , gist , sp-gist , brin , bloom , rum , zombodb , bitmap (greenplum extend)，用戶可以根據不同的數據類型，以及查詢的場景，選擇不同的索引。

6、PG內部還支持BitmapAnd, BitmapOr的優化方法，可以合並多個索引的掃描操作，從而提升多個索引數據訪問的效率。

不同的索引接口針對的數據類型、業務場景是不一樣的，接下來針對每一種索引，介紹一下它的原理和應用場景。

一、btree

原理

《深入淺出PostgreSQL B-Tree索引結構》

應用場景

b-tree適合所有的數據類型，支持排序，支持大於、小於、等於、大於或等於、小於或等於的搜索。

索引與遞歸查詢結合，還能實現快速的稀疏檢索。

《PostgrSQL 遞歸SQL的幾個應用 - 極客與正常人的思維》

例子

postgres=# create table t_btree(id int, info text);  
CREATE TABLE  
postgres=# insert into t_btree select generate_series(1,10000), md5(random()::text) ;  
INSERT 0 10000  
postgres=# create index idx_t_btree_1 on t_btree using btree (id);  
CREATE INDEX  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_btree where id=1;  
                                                          QUERY PLAN                                                             
-------------------------------------------------------------------------------------------------------------------------------  
 Index Scan using idx_t_btree_1 on public.t_btree  (cost=0.29..3.30 rows=1 width=37) (actual time=0.027..0.027 rows=1 loops=1)  
   Output: id, info  
   Index Cond: (t_btree.id = 1)  
   Buffers: shared hit=1 read=2  
 Planning time: 0.292 ms  
 Execution time: 0.050 ms  
(6 rows)

二、hash

原理

src/backend/access/hash/README

（hash index entries store only the hash code, not the actual data value, for each indexed item. ）

應用場景

hash索引存儲的是被索引字段VALUE的哈希值，隻支持等值查詢。

hash索引特別適用於字段VALUE非常長（不適合b-tree索引，因為b-tree一個PAGE至少要存儲3個ENTRY，所以不支持特別長的VALUE）的場景，例如很長的字符串，並且用戶隻需要等值搜索，建議使用hash index。

例子

postgres=# create table t_hash (id int, info text);  
CREATE TABLE  
postgres=# insert into t_hash select generate_series(1,100), repeat(md5(random()::text),10000);  
INSERT 0 100  
  
-- 使用b-tree索引會報錯，因為長度超過了1/3的索引頁大小
postgres=# create index idx_t_hash_1 on t_hash using btree (info);  
ERROR:  index row size 3720 exceeds maximum 2712 for index "idx_t_hash_1"  
HINT:  Values larger than 1/3 of a buffer page cannot be indexed.  
Consider a function index of an MD5 hash of the value, or use full text indexing.  
  
postgres=# create index idx_t_hash_1 on t_hash using hash (info);  
CREATE INDEX  
  
postgres=# set enable_hashjoin=off;  
SET  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_hash where info in (select info from t_hash limit 1);  
                                                             QUERY PLAN                                                                
-------------------------------------------------------------------------------------------------------------------------------------  
 Nested Loop  (cost=0.03..3.07 rows=1 width=22) (actual time=0.859..0.861 rows=1 loops=1)  
   Output: t_hash.id, t_hash.info  
   Buffers: shared hit=11  
   ->  HashAggregate  (cost=0.03..0.04 rows=1 width=18) (actual time=0.281..0.281 rows=1 loops=1)  
         Output: t_hash_1.info  
         Group Key: t_hash_1.info  
         Buffers: shared hit=3  
         ->  Limit  (cost=0.00..0.02 rows=1 width=18) (actual time=0.012..0.012 rows=1 loops=1)  
               Output: t_hash_1.info  
               Buffers: shared hit=1  
               ->  Seq Scan on public.t_hash t_hash_1  (cost=0.00..2.00 rows=100 width=18) (actual time=0.011..0.011 rows=1 loops=1)  
                     Output: t_hash_1.info  
                     Buffers: shared hit=1  
   ->  Index Scan using idx_t_hash_1 on public.t_hash  (cost=0.00..3.02 rows=1 width=22) (actual time=0.526..0.527 rows=1 loops=1)  
         Output: t_hash.id, t_hash.info  
         Index Cond: (t_hash.info = t_hash_1.info)  
         Buffers: shared hit=6  
 Planning time: 0.159 ms  
 Execution time: 0.898 ms  
(19 rows)

三、gin

原理

gin是倒排索引，存儲被索引字段的VALUE或VALUE的元素，以及行號的list或tree。

（ col_val:(tid_list or tid_tree) ， col_val_elements:(tid_list or tid_tree) ）

《PostgreSQL GIN索引實現原理》

《寶劍贈英雄 - 任意組合字段等效查詢, 探探PostgreSQL多列展開式B樹 (GIN)》

應用場景

1、當需要搜索多值類型內的VALUE時，適合多值類型，例如數組、全文檢索、TOKEN。（根據不同的類型，支持相交、包含、大於、在左邊、在右邊等搜索）

2、當用戶的數據比較稀疏時，如果要搜索某個VALUE的值，可以適應btree_gin支持普通btree支持的類型。（支持btree的操作符）

3、當用戶需要按任意列進行搜索時，gin支持多列展開單獨建立索引域，同時支持內部多域索引的bitmapAnd, bitmapOr合並，快速的返回按任意列搜索請求的數據。

例子

1、多值類型搜索

postgres=# create table t_gin1 (id int, arr int[]);  
CREATE TABLE  
  
postgres=# do language plpgsql $$  
postgres$# declare  
postgres$# begin  
postgres$#   for i in 1..10000 loop  
postgres$#     insert into t_gin1 select i, array(select random()*1000 from generate_series(1,10));  
postgres$#   end loop;  
postgres$# end;  
postgres$# $$;  
DO  
postgres=# select * from t_gin1 limit 3;  
 id |                    arr                      
----+-------------------------------------------  
  1 | {128,700,814,592,414,838,615,827,274,210}  
  2 | {284,452,824,556,132,121,21,705,537,865}  
  3 | {65,185,586,872,627,330,574,227,827,64}  
(3 rows)  
  
postgres=# create index idx_t_gin1_1 on t_gin1 using gin (arr);  
CREATE INDEX  
  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_gin1 where arr && array[1,2];  
                                                       QUERY PLAN                                                          
-------------------------------------------------------------------------------------------------------------------------  
 Bitmap Heap Scan on public.t_gin1  (cost=8.93..121.24 rows=185 width=65) (actual time=0.058..0.207 rows=186 loops=1)  
   Output: id, arr  
   Recheck Cond: (t_gin1.arr && '{1,2}'::integer[])  
   Heap Blocks: exact=98  
   Buffers: shared hit=103  
   ->  Bitmap Index Scan on idx_t_gin1_1  (cost=0.00..8.89 rows=185 width=0) (actual time=0.042..0.042 rows=186 loops=1)  
         Index Cond: (t_gin1.arr && '{1,2}'::integer[])  
         Buffers: shared hit=5  
 Planning time: 0.208 ms  
 Execution time: 0.245 ms  
(10 rows)  
  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_gin1 where arr @> array[1,2];  
                                                     QUERY PLAN                                                        
---------------------------------------------------------------------------------------------------------------------  
 Bitmap Heap Scan on public.t_gin1  (cost=7.51..9.02 rows=1 width=65) (actual time=0.022..0.022 rows=0 loops=1)  
   Output: id, arr  
   Recheck Cond: (t_gin1.arr @> '{1,2}'::integer[])  
   Buffers: shared hit=5  
   ->  Bitmap Index Scan on idx_t_gin1_1  (cost=0.00..7.51 rows=1 width=0) (actual time=0.020..0.020 rows=0 loops=1)  
         Index Cond: (t_gin1.arr @> '{1,2}'::integer[])  
         Buffers: shared hit=5  
 Planning time: 0.116 ms  
 Execution time: 0.044 ms  
(9 rows)

2、單值稀疏數據搜索

postgres=# create extension btree_gin;  
CREATE EXTENSION  
postgres=# create table t_gin2 (id int, c1 int);  
CREATE TABLE  
postgres=# insert into t_gin2 select generate_series(1,100000), random()*10 ;  
INSERT 0 100000  
postgres=# create index idx_t_gin2_1 on t_gin2 using gin (c1);  
CREATE INDEX  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_gin2 where c1=1;  
                                                         QUERY PLAN                                                            
-----------------------------------------------------------------------------------------------------------------------------  
 Bitmap Heap Scan on public.t_gin2  (cost=84.10..650.63 rows=9883 width=8) (actual time=0.925..3.685 rows=10078 loops=1)  
   Output: id, c1  
   Recheck Cond: (t_gin2.c1 = 1)  
   Heap Blocks: exact=443  
   Buffers: shared hit=448  
   ->  Bitmap Index Scan on idx_t_gin2_1  (cost=0.00..81.62 rows=9883 width=0) (actual time=0.867..0.867 rows=10078 loops=1)  
         Index Cond: (t_gin2.c1 = 1)  
         Buffers: shared hit=5  
 Planning time: 0.252 ms  
 Execution time: 4.234 ms  
(10 rows)

3、多列任意搜索

postgres=# create table t_gin3 (id int, c1 int, c2 int, c3 int, c4 int, c5 int, c6 int, c7 int, c8 int, c9 int);  
CREATE TABLE  
postgres=# insert into t_gin3 select generate_series(1,100000), random()*10, random()*20, random()*30, random()*40, random()*50, random()*60, random()*70, random()*80, random()*90;  
INSERT 0 100000  
postgres=# create index idx_t_gin3_1 on t_gin3 using gin (c1,c2,c3,c4,c5,c6,c7,c8,c9);  
CREATE INDEX  
  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_gin3 where c1=1 or c2=1 and c3=1 or c4=1 and (c6=1 or c7=2) or c8=9 or c9=10;  
                                                                                              QUERY PLAN                                                                                                 
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------  
 Bitmap Heap Scan on public.t_gin3  (cost=154.03..1364.89 rows=12286 width=40) (actual time=1.931..5.634 rows=12397 loops=1)  
   Output: id, c1, c2, c3, c4, c5, c6, c7, c8, c9  
   Recheck Cond: ((t_gin3.c1 = 1) OR ((t_gin3.c2 = 1) AND (t_gin3.c3 = 1)) OR (((t_gin3.c4 = 1) AND (t_gin3.c6 = 1)) OR ((t_gin3.c4 = 1) AND (t_gin3.c7 = 2))) OR (t_gin3.c8 = 9) OR (t_gin3.c9 = 10))  
   Heap Blocks: exact=834  
   Buffers: shared hit=867  
   ->  BitmapOr  (cost=154.03..154.03 rows=12562 width=0) (actual time=1.825..1.825 rows=0 loops=1)  
         Buffers: shared hit=33  
         ->  Bitmap Index Scan on idx_t_gin3_1  (cost=0.00..83.85 rows=9980 width=0) (actual time=0.904..0.904 rows=10082 loops=1)  
               Index Cond: (t_gin3.c1 = 1)  
               Buffers: shared hit=6  
         ->  Bitmap Index Scan on idx_t_gin3_1  (cost=0.00..9.22 rows=172 width=0) (actual time=0.355..0.355 rows=164 loops=1)  
               Index Cond: ((t_gin3.c2 = 1) AND (t_gin3.c3 = 1))  
               Buffers: shared hit=8  
         ->  BitmapOr  (cost=21.98..21.98 rows=83 width=0) (actual time=0.334..0.334 rows=0 loops=1)  
               Buffers: shared hit=13  
               ->  Bitmap Index Scan on idx_t_gin3_1  (cost=0.00..7.92 rows=42 width=0) (actual time=0.172..0.172 rows=36 loops=1)  
                     Index Cond: ((t_gin3.c4 = 1) AND (t_gin3.c6 = 1))  
                     Buffers: shared hit=6  
               ->  Bitmap Index Scan on idx_t_gin3_1  (cost=0.00..7.91 rows=41 width=0) (actual time=0.162..0.162 rows=27 loops=1)  
                     Index Cond: ((t_gin3.c4 = 1) AND (t_gin3.c7 = 2))  
                     Buffers: shared hit=7  
         ->  Bitmap Index Scan on idx_t_gin3_1  (cost=0.00..14.38 rows=1317 width=0) (actual time=0.124..0.124 rows=1296 loops=1)  
               Index Cond: (t_gin3.c8 = 9)  
               Buffers: shared hit=3  
         ->  Bitmap Index Scan on idx_t_gin3_1  (cost=0.00..12.07 rows=1010 width=0) (actual time=0.102..0.102 rows=1061 loops=1)  
               Index Cond: (t_gin3.c9 = 10)  
               Buffers: shared hit=3  
 Planning time: 0.272 ms  
 Execution time: 6.349 ms  
(29 rows)

四、gist

原理

GiST stands for Generalized Search Tree.

It is a balanced, tree-structured access method, that acts as a base template in which to implement arbitrary indexing schemes.

B-trees, R-trees and many other indexing schemes can be implemented in GiST.

《從難纏的模煳查詢聊開 - PostgreSQL獨門絕招之一 GIN , GiST , SP-GiST , RUM 索引原理與技術背景》

應用場景

GiST是一個通用的索引接口，可以使用GiST實現b-tree, r-tree等索引結構。

不同的類型，支持的索引檢索也各不一樣。例如：

1、幾何類型，支持位置搜索（包含、相交、在上下左右等），按距離排序。

2、範圍類型，支持位置搜索（包含、相交、在左右等）。

3、IP類型，支持位置搜索（包含、相交、在左右等）。

4、空間類型（PostGIS），支持位置搜索（包含、相交、在上下左右等），按距離排序。

5、標量類型，支持按距離排序。

《PostgreSQL 百億地理位置數據近鄰查詢性能》

例子

1、幾何類型檢索

postgres=# create table t_gist (id int, pos point);  
CREATE TABLE  
postgres=# insert into t_gist select generate_series(1,100000), point(round((random()*1000)::numeric, 2), round((random()*1000)::numeric, 2));  
INSERT 0 100000  
postgres=# select * from t_gist  limit 3;  
 id |       pos         
----+-----------------  
  1 | (325.43,477.07)  
  2 | (257.65,710.94)  
  3 | (502.42,582.25)  
(3 rows)  
postgres=# create index idx_t_gist_1 on t_gist using gist (pos);  
CREATE INDEX  
  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_gist where circle '((100,100) 10)'  @> pos;  
                                                       QUERY PLAN                                                         
------------------------------------------------------------------------------------------------------------------------  
 Bitmap Heap Scan on public.t_gist  (cost=2.55..125.54 rows=100 width=20) (actual time=0.072..0.132 rows=46 loops=1)  
   Output: id, pos  
   Recheck Cond: ('<(100,100),10>'::circle @> t_gist.pos)  
   Heap Blocks: exact=41  
   Buffers: shared hit=47  
   ->  Bitmap Index Scan on idx_t_gist_1  (cost=0.00..2.53 rows=100 width=0) (actual time=0.061..0.061 rows=46 loops=1)  
         Index Cond: ('<(100,100),10>'::circle @> t_gist.pos)  
         Buffers: shared hit=6  
 Planning time: 0.147 ms  
 Execution time: 0.167 ms  
(10 rows)  
  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_gist where circle '((100,100) 1)' @> pos order by pos <-> '(100,100)' limit 10;  
                                                              QUERY PLAN                                                                 
---------------------------------------------------------------------------------------------------------------------------------------  
 Limit  (cost=0.28..14.60 rows=10 width=28) (actual time=0.045..0.048 rows=2 loops=1)  
   Output: id, pos, ((pos <-> '(100,100)'::point))  
   Buffers: shared hit=5  
   ->  Index Scan using idx_t_gist_1 on public.t_gist  (cost=0.28..143.53 rows=100 width=28) (actual time=0.044..0.046 rows=2 loops=1)  
         Output: id, pos, (pos <-> '(100,100)'::point)  
         Index Cond: ('<(100,100),1>'::circle @> t_gist.pos)  
         Order By: (t_gist.pos <-> '(100,100)'::point)  
         Buffers: shared hit=5  
 Planning time: 0.092 ms  
 Execution time: 0.076 ms  
(10 rows)

2、標量類型排序

postgres=# create extension btree_gist;  
CREATE EXTENSION  
  
postgres=# create index idx_t_btree_2 on t_btree using gist(id);  
CREATE INDEX  
  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_btree order by id <-> 100 limit 1;  
                                                                QUERY PLAN                                                                   
-------------------------------------------------------------------------------------------------------------------------------------------  
 Limit  (cost=0.15..0.19 rows=1 width=41) (actual time=0.046..0.046 rows=1 loops=1)  
   Output: id, info, ((id <-> 100))  
   Buffers: shared hit=3  
   ->  Index Scan using idx_t_btree_2 on public.t_btree  (cost=0.15..408.65 rows=10000 width=41) (actual time=0.045..0.045 rows=1 loops=1)  
         Output: id, info, (id <-> 100)  
         Order By: (t_btree.id <-> 100)  
         Buffers: shared hit=3  
 Planning time: 0.085 ms  
 Execution time: 0.076 ms  
(9 rows)

五、sp-gist

原理

SP-GiST is an abbreviation for space-partitioned GiST.

SP-GiST supports partitioned search trees, which facilitate development of a wide range of different non-balanced data structures, such as quad-trees, k-d trees, and radix trees (tries).

The common feature of these structures is that they repeatedly divide the search space into partitions that need not be of equal size.

Searches that are well matched to the partitioning rule can be very fast.

SP-GiST類似GiST，是一個通用的索引接口，但是SP-GIST使用了空間分區的方法，使得SP-GiST可以更好的支持非平衡數據結構，例如quad-trees, k-d tree, radis tree.

《Space-partitioning trees in PostgreSQL》

《SP-GiST for PostgreSQL User Manual》

應用場景

1、幾何類型，支持位置搜索（包含、相交、在上下左右等），按距離排序。

2、範圍類型，支持位置搜索（包含、相交、在左右等）。

3、IP類型，支持位置搜索（包含、相交、在左右等）。

例子

1、範圍類型搜索

postgres=# create table t_spgist (id int, rg int4range);  
CREATE TABLE  
  
postgres=# insert into t_spgist select id, int4range(id, id+(random()*200)::int) from generate_series(1,100000) t(id);  
INSERT 0 100000  
postgres=# select * from t_spgist  limit 3;  
 id |   rg      
----+---------  
  1 | [1,138)  
  2 | [2,4)  
  3 | [3,111)  
(3 rows)  
  
postgres=# set maintenance_work_mem ='32GB';  
SET  
postgres=# create index idx_t_spgist_1 on t_spgist using spgist (rg);  
CREATE INDEX  
  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_spgist where rg && int4range(1,100);  
                                                       QUERY PLAN                                                          
-------------------------------------------------------------------------------------------------------------------------  
 Bitmap Heap Scan on public.t_spgist  (cost=2.55..124.30 rows=99 width=17) (actual time=0.059..0.071 rows=99 loops=1)  
   Output: id, rg  
   Recheck Cond: (t_spgist.rg && '[1,100)'::int4range)  
   Heap Blocks: exact=1  
   Buffers: shared hit=6  
   ->  Bitmap Index Scan on idx_t_spgist_1  (cost=0.00..2.52 rows=99 width=0) (actual time=0.043..0.043 rows=99 loops=1)  
         Index Cond: (t_spgist.rg && '[1,100)'::int4range)  
         Buffers: shared hit=5  
 Planning time: 0.133 ms  
 Execution time: 0.111 ms  
(10 rows)  
  
postgres=# set enable_bitmapscan=off;  
SET  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_spgist where rg && int4range(1,100);  
                                                             QUERY PLAN                                                                
-------------------------------------------------------------------------------------------------------------------------------------  
 Index Scan using idx_t_spgist_1 on public.t_spgist  (cost=0.28..141.51 rows=99 width=17) (actual time=0.021..0.051 rows=99 loops=1)  
   Output: id, rg  
   Index Cond: (t_spgist.rg && '[1,100)'::int4range)  
   Buffers: shared hit=8  
 Planning time: 0.097 ms  
 Execution time: 0.074 ms  
(6 rows)

六、brin

原理

BRIN 索引是塊級索引，有別於B-TREE等索引，BRIN記錄並不是以行號為單位記錄索引明細，而是記錄每個數據塊或者每段連續的數據塊的統計信息。因此BRIN索引空間占用特別的小，對數據寫入、更新、刪除的影響也很小。

BRIN屬於LOSSLY索引，當被索引列的值與物理存儲相關性很強時，BRIN索引的效果非常的好。

例如時序數據，在時間或序列字段創建BRIN索引，進行等值、範圍查詢時效果很棒。

應用場景

《BRIN (block range index) index》

《PostgreSQL 物聯網黑科技 - 瘦身幾百倍的索引(BRIN index)》

《PostgreSQL 聚集存儲與 BRIN索引 - 高並發行為、軌跡類大吞吐數據查詢場景解說》

《PostgreSQL 並行寫入堆表，如何保證時序線性存儲 - BRIN索引優化》

例子

postgres=# create table t_brin (id int, info text, crt_time timestamp);  
CREATE TABLE  
postgres=# insert into t_brin select generate_series(1,1000000), md5(random()::text), clock_timestamp();  
INSERT 0 1000000  
  
postgres=# select ctid,* from t_brin limit 3;  
 ctid  | id |               info               |          crt_time            
-------+----+----------------------------------+----------------------------  
 (0,1) |  1 | e48a6cd688b6cc8e86ee858fa993b31b | 2017-06-27 22:50:19.172224  
 (0,2) |  2 | e79c335c679b0bf544e8ba5f01569df7 | 2017-06-27 22:50:19.172319  
 (0,3) |  3 | b75ec6db320891a620097164b751e682 | 2017-06-27 22:50:19.172323  
(3 rows)  
  
  
postgres=# select correlation from pg_stats where tablename='t_brin' and attname='id';  
 correlation   
-------------  
           1  
(1 row)  
  
postgres=# select correlation from pg_stats where tablename='t_brin' and attname='crt_time';  
 correlation   
-------------  
           1  
(1 row)  
  
postgres=# create index idx_t_brin_1 on t_brin using brin (id) with (pages_per_range=1);  
CREATE INDEX  
postgres=# create index idx_t_brin_2 on t_brin using brin (crt_time) with (pages_per_range=1);  
CREATE INDEX  
  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_brin where id between 100 and 200;  
                                                       QUERY PLAN                                                          
-------------------------------------------------------------------------------------------------------------------------  
 Bitmap Heap Scan on public.t_brin  (cost=43.52..199.90 rows=74 width=45) (actual time=1.858..1.876 rows=101 loops=1)  
   Output: id, info, crt_time  
   Recheck Cond: ((t_brin.id >= 100) AND (t_brin.id <= 200))  
   Rows Removed by Index Recheck: 113  
   Heap Blocks: lossy=2  
   Buffers: shared hit=39  
   ->  Bitmap Index Scan on idx_t_brin_1  (cost=0.00..43.50 rows=107 width=0) (actual time=1.840..1.840 rows=20 loops=1)  
         Index Cond: ((t_brin.id >= 100) AND (t_brin.id <= 200))  
         Buffers: shared hit=37  
 Planning time: 0.174 ms  
 Execution time: 1.908 ms  
(11 rows)  
  
postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_brin where crt_time between '2017-06-27 22:50:19.172224' and '2017-06-27 22:50:19.182224';  
                                                                                       QUERY PLAN                                                                                          
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------  
 Bitmap Heap Scan on public.t_brin  (cost=59.63..4433.67 rows=4474 width=45) (actual time=1.860..2.603 rows=4920 loops=1)  
   Output: id, info, crt_time  
   Recheck Cond: ((t_brin.crt_time >= '2017-06-27 22:50:19.172224'::timestamp without time zone) AND (t_brin.crt_time <= '2017-06-27 22:50:19.182224'::timestamp without time zone))  
   Rows Removed by Index Recheck: 2  
   Heap Blocks: lossy=46  
   Buffers: shared hit=98  
   ->  Bitmap Index Scan on idx_t_brin_2  (cost=0.00..58.51 rows=4494 width=0) (actual time=1.848..1.848 rows=460 loops=1)  
         Index Cond: ((t_brin.crt_time >= '2017-06-27 22:50:19.172224'::timestamp without time zone) AND (t_brin.crt_time <= '2017-06-27 22:50:19.182224'::timestamp without time zone))  
         Buffers: shared hit=52  
 Planning time: 0.091 ms  
 Execution time: 2.884 ms  
(11 rows)

七、rum

原理

https://github.com/postgrespro/rum

rum 是一個索引插件，由Postgrespro開源，適合全文檢索，屬於GIN的增強版本。

增強包括：

1、在RUM索引中，存儲了lexem的位置信息，所以在計算ranking時，不需要回表查詢（而GIN需要回表查詢）。

2、RUM支持phrase搜索，而GIN無法支持。

3、在一個RUM索引中，允許用戶在posting tree中存儲除ctid（行號）以外的字段VALUE，例如時間戳。

這使得RUM不僅支持GIN支持的全文檢索，還支持計算文本的相似度值，按相似度排序等。同時支持位置匹配，例如（速度與激情，可以采用"速度" <2> "激情" 進行匹配，而GIN索引則無法做到）

位置信息如下

postgres=# select to_tsvector('english', 'hello digoal');
     to_tsvector      
----------------------
 'digoal':2 'hello':1
(1 row)

postgres=# select to_tsvector('english', 'hello i digoal');
     to_tsvector      
----------------------
 'digoal':3 'hello':1
(1 row)

postgres=# select to_tsvector('english', 'hello i am digoal');
     to_tsvector      
----------------------
 'digoal':4 'hello':1
(1 row)

postgres=# select to_tsquery('english', 'hello <1> digoal');
      to_tsquery      
----------------------
 'hello' <-> 'digoal'
(1 row)

postgres=# select to_tsquery('english', 'hello <2> digoal');
      to_tsquery      
----------------------
 'hello' <2> 'digoal'
(1 row)

postgres=# select to_tsquery('english', 'hello <3> digoal');
      to_tsquery      
----------------------
 'hello' <3> 'digoal'
(1 row)

postgres=# select to_tsvector('hello digoal') @@ to_tsquery('english', 'hello <1> digoal');
 ?column? 
----------
 t
(1 row)

postgres=# select to_tsvector('hello digoal') @@ to_tsquery('english', 'hello <2> digoal');
 ?column? 
----------
 f
(1 row)

postgres=# select to_tsvector('hello i digoal') @@ to_tsquery('english', 'hello <2> digoal');
 ?column? 
----------
 t
(1 row)

應用場景

《PostgreSQL 全文檢索加速快到沒有朋友 - RUM索引接口(潘多拉魔盒)》

《從難纏的模煳查詢聊開 - PostgreSQL獨門絕招之一 GIN , GiST , SP-GiST , RUM 索引原理與技術背景》

例子

postgres=# create table rum_test(c1 tsvector);  
CREATE TABLE  
  
postgres=# CREATE INDEX rumidx ON rum_test USING rum (c1 rum_tsvector_ops);  
CREATE INDEX  
  
$ vi test.sql  
insert into rum_test select to_tsvector(string_agg(c1::text,',')) from  (select (100000*random())::int from generate_series(1,100)) t(c1);  
  
$ pgbench -M prepared -n -r -P 1 -f ./test.sql -c 50 -j 50 -t 200000  
  
postgres=# explain analyze select * from rum_test where c1 @@ to_tsquery('english','1 | 2') order by c1 <=> to_tsquery('english','1 | 2') offset 19000 limit 100;  
                                                               QUERY PLAN                                                                  
-----------------------------------------------------------------------------------------------------------------------------------------  
 Limit  (cost=18988.45..19088.30 rows=100 width=1391) (actual time=58.912..59.165 rows=100 loops=1)  
   ->  Index Scan using rumidx on rum_test  (cost=16.00..99620.35 rows=99749 width=1391) (actual time=16.426..57.892 rows=19100 loops=1)  
         Index Cond: (c1 @@ '''1'' | ''2'''::tsquery)  
         Order By: (c1 <=> '''1'' | ''2'''::tsquery)  
 Planning time: 0.133 ms  
 Execution time: 59.220 ms  
(6 rows)  
  
postgres=# create table test15(c1 tsvector);  
CREATE TABLE  
postgres=# insert into test15 values (to_tsvector('jiebacfg', 'hello china, i''m digoal')), (to_tsvector('jiebacfg', 'hello world, i''m postgresql')), (to_tsvector('jiebacfg', 'how are you, i''m digoal'));  
INSERT 0 3  
postgres=# select * from test15;  
                         c1                            
-----------------------------------------------------  
 ' ':2,5,9 'china':3 'digoal':10 'hello':1 'm':8  
 ' ':2,5,9 'hello':1 'm':8 'postgresql':10 'world':3  
 ' ':2,4,7,11 'digoal':12 'm':10  
(3 rows)  
postgres=# create index idx_test15 on test15 using rum(c1 rum_tsvector_ops);  
CREATE INDEX  
postgres=# select *,c1 <=> to_tsquery('hello') from test15;  
                         c1                          | ?column?   
-----------------------------------------------------+----------  
 ' ':2,5,9 'china':3 'digoal':10 'hello':1 'm':8     |  16.4493  
 ' ':2,5,9 'hello':1 'm':8 'postgresql':10 'world':3 |  16.4493  
 ' ':2,4,7,11 'digoal':12 'm':10                     | Infinity  
(3 rows)  
postgres=# explain select *,c1 <=> to_tsquery('postgresql') from test15 order by c1 <=> to_tsquery('postgresql');  
                                   QUERY PLAN                                     
--------------------------------------------------------------------------------  
 Index Scan using idx_test15 on test15  (cost=3600.25..3609.06 rows=3 width=36)  
   Order By: (c1 <=> to_tsquery('postgresql'::text))  
(2 rows)

GIN VS RUM

GIN

postgres=# create table t_gin_1 (id int, ts tsvector);
CREATE TABLE
postgres=# insert into t_gin_1 values (1, to_tsvector('hello digoal')),(2, to_tsvector('hello i digoal')),(3, to_tsvector('hello i am digoal'));
INSERT 0 3
postgres=# create index idx_t_gin_1_1 on t_gin_1 using gin (ts);
CREATE INDEX
postgres=# explain select * from t_gin_1 where ts @@ to_tsquery('english', 'hello <1> digoal');
                       QUERY PLAN                       
--------------------------------------------------------
 Seq Scan on t_gin_1  (cost=0.00..1.04 rows=1 width=36)
   Filter: (ts @@ '''hello'' <-> ''digoal'''::tsquery)
(2 rows)

postgres=# set enable_seqscan=off;
SET
postgres=# explain select * from t_gin_1 where ts @@ to_tsquery('english', 'hello <1> digoal');
                                 QUERY PLAN                                 
----------------------------------------------------------------------------
 Bitmap Heap Scan on t_gin_1  (cost=4.50..6.01 rows=1 width=36)
   Recheck Cond: (ts @@ '''hello'' <-> ''digoal'''::tsquery)
   ->  Bitmap Index Scan on idx_t_gin_1_1  (cost=0.00..4.50 rows=1 width=0)
         Index Cond: (ts @@ '''hello'' <-> ''digoal'''::tsquery)
(4 rows)

postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_gin_1 where ts @@ to_tsquery('english', 'hello <1> digoal');
                                                      QUERY PLAN                                                      
----------------------------------------------------------------------------------------------------------------------
 Bitmap Heap Scan on public.t_gin_1  (cost=4.50..6.01 rows=1 width=36) (actual time=0.029..0.030 rows=1 loops=1)
   Output: id, ts
   Recheck Cond: (t_gin_1.ts @@ '''hello'' <-> ''digoal'''::tsquery)
   Rows Removed by Index Recheck: 2
   Heap Blocks: exact=1
   Buffers: shared hit=4
   ->  Bitmap Index Scan on idx_t_gin_1_1  (cost=0.00..4.50 rows=1 width=0) (actual time=0.018..0.018 rows=3 loops=1)
         Index Cond: (t_gin_1.ts @@ '''hello'' <-> ''digoal'''::tsquery)
         Buffers: shared hit=3
 Planning time: 0.106 ms
 Execution time: 0.061 ms
(11 rows)

RUM

postgres=# create table t_gin_1 (id int, ts tsvector);
CREATE TABLE
postgres=# insert into t_gin_1 values (1, to_tsvector('hello digoal')),(2, to_tsvector('hello i digoal')),(3, to_tsvector('hello i am digoal'));
INSERT 0 3
postgres=#  create index idx_t_gin_1_1 on t_gin_1 using rum (ts rum_tsvector_ops);
CREATE INDEX
postgres=# explain select * from t_gin_1 where ts @@ to_tsquery('english', 'hello <1> digoal');
                       QUERY PLAN                       
--------------------------------------------------------
 Seq Scan on t_gin_1  (cost=0.00..1.04 rows=1 width=36)
   Filter: (ts @@ '''hello'' <-> ''digoal'''::tsquery)
(2 rows)

postgres=# set enable_seqscan =off;
SET
postgres=# explain select * from t_gin_1 where ts @@ to_tsquery('english', 'hello <1> digoal');
                                  QUERY PLAN                                  
------------------------------------------------------------------------------
 Index Scan using idx_t_gin_1_1 on t_gin_1  (cost=2.00..4.01 rows=1 width=36)
   Index Cond: (ts @@ '''hello'' <-> ''digoal'''::tsquery)
(2 rows)

postgres=# explain (analyze,verbose,timing,costs,buffers) select * from t_gin_1 where ts @@ to_tsquery('english', 'hello <1> digoal');
                                                          QUERY PLAN                                                           
-------------------------------------------------------------------------------------------------------------------------------
 Index Scan using idx_t_gin_1_1 on public.t_gin_1  (cost=2.00..4.01 rows=1 width=36) (actual time=0.049..0.049 rows=1 loops=1)
   Output: id, ts
   Index Cond: (t_gin_1.ts @@ '''hello'' <-> ''digoal'''::tsquery)
   Buffers: shared hit=3
 Planning time: 0.288 ms
 Execution time: 0.102 ms
(6 rows)

八、bloom

原理

bloom索引接口是PostgreSQL基於bloom filter構造的一個索引接口，屬於lossy索引，可以收斂結果集(排除絕對不滿足條件的結果，剩餘的結果裏再挑選滿足條件的結果)，因此需要二次check，bloom支持任意列組合的等值查詢。

bloom存儲的是簽名，簽名越大，耗費的空間越多，但是排除更加精準。有利有弊。

CREATE INDEX bloomidx ON tbloom USING bloom (i1,i2,i3)
       WITH (length=80, col1=2, col2=2, col3=4);

簽名長度 80 bit, 最大允許4096 bits
col1 - col32，分別指定每列的bits，默認長度2，最大允許4095 bits.

bloom provides an index access method based on Bloom filters.

A Bloom filter is a space-efficient data structure that is used to test whether an element is a member of a set. In the case of an index access method, it allows fast exclusion of non-matching tuples via signatures whose size is determined at index creation.

This type of index is most useful when a table has many attributes and queries test arbitrary combinations of them.

應用場景

bloom索引適合多列任意組合查詢。

《PostgreSQL 9.6 黑科技 bloom 算法索引，一個索引支撐任意列組合查詢》

例子

=# CREATE TABLE tbloom AS  
   SELECT  
     (random() * 1000000)::int as i1,  
     (random() * 1000000)::int as i2,  
     (random() * 1000000)::int as i3,  
     (random() * 1000000)::int as i4,  
     (random() * 1000000)::int as i5,  
     (random() * 1000000)::int as i6  
   FROM  
  generate_series(1,10000000);  
SELECT 10000000  
=# CREATE INDEX bloomidx ON tbloom USING bloom (i1, i2, i3, i4, i5, i6);  
CREATE INDEX  
=# SELECT pg_size_pretty(pg_relation_size('bloomidx'));  
 pg_size_pretty  
----------------  
 153 MB  
(1 row)  
=# CREATE index btreeidx ON tbloom (i1, i2, i3, i4, i5, i6);  
CREATE INDEX  
=# SELECT pg_size_pretty(pg_relation_size('btreeidx'));  
 pg_size_pretty  
----------------  
 387 MB  
(1 row)  
  
=# EXPLAIN ANALYZE SELECT * FROM tbloom WHERE i2 = 898732 AND i5 = 123451;  
                                                        QUERY PLAN  
---------------------------------------------------------------------------------------------------------------------------  
 Bitmap Heap Scan on tbloom  (cost=178435.39..178439.41 rows=1 width=24) (actual time=76.698..76.698 rows=0 loops=1)  
   Recheck Cond: ((i2 = 898732) AND (i5 = 123451))  
   Rows Removed by Index Recheck: 2439  
   Heap Blocks: exact=2408  
   ->  Bitmap Index Scan on bloomidx  (cost=0.00..178435.39 rows=1 width=0) (actual time=72.455..72.455 rows=2439 loops=1)  
         Index Cond: ((i2 = 898732) AND (i5 = 123451))  
 Planning time: 0.475 ms  
 Execution time: 76.778 ms  
(8 rows)

九、zombodb

原理

zombodb是PostgreSQL與ElasticSearch結合的一個索引接口，可以直接讀寫ES。

https://github.com/zombodb/zombodb

應用場景

與ES結合，實現SQL接口的搜索引擎，實現數據的透明搜索。

例子

-- Install the extension:  
  
CREATE EXTENSION zombodb;  
  
-- Create a table:  
  
CREATE TABLE products (  
    id SERIAL8 NOT NULL PRIMARY KEY,  
    name text NOT NULL,  
    keywords varchar(64)[],  
    short_summary phrase,  
    long_description fulltext,   
    price bigint,  
    inventory_count integer,  
    discontinued boolean default false,  
    availability_date date  
);  
  
-- insert some data  
-- Index it:  
  
CREATE INDEX idx_zdb_products   
          ON products   
       USING zombodb(zdb('products', products.ctid), zdb(products))  
        WITH (url='https://localhost:9200/', shards=5, replicas=1);  
  
-- Query it:  
  
SELECT *   
  FROM products   
 WHERE zdb('products', ctid) ==> 'keywords:(sports,box) or long_description:(wooden w/5 away) and price < 100000';

十、bitmap

原理

bitmap索引是Greenplum的索引接口，類似GIN倒排，隻是bitmap的KEY是列的值，VALUE是BIT（每個BIT對應一行），而不是行號list或tree。

《Greenplum 最佳實踐 - 什麼時候選擇bitmap索引》

應用場景

當某個字段的唯一值個數在100到10萬之間(超出這個範圍，不建議使用bitmap)時，如果表的記錄數特別多，而且變更不頻繁（或者是AO表），那麼很適合BITMAP索引，bitmap索引可以實現快速的多個或單個VALUE的搜索。因為隻需要對行號的BITMAP進行BIT與或運算，得到最終的BITMAP，從最終的BITMAP映射到行進行提取。

bitmap與btree一樣，都支持等於，大於，小於，大於等於，小於等於的查詢。

例子

postgres=# create table t_bitmap(id int, info text, c1 int);
NOTICE:  Table doesn't have 'DISTRIBUTED BY' clause -- Using column named 'id' as the Greenplum Database data distribution key for this table.
HINT:  The 'DISTRIBUTED BY' clause determines the distribution of data. Make sure column(s) chosen are the optimal data distribution key to minimize skew.
CREATE TABLE
postgres=# insert into t_bitmap select generate_series(1,1000000), 'test', random()*1000;
INSERT 0 1000000
postgres=# create index idx_t_bitmap_1 on t_bitmap using bitmap(c1);
CREATE INDEX
postgres=# explain analyze select * from t_bitmap where c1=1;
                                       QUERY PLAN                                       
----------------------------------------------------------------------------------------
 Gather Motion 3:1  (slice1; segments: 3)  (cost=0.00..200.27 rows=1 width=13)
   Rows out:  0 rows at destination with 3.769 ms to end, start offset by 0.250 ms.
   ->  Index Scan using idx_t_bitmap_1 on t_bitmap  (cost=0.00..200.27 rows=1 width=13)
         Index Cond: c1 = 1
         Rows out:  0 rows (seg0) with 0.091 ms to end, start offset by 3.004 ms.
 Slice statistics:
   (slice0)    Executor memory: 115K bytes.
   (slice1)    Executor memory: 273K bytes avg x 3 workers, 273K bytes max (seg0).
 Statement statistics:
   Memory used: 128000K bytes
 Total runtime: 4.110 ms
(11 rows)

postgres=# explain analyze select * from t_bitmap where c1<=10;
                                       QUERY PLAN                                       
----------------------------------------------------------------------------------------
 Gather Motion 3:1  (slice1; segments: 3)  (cost=0.00..200.27 rows=1 width=13)
   Rows out:  0 rows at destination with 2.952 ms to end, start offset by 0.227 ms.
   ->  Index Scan using idx_t_bitmap_1 on t_bitmap  (cost=0.00..200.27 rows=1 width=13)
         Index Cond: c1 <= 10
         Rows out:  0 rows (seg0) with 0.055 ms to end, start offset by 3.021 ms.
 Slice statistics:
   (slice0)    Executor memory: 115K bytes.
   (slice1)    Executor memory: 273K bytes avg x 3 workers, 273K bytes max (seg0).
 Statement statistics:
   Memory used: 128000K bytes
 Total runtime: 3.278 ms
(11 rows)

十一、varbitx

原理

varbitx是阿裏雲RDS的擴展包，豐富bit類型的函數接口，實際上並不是索引接口，但是在PostgreSQL中使用varbitx可以代替bitmap索引，達到同樣的效果。

應用場景

《阿裏雲RDS for PostgreSQL varbitx插件與實時畫像應用場景介紹》

《基於阿裏雲 RDS PostgreSQL 打造實時用戶畫像推薦係統》

《PostgreSQL (varbit, roaring bitmap) VS pilosa(bitmap庫)》

例子

略，請參考以上文章。

十二、部分索引

PostgreSQL允許用戶創建部分索引，例如業務上隻關心激活用戶，所以可以隻對激活用戶創建索引。

例子

create table test(id int, info text, crt_time timestamp, active boolean);  
  
create index idx_test_id on test(id) where active;  
  
select * from test where active and id=?;  -- 可以使用部分索引

十三、表達式索引

表達式索引也是PostgreSQL特有的特性，例如用戶的數據需要轉換後查詢，例如某些設備上傳的地理坐標的坐標係不符合國標，需要轉換為國內的空間坐標來查詢。

那麼可以針對這類字段，創建表達式索引，將轉換過程放到表達式中，查詢時也使用表達式進行查詢。

create table t_express (id int, pos geometry);  
  
create index idx_t_express_1 on t_express using gist ( ( ST_Transform(pos, 26986) ) );  
  
select * from t_express order by ST_Transform(pos, 26986) <-> ST_Transform(ST_GeomFromText('POINT(108.50000000001 22.8)', 4326), 26986) limit 10;

十四、內部窺視索引存儲

https://www.postgresql.org/docs/9.6/static/pageinspect.html

通過pageinspect插件，可以讀取索引頁的內容。

例子

test=# SELECT * FROM bt_page_stats('pg_cast_oid_index', 1);  
-[ RECORD 1 ]-+-----  
blkno         | 1  
type          | l  
live_items    | 256  
dead_items    | 0  
avg_item_size | 12  
page_size     | 8192  
free_size     | 4056  
btpo_prev     | 0  
btpo_next     | 0  
btpo          | 0  
btpo_flags    | 3  
  
test=# SELECT * FROM bt_page_items('pg_cast_oid_index', 1);  
 itemoffset |  ctid   | itemlen | nulls | vars |    data  
------------+---------+---------+-------+------+-------------  
          1 | (0,1)   |      12 | f     | f    | 23 27 00 00  
          2 | (0,2)   |      12 | f     | f    | 24 27 00 00  
          3 | (0,3)   |      12 | f     | f    | 25 27 00 00  
          4 | (0,4)   |      12 | f     | f    | 26 27 00 00  
          5 | (0,5)   |      12 | f     | f    | 27 27 00 00  
          6 | (0,6)   |      12 | f     | f    | 28 27 00 00  
          7 | (0,7)   |      12 | f     | f    | 29 27 00 00  
          8 | (0,8)   |      12 | f     | f    | 2a 27 00 00  
  
test=# SELECT * FROM brin_page_items(get_raw_page('brinidx', 5),  
                                     'brinidx')  
       ORDER BY blknum, attnum LIMIT 6;  
 itemoffset | blknum | attnum | allnulls | hasnulls | placeholder |    value       
------------+--------+--------+----------+----------+-------------+--------------  
        137 |      0 |      1 | t        | f        | f           |   
        137 |      0 |      2 | f        | f        | f           | {1 .. 88}  
        138 |      4 |      1 | t        | f        | f           |   
        138 |      4 |      2 | f        | f        | f           | {89 .. 176}  
        139 |      8 |      1 | t        | f        | f           |   
        139 |      8 |      2 | f        | f        | f           | {177 .. 264}

參考

最後更新：2017-06-28 11:32:08

PostgreSQL 9種索引的原理和應用場景

標簽

背景

一、btree

原理

應用場景

例子

二、hash

原理

應用場景

例子

三、gin

原理

應用場景

例子

四、gist

原理

應用場景

例子

五、sp-gist

原理

應用場景

例子

六、brin

原理

應用場景

例子

七、rum

原理

應用場景

例子

八、bloom

原理

應用場景

例子

九、zombodb

原理

應用場景

例子

十、bitmap

原理

應用場景

例子

十一、varbitx

原理

應用場景

例子

十二、部分索引

十三、表達式索引

十四、內部窺視索引存儲

參考

上一篇： MongoDB World 2017 參會全記錄

下一篇： PostgreSQL\HybridDB for PG 毫秒級多維數據透視 案例分享

相關內容

熱門內容

最新內容

下一篇： PostgreSQL\HybridDB for PG 毫秒級多維數據透視案例分享