postgresql下如何处理大表（过亿记录）

数据库中的某表在源源不断的更新，导致此表记录不断增大，当增大到一定程度后系统响应速度就会出现明显延迟。这种情况该如何处理呢？

最直观的方法，就是分表，一个表容量过大，那就其分为多个表。但这样会导致另外一个问题，即系统的前台和后台逻辑就会发生重大变化，需要对系统进行彻底的返工，工作量太大而且以后的扩展性还受到很大的限制。

但postgresql还提供了另外一种更好的方法：分区（partitioning)。使用分区方法可以避免分表方案的缺点。

在postgresql的文档中提到，分区的优点：

Query performance can be improved dramatically in certain situations, particularly when most of the heavily accessed rows of the table are in a single partition or a small number of partitions. The partitioning substitutes for leading columns of indexes, reducing index size and making it more likely that the heavily-used parts of the indexes fit in memory.
When queries or updates access a large percentage of a single partition, performance can be improved by taking advantage of sequential scan of that partition instead of using an index and random access reads scattered across the whole table.
Bulk loads and deletes can be accomplished by adding or removing partitions, if that requirement is planned into the partitioning design. ALTER TABLE NO INHERIT and DROP TABLE are both far faster than a bulk operation. These commands also entirely avoid the VACUUM overhead caused by a bulk DELETE.
Seldom-used data can be migrated to cheaper and slower storage media.

总而言之，分区的优点是在不破坏原有前段代码的结构的基础上，大幅提升数据库端的处理性能。

那如何创建分区呢？简单的讲，以下几个步骤：

评论