Add benchmark results

ololobus · ololobus · commit d38aa2ad816e · 2020-05-14T01:00:05.000+03:00
diff --git a/README.md b/README.md
@@ -49,7 +49,9 @@ CREATE EXTENSION ptrack;
 
 ## Configuration
 
-The only one configurable option is `ptrack.map_size` (in MB). Default is `-1`, which means `ptrack` is turned off. To completely avoid false positives it is recommended to set `ptrack.map_size` to `1 / 1000` of expected `PGDATA` size (i.e. `1000` for a 1 TB database), since a single 8 byte `ptrack` map record tracks changes in a standard 8 KB PostgreSQL page. To disable `ptrack` and clean up all remaining service files set `ptrack.map_size` to `0`.
+The only one configurable option is `ptrack.map_size` (in MB). Default is `-1`, which means `ptrack` is turned off. To completely avoid false positives it is recommended to set `ptrack.map_size` to `1 / 1000` of expected `PGDATA` size (i.e. `1000` for a 1 TB database), since a single 8 byte `ptrack` map record tracks changes in a standard 8 KB PostgreSQL page.
+
+To disable `ptrack` and clean up all remaining service files set `ptrack.map_size` to `0`.
 
 ## Public SQL API
 
@@ -87,10 +89,14 @@ postgres=# SELECT ptrack_get_pagemapset('0/186F4C8');
 
 2. The only one production-ready backup utility, that fully supports `ptrack` is [pg_probackup](https://github.com/postgrespro/pg_probackup).
 
-3. Currently, you cannot resize `ptrack` map in runtime, only on postmaster restart. Also, you will loose all tracked changes, so it is recommended to do so in the maintainance window and accompany this operation with full backup. See [TODO](#TODO) for details.
+3. Currently, you cannot resize `ptrack` map in runtime, only on postmaster start. Also, you will loose all tracked changes, so it is recommended to do so in the maintainance window and accompany this operation with full backup. See [TODO](#TODO) for details.
 
 4. You will need up to `ptrack.map_size * 3` of additional disk space, since `ptrack` uses two additional temporary files for durability purpose. See [TODO](#Architecture) for details.
 
+## Benchmarks
+
+Briefly, an overhead of using `ptrack` on TPS usually does not exceed a couple of percent (~1-3%) for a database of dozens to hundreds of gigabytes in size, while the backup time scales down linearly with backup size with a coefficient ~1. It means that an incremental `ptrack` backup of a database with only 20% of changed pages will be 5 times faster than a full backup. More details [here](benchmarks).
+
 ## Architecture
 
 We use a single shared hash table in `ptrack`, which is mapped in memory from the file on disk using `mmap`. Due to the fixed size of the map there may be false positives (when some block is marked as changed without being actually modified), but not false negative results. However, these false postives may be completely eliminated by setting a high enough `ptrack.map_size`.
diff --git a/benchmarks/README.md b/benchmarks/README.md
@@ -0,0 +1,97 @@
+# Ptrack benchmarks
+
+## Runtime overhead
+
+First target was to measure `ptrack` overhead on TPS due to marking modified pages in the map in memory. We used PostgreSQL 12 cluster of approximately 1 GB size, initialized with `pgbench` on a `tmpfs` partition:
+
+```sh
+pgbench -i -s 133
+```
+
+Default `pgbench` transaction script [were modified](pgb.sql) to exclude `pgbench_tellers` and `pgbench_branches` updates in order to lower lock contention and make `ptrack` overhead more visible. So `pgbench` was invoked as following:
+
+```sh
+pgbench -s133 -c40 -j1 -n -P15 -T300 -f pgb.sql
+```
+
+Results:
+
+| ptrack.map_size, MB | 0 (turned off) | 32 | 64 | 256 | 512 | 1024 |
+|---------------------|----------------|----|----|-----|-----|------|
+| TPS | 16900 | 16890 | 16855 | 16468 | 16490 | 16220 |
+
+TPS fluctuates in a several percent range around 16500 on the used machine, but in average `ptrack` overhead does not exceed 1-3% for any reasonable `ptrack.map_size`. It only becomes noticeable closer to 1 GB `ptrack.map_size` (~3-4%), which is enough to track changes in the database of up to 1 TB size without false positives.
+
+
+<!-- ## Checkpoint overhead
+
+Since `ptrack` map is completely flushed to disk during checkpoints, the same test were performed on HDD, but with slightly different configuration:
+```conf
+synchronous_commit = off
+shared_buffers = 1GB
+```
+and `pg_prewarm` run prior to the test. -->
+
+## Backups speedup
+
+To test incremental backups speed a fresh cluster were initialized with following DDL:
+
+```sql
+CREATE TABLE large_test (num1 bigint, num2 double precision, num3 double precision);
+CREATE TABLE large_test2 (num1 bigint, num2 double precision, num3 double precision);
+```
+
+These relations were populated with approximately 2 GB of data that way:
+
+```sql
+INSERT INTO large_test (num1, num2, num3)
+SELECT s, random(), random()*142
+FROM generate_series(1, 20000000) s;
+```
+
+Then a part of one relation was touched with a following query:
+
+```sql
+UPDATE large_test2 SET num3 = num3 + 1 WHERE num1 < 20000000 / 5;
+```
+
+After that, incremental `ptrack` backups were taken with `pg_probackup` followed by full backups. Tests show that `ptrack_backup_time / full_backup_time ~= ptrack_backup_size / full_backup_size`, i.e. if the only 20% of data were modified, then `ptrack` backup will be 5 times faster than full backup. Thus, the overhead of building `ptrack` map during backup is minimal. Example:
+
+```log
+21:02:43 postgres:~/dev/ptrack_test$ time pg_probackup backup -B $(pwd)/backup --instance=node -p5432 -b ptrack --no-sync --stream
+INFO: Backup start, pg_probackup version: 2.3.1, instance: node, backup ID: QAA89O, backup mode: PTRACK, wal mode: STREAM, remote: false, compress-algorithm: none, compress-level: 1
+INFO: Parent backup: QAA7FL
+INFO: PGDATA size: 2619MB
+INFO: Extracting pagemap of changed blocks
+INFO: Pagemap successfully extracted, time elapsed: 0 sec
+INFO: Start transferring data files
+INFO: Data files are transferred, time elapsed: 3s
+INFO: wait for pg_stop_backup()
+INFO: pg_stop backup() successfully executed
+WARNING: Backup files are not synced to disk
+INFO: Validating backup QAA89O
+INFO: Backup QAA89O data files are valid
+INFO: Backup QAA89O resident size: 632MB
+INFO: Backup QAA89O completed
+
+real	0m11.574s
+user	0m1.924s
+sys	0m1.100s
+
+21:20:23 postgres:~/dev/ptrack_test$ time pg_probackup backup -B $(pwd)/backup --instance=node -p5432 -b full --no-sync --stream
+INFO: Backup start, pg_probackup version: 2.3.1, instance: node, backup ID: QAA8A6, backup mode: FULL, wal mode: STREAM, remote: false, compress-algorithm: none, compress-level: 1
+INFO: PGDATA size: 2619MB
+INFO: Start transferring data files
+INFO: Data files are transferred, time elapsed: 32s
+INFO: wait for pg_stop_backup()
+INFO: pg_stop backup() successfully executed
+WARNING: Backup files are not synced to disk
+INFO: Validating backup QAA8A6
+INFO: Backup QAA8A6 data files are valid
+INFO: Backup QAA8A6 resident size: 2653MB
+INFO: Backup QAA8A6 completed
+
+real	0m42.629s
+user	0m8.904s
+sys	0m11.960s
+```
diff --git a/benchmarks/pgb.sql b/benchmarks/pgb.sql
@@ -0,0 +1,9 @@
+\set aid random(1, 100000 * :scale)
+\set bid random(1, 1 * :scale)
+\set tid random(1, 10 * :scale)
+\set delta random(-5000, 5000)
+BEGIN;
+UPDATE pgbench_accounts SET abalance = abalance + :delta WHERE aid = :aid;
+SELECT abalance FROM pgbench_accounts WHERE aid = :aid;
+INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) VALUES (:tid, :bid, :aid, :delta, CURRENT_TIMESTAMP);
+END;