postgresql/src/backend/lib
Jeff Davis fd734f387d Use pg_bitutils for HyperLogLog.
Using pg_leftmost_one_pos32() yields substantial performance benefits.

Backpatching to version 13 because HLL is used for HashAgg
improvements in 9878b643, which was also backpatched to 13.

Reviewed-by: Peter Geoghegan
Discussion: https://postgr.es/m/CAH2-WzkGvDKVDo+0YvfvZ+1CE=iCi88DCOGFF3i1hTGGaxcKPw@mail.gmail.com
Backpatch-through: 13
2020-07-30 09:14:23 -07:00
Makefile Make StringInfo available to frontend code. 2019-11-05 14:56:40 -08:00
README Add IntegerSet, to hold large sets of 64-bit ints efficiently. 2019-03-22 13:21:45 +02:00
binaryheap.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
bipartite_match.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
bloomfilter.c Move src/backend/utils/hash/hashfn.c to src/common 2020-02-27 09:25:41 +05:30
dshash.c Move src/backend/utils/hash/hashfn.c to src/common 2020-02-27 09:25:41 +05:30
hyperloglog.c Use pg_bitutils for HyperLogLog. 2020-07-30 09:14:23 -07:00
ilist.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
integerset.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
knapsack.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
pairingheap.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
rbtree.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00

README

This directory contains general-purpose data structures for use anywhere
in the backend:

binaryheap.c - a binary heap

bipartite_match.c - Hopcroft-Karp maximum cardinality matching algorithm for bipartite graphs

bloomfilter.c - probabilistic, space-efficient set membership testing

dshash.c - concurrent hash tables backed by dynamic shared memory areas

hyperloglog.c - a streaming cardinality estimator

ilist.c - singly and doubly linked lists

integerset.c - a data structure for holding a large set of integers

knapsack.c - knapsack problem solver

pairingheap.c - a pairing heap

rbtree.c - a red-black tree

stringinfo.c - an extensible string type


Aside from the inherent characteristics of the data structures, there are a
few practical differences between the binary heap and the pairing heap. The
binary heap is fully allocated at creation, and cannot be expanded beyond the
allocated size. The pairing heap, on the other hand, has no inherent maximum
size, but the caller must allocate each element stored in the heap, whereas
the binary heap works with plain Datums or pointers.
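
The allocation difference described above can be illustrated with a minimal
sketch. This is not the API of binaryheap.c or pairingheap.c: every name
below (ToyBinaryHeap, toy_bh_add, ToyPairingNode, toy_ph_add, and so on) is
hypothetical, and plain ints stand in for Datums. The fixed-capacity heap
refuses an insert once it is full, while the pairing heap only links nodes
that the caller has already allocated.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Toy binary min-heap: all storage is allocated once, at creation. */
typedef struct ToyBinaryHeap
{
	int			capacity;		/* fixed at creation, never grows */
	int			size;			/* number of values currently stored */
	int			values[];		/* plain values live inside the heap */
} ToyBinaryHeap;

static ToyBinaryHeap *
toy_bh_create(int capacity)
{
	ToyBinaryHeap *h = malloc(sizeof(ToyBinaryHeap) + sizeof(int) * capacity);

	assert(h != NULL);
	h->capacity = capacity;
	h->size = 0;
	return h;
}

/* Returns 1 on success, 0 if the heap is already at capacity. */
static int
toy_bh_add(ToyBinaryHeap *h, int v)
{
	int			i;

	if (h->size >= h->capacity)
		return 0;
	i = h->size++;
	/* Sift up: move larger parents down until v's slot is found. */
	while (i > 0 && h->values[(i - 1) / 2] > v)
	{
		h->values[i] = h->values[(i - 1) / 2];
		i = (i - 1) / 2;
	}
	h->values[i] = v;
	return 1;
}

static int
toy_bh_first(const ToyBinaryHeap *h)
{
	assert(h->size > 0);
	return h->values[0];
}

/* Toy pairing min-heap node: allocated and owned by the caller. */
typedef struct ToyPairingNode
{
	int			key;
	struct ToyPairingNode *first_child;
	struct ToyPairingNode *next_sibling;
} ToyPairingNode;

/* Links two heaps; the root with the smaller key becomes the parent. */
static ToyPairingNode *
toy_ph_merge(ToyPairingNode *a, ToyPairingNode *b)
{
	ToyPairingNode *tmp;

	if (a == NULL)
		return b;
	if (b == NULL)
		return a;
	if (b->key < a->key)
	{
		tmp = a;
		a = b;
		b = tmp;
	}
	b->next_sibling = a->first_child;
	a->first_child = b;
	return a;
}

/* Adds a caller-owned node; returns the new root (the minimum). */
static ToyPairingNode *
toy_ph_add(ToyPairingNode *root, ToyPairingNode *node)
{
	node->first_child = NULL;
	node->next_sibling = NULL;
	return toy_ph_merge(root, node);
}
```

In this sketch, toy_bh_add() on a heap created with capacity 2 fails on the
third call, whereas toy_ph_add() accepts any number of nodes because the heap
itself never allocates memory.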

The linked lists in ilist.c can be embedded directly into other structs,
unlike the List interface in nodes/pg_list.h.
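
As a rough illustration of what "embedded" means here, the following sketch
places the list links inside the element struct itself and recovers the
enclosing struct from a link pointer with offsetof(). It is only a sketch of
the general intrusive-list technique: the names (toy_dlist_node,
toy_dlist_head, toy_container_of, Task) are hypothetical and are not the
declarations found in ilist.h.

```c
#include <assert.h>
#include <stddef.h>

/* Link fields that get embedded inside each element. */
typedef struct toy_dlist_node
{
	struct toy_dlist_node *prev;
	struct toy_dlist_node *next;
} toy_dlist_node;

/* Circular list head: an empty list points back at itself. */
typedef struct toy_dlist_head
{
	toy_dlist_node head;
} toy_dlist_head;

/* Recover the enclosing struct from a pointer to its embedded node. */
#define toy_container_of(type, member, ptr) \
	((type *) ((char *) (ptr) - offsetof(type, member)))

static void
toy_dlist_init(toy_dlist_head *h)
{
	h->head.prev = &h->head;
	h->head.next = &h->head;
}

static int
toy_dlist_is_empty(const toy_dlist_head *h)
{
	return h->head.next == &h->head;
}

/* Appends a node; no memory is allocated by the list itself. */
static void
toy_dlist_push_tail(toy_dlist_head *h, toy_dlist_node *n)
{
	n->next = &h->head;
	n->prev = h->head.prev;
	h->head.prev->next = n;
	h->head.prev = n;
}

/* Example element: the list node is a member of the struct. */
typedef struct Task
{
	int			id;
	toy_dlist_node link;
} Task;

/* Walks the list and sums the ids of the enclosing Task structs. */
static int
toy_sum_task_ids(toy_dlist_head *h)
{
	int			sum = 0;
	toy_dlist_node *cur;

	for (cur = h->head.next; cur != &h->head; cur = cur->next)
		sum += toy_container_of(Task, link, cur)->id;
	return sum;
}
```

Because the links live inside Task, adding an element to the list needs no
allocation beyond the Task itself, and a struct can carry several such link
members to sit on several lists at once.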