I'm working on a project that must support two database engines; SQL Server and PostgreSQL.
We are using NHibernate as the ORM.
We are running into performance issues with certain queries. Using SQL Server tools we've come up with several new indexes and statistics that greatly improve performance on SQL Server. However, I'm not certain how to implement the same indexes and statistics on PostgreSQL.
Two examples are:
CREATE STATISTICS [perfStat_Answer_02] ON [dbo].[Answer]
([InclusionExpressionGroupId], [QuestionId], [AnswerId])
CREATE NONCLUSTERED INDEX [perf_Answer_01] ON [dbo].[Answer]
(
[QuestionId] ASC
)
INCLUDE (
[AnswerId],
[InclusionExpressionGroupId],
[AnswerConceptId],
[Revision],
[AnswerText],
[AnswerOrder]
)
WITH (
SORT_IN_TEMPDB = OFF
, IGNORE_DUP_KEY = OFF
, DROP_EXISTING = OFF
, ONLINE = OFF)
ON [PRIMARY]
What is the syntax for the INCLUDE
d fields in PostgreSQL, if such a feature exists?
How do we add statistics?
Reading the PostgreSQL docs, I'm not convinced that either are supported. However, I would like to know if there is any way to accomplish something similar.
Best Answer
I don't know what
CREATE STATISTICS
does, but statistics for the optimizer are collected using theANALZYE
command whenautovacuum
is running - which is turned on by default.Statistics are always collected for all columns, no need to turn it on specifically.
You can control the level of details collected for the statistics on a per-column basis using
ALTER TABLE ... ALTER COLUMN column SET STATISTICS integer
.An index in PostgreSQL is always non-clustered, so I'd assume that the above index maps to a regular index on the
QuestionId
column.Not sure about the
INCLUDE
part. I assume this is to support index only retrievals if that index is chosen by the optimizer. As PostgreSQL does not yet have an index-only retrieval, there is no equivalent technique there.