Sql-server – How to manipulate the query plan so that I can have more control on memory grants

execution-plan, optimization, plan-guides, sql server 2014, statistics

Let's say I have a complex query, and for this query there are huge
differences between the estimated and actual execution plans.

These differences are causing spills to tempdb, as described here.

The problem is that in this query there are tables that are too big for me to update the statistics.

How can I manipulate the query plan so that I have more control over memory grants?

I want to avoid memory spillage to tempdb.

To keep this question objective and not too broad, we could concentrate on the hash join operator, for instance: when can I replace it with a merge join operator?

Are there other operators with memory grants that could be replaced, depending on the workload?

Understanding SQL server memory grant

-- Search cache for queries with memory grants:
SELECT t.text, cp.objtype, qp.query_plan
FROM sys.dm_exec_cached_plans AS cp
JOIN sys.dm_exec_query_stats AS qs ON cp.plan_handle = qs.plan_handle
CROSS APPLY sys.dm_exec_query_plan(cp.plan_handle) AS qp
CROSS APPLY sys.dm_exec_sql_text(qs.sql_handle) AS t
WHERE qp.query_plan.exist('declare namespace n=''http://schemas.microsoft.com/sqlserver/2004/07/showplan''; //n:MemoryFractions') = 1
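The cache search above finds plans that contain memory-consuming operators. To see what running queries actually requested and received, one option is to query sys.dm_exec_query_memory_grants while the workload is active. This is a minimal sketch; the choice of columns and the ordering are my own and are not part of the original question.

```sql
-- Requests that currently hold, or are waiting for, a memory grant.
SELECT mg.session_id,
       mg.requested_memory_kb,
       mg.granted_memory_kb,    -- NULL while the request is still waiting
       mg.used_memory_kb,
       mg.max_used_memory_kb,
       mg.ideal_memory_kb,
       mg.wait_time_ms,
       t.text
FROM sys.dm_exec_query_memory_grants AS mg
CROSS APPLY sys.dm_exec_sql_text(mg.sql_handle) AS t
ORDER BY mg.requested_memory_kb DESC;
```

The requested and ideal figures are derived from the optimizer's estimates, so when the estimates are far off these numbers will be off as well.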

Best Answer

I suggest thinking about your problem in a different way. It's not true that there's some inherent advantage to a hash join that makes the SQL Server query optimizer want to pick it (except in batch mode with clustered columnstore indexes). The query optimizer chose a hash join because it thought it was a better option than a merge or nested loop join. Nested loop joins are a good choice when the outer table is small and the inner table is indexed. Merge joins are a good choice when the inputs are sorted and you need to access a non-small percentage of rows from the tables. Hash joins are a good choice when you don't have a better choice than scanning both tables. In most cases, a hash join where one is not appropriate indicates missing indexes.

If you have a legitimate need to encourage a merge join, there are a few things that you can do. If it is a 1:1 join, define constraints so that SQL Server knows that. The query optimizer costs many-to-many merge joins much higher than 1:1 joins. Create indexes so that your data is presorted. If you cannot do that, then make sure the sorts are in a parallel zone of the plan. Sorts benefit greatly from parallelism. Note that a merge join can lead to tempdb usage if there are large sorts or if the merge join is many-to-many. There may be other scenarios as well.
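As a rough illustration of the points about uniqueness and presorted inputs, here is a sketch that assumes two hypothetical tables, dbo.Orders and dbo.OrderDetails, joined on OrderID. The table, column, and index names are invented for the example. The OPTION (MERGE JOIN) hint at the end is my addition for side-by-side testing, not something the answer recommends leaving in production code.

```sql
-- Hypothetical tables: dbo.Orders and dbo.OrderDetails, joined on OrderID.

-- A unique index tells the optimizer that Orders has at most one row per
-- OrderID, so the merge join does not have to be costed as many-to-many.
CREATE UNIQUE CLUSTERED INDEX CX_Orders_OrderID
    ON dbo.Orders (OrderID);

-- Presort the other input on the join key so no Sort operator is needed.
CREATE CLUSTERED INDEX CX_OrderDetails_OrderID
    ON dbo.OrderDetails (OrderID);

-- For testing only: force merge joins for the whole query and compare the
-- memory grant and tempdb usage with the unhinted plan.
SELECT o.OrderID, d.ProductID, d.Quantity
FROM dbo.Orders AS o
JOIN dbo.OrderDetails AS d
    ON d.OrderID = o.OrderID
OPTION (MERGE JOIN);
```

If the optimizer picks the merge join on its own once the indexes exist, the hint is unnecessary and is better removed.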

Nested loop joins aren't necessarily a bad choice for a large, complicated plan. If you have a legitimate need to encourage a nested loop join, make sure you have indexes that enable that. Your query may benefit from a parallel apply type of plan. Those plans can use less memory than a plan with large parallel hash joins. You should be aware that in some cases nested loop joins will do a sort on the outer table before the join operator to improve performance.

On the subject of memory, you could make sure that your server has an appropriate amount of memory and that other queries on the server aren't using too much memory. Otherwise, trying to pick query plan operators on the basis of memory seems kind of backwards. You should really fix your statistics. Use table partitioning, a partitioned view, a smaller sample size, filtered indexes, etc. You're playing a game that's very difficult to win if you don't fix your cardinality estimates.
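Two of the suggestions above, a smaller sample size and filtering, can be sketched as follows. The table dbo.BigTable, its columns, the statistics names, the sample percentages, and the date boundary are all assumptions made for illustration. The second statement uses filtered statistics, a close relative of the filtered indexes the answer mentions.

```sql
-- Hypothetical table dbo.BigTable with columns OrderDate and CustomerID.

-- Refresh one statistics object from a small sample instead of a full scan.
UPDATE STATISTICS dbo.BigTable IX_BigTable_OrderDate
    WITH SAMPLE 1 PERCENT;

-- Filtered statistics: a separate histogram that covers only the recent,
-- frequently queried slice of the table.
CREATE STATISTICS st_BigTable_CustomerID_Recent
    ON dbo.BigTable (CustomerID)
    WHERE OrderDate >= '20240101'
    WITH SAMPLE 10 PERCENT;
```

Whether a small sample gives good enough estimates depends on how skewed the data is, so it is worth comparing estimated and actual row counts after each change.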

Related Solutions

Sql-server – How bad is query plan pollution (too many equivalent plans in cache)

"How bad is it?" depends on the degree to which you are suffering now or could suffer with increased workload in the future.

One major point of suffering with plan cache pollution could be too many single use plans bloating your plan cache leading to inefficient cache usage.

Another point of suffering could be a high number of compilations per second: in an environment with a heavy workload and a lot of activity, there is a cost associated with compiling over and over.

You can see the impact of compilations in perfmon (SQLServer:SQL Statistics, counter SQL Compilations/sec). This can look like CPU pressure. To your applications, this can look like increased query duration, because a query waits for a needless compile each time it runs.
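The same counters that perfmon shows can also be read from inside SQL Server. The query below is a sketch of one way to do that; comparing compilations against batch requests is my suggestion and not part of the original answer.

```sql
-- These values are cumulative since the instance started. Sample twice and
-- divide the difference by the elapsed seconds to get a per-second rate.
SELECT RTRIM(counter_name) AS counter_name,
       cntr_value
FROM sys.dm_os_performance_counters
WHERE object_name LIKE '%SQL Statistics%'
  AND counter_name IN ('Batch Requests/sec',
                       'SQL Compilations/sec',
                       'SQL Re-Compilations/sec');
```

A compilation rate that stays close to the batch request rate suggests that most batches are being compiled rather than reusing a cached plan.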

You can see the impact of the memory bloat on the plan cache with this query, borrowed from Glenn Berry's Diagnostic scripts. How big is your SQLCP plan cache?

SELECT TOP(10) [type] AS [Memory Clerk Type], 
       SUM(pages_kb)/1024 AS [Memory Usage (MB)] 
FROM sys.dm_os_memory_clerks WITH (NOLOCK)
GROUP BY [type]  
ORDER BY SUM(pages_kb) DESC OPTION (RECOMPILE);

The query that was used in the question to identify the number of plans helps as well.
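The query from the question is not reproduced here. As a stand-in, the following sketch summarizes the plan cache by object type and shows how much of it is taken up by plans that have been used only once. It is an assumption about what such a check might look like, not the asker's original query.

```sql
-- Plan cache summary: total plans versus single-use plans, per object type.
SELECT cp.objtype,
       COUNT(*) AS total_plans,
       SUM(CASE WHEN cp.usecounts = 1 THEN 1 ELSE 0 END) AS single_use_plans,
       SUM(CAST(cp.size_in_bytes AS bigint)) / 1048576 AS total_mb,
       SUM(CASE WHEN cp.usecounts = 1
                THEN CAST(cp.size_in_bytes AS bigint)
                ELSE 0 END) / 1048576 AS single_use_mb
FROM sys.dm_exec_cached_plans AS cp
GROUP BY cp.objtype
ORDER BY single_use_mb DESC;
```

A large single_use_mb value for the Adhoc type is the typical signature of the cache bloat described above.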

Is This Ever a Good Thing?

There are some cases where this could be good, but the situation is rare. It could help if you were suffering from parameter sniffing gone bad. (In a nutshell: if the data varies widely from execution to execution based on the parameters, a plan compiled for one set of parameters may be excellent for those values but poor for others.) My guess is that you are unlikely to be hurt by that as badly as by the implications of poor plan reuse.

What Can You Do About It?

Optimize For Ad Hoc Workloads: This can certainly help with the memory implications, since only a stub of the plan is stored in cache at first execution, and the full plan isn't stored until the same query is executed a second time.
Forced Parameterization: This could help here also. It can sometimes force parameterization to happen and help solve both the issue of cache bloat and the cost of having to recompile.
Fix The Queries: Ideally, you shouldn't have to resort to these options. Instead, you can be stricter in your database development, encourage plan reuse, consider stored procedures for all of their benefits, and attempt to head off the problem that way. Forced parameterization and optimize for ad hoc workloads are good ways to help, but the best solution is always aimed at the root cause.
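For reference, the first two options are switched on roughly as shown below. YourDatabase is a placeholder name. Both are server-wide or database-wide changes, so treat this as a sketch to try on a test system first.

```sql
-- Instance level: cache only a small stub the first time an ad hoc batch runs.
EXEC sys.sp_configure N'show advanced options', 1;
RECONFIGURE;
EXEC sys.sp_configure N'optimize for ad hoc workloads', 1;
RECONFIGURE;

-- Database level: replace literals with parameters in most ad hoc statements.
-- YourDatabase is a placeholder.
ALTER DATABASE [YourDatabase] SET PARAMETERIZATION FORCED;
```

Forced parameterization can be reverted with SET PARAMETERIZATION SIMPLE if it makes some queries worse.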

There is an excellent resource here that talks about some of the dangers of plan cache pollution and some things you can do about it. I'd recommend reading it. It is written for SQL Server 2012, but the concepts and solutions still apply.

Sql-server – Multicolumn statistics on large table yielding poor estimations

With a table that large, I would consider partitioning. Unfortunately I can't answer your specific questions (1-3), but in general one of the benefits of using partitioned views (instead of native partitioning) is that the individual tables within the partitioned view are considered separate objects and have their own statistics, each with up to 200 histogram steps. Here is a post where Kimberly Tripp of SQLSkills recommends that, for large tables, you consider not only partitioning or partitioned views but combining the two.

In case you don't know, a partitioned view is one where you have multiple tables, each holding a portion of the data, and a view on top that uses UNION ALL to combine the tables.
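A minimal sketch of that layout, assuming hypothetical yearly tables (all object names and the choice of a date-based split are invented for the example):

```sql
-- One member table per year. The CHECK constraint on the partitioning
-- column lets the optimizer skip member tables that cannot match a filter.
CREATE TABLE dbo.Sales_2023 (
    SaleID   int   NOT NULL,
    SaleDate date  NOT NULL
        CONSTRAINT CK_Sales_2023_SaleDate
        CHECK (SaleDate >= '20230101' AND SaleDate < '20240101'),
    Amount   money NOT NULL,
    CONSTRAINT PK_Sales_2023 PRIMARY KEY (SaleID, SaleDate)
);

CREATE TABLE dbo.Sales_2024 (
    SaleID   int   NOT NULL,
    SaleDate date  NOT NULL
        CONSTRAINT CK_Sales_2024_SaleDate
        CHECK (SaleDate >= '20240101' AND SaleDate < '20250101'),
    Amount   money NOT NULL,
    CONSTRAINT PK_Sales_2024 PRIMARY KEY (SaleID, SaleDate)
);
GO

-- The view presents the member tables as a single logical table.
CREATE VIEW dbo.Sales
AS
SELECT SaleID, SaleDate, Amount FROM dbo.Sales_2023
UNION ALL
SELECT SaleID, SaleDate, Amount FROM dbo.Sales_2024;
GO
```

Each member table gets its own statistics objects, which is the benefit described above.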

Here is another of Kimberly's blogs on statistics in case you are interested. It should help you answer some of your other questions.

Here's an article by Conor Cunningham regarding statistics: Statistics, Damned Lies, and Statistics – What is Statman?
