Duplicates in sys.query_store_wait_stats

December 18, 2017 Training

For machine learning, or any data analysis task in general, we need data. That alone is not enough: the data has to be in the right shape, and data wrangling is usually a tedious, time-consuming, character-building experience. QueryStore is one of the most important tools in the context of Performance Intelligence (a synonym for gaining performance insights and making automated decisions) because it provides a consistent, unified view of query performance by forming a multi-feature time series for every query-plan-interval:

QueryStore Metrics

What’s more, it’s real-time and doesn’t require any post-processing (except for the steps described below). There are some restrictions to take into account (for continuity, objects shouldn’t be dropped and recreated, and the database shouldn’t be renamed), but that’s a subject for another post. I’ve discussed and given presentations about tools built on top of QueryStore, such as Performance Recommendations and Automatic Tuning, but I haven’t properly covered the recent changes in QueryStore that made that shift possible. These include:

Temp DB Memory Used (KB)
Log Memory Used (KB) - as above, but relevant in Azure SQL Database
Wait Stats
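
To see the first two metrics as raw data, here is a minimal sketch that reads them from sys.query_store_runtime_stats. It assumes SQL Server 2017 or Azure SQL Database, where the tempdb and log columns exist; the TOP (10) limit and the ordering are arbitrary choices for illustration.

```sql
-- Sketch: the newer runtime metrics, per plan and interval.
-- tempdb columns are reported in 8 KB pages, log columns in bytes.
SELECT TOP (10)
       rs.plan_id,
       rs.runtime_stats_interval_id,
       rs.execution_type_desc,
       rs.count_executions,
       rs.avg_tempdb_space_used,
       rs.max_tempdb_space_used,
       rs.avg_log_bytes_used,
       rs.max_log_bytes_used
FROM sys.query_store_runtime_stats AS rs
ORDER BY rs.max_tempdb_space_used DESC;
```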

sys.query_store_wait_stats

In the SQL Server 2016 version of QueryStore there is a plan store and a runtime statistics store. SQL Server 2017 introduces a third structure, the Wait Stats store. Wait Stats are captured at the query plan level and are available through sys.query_store_wait_stats. It’s a special view because, of all the sys.query_store_* objects, it is the only one that has to be aggregated before the data can be consumed. Without GROUP BY it can return duplicates: the combination of (plan_id, runtime_stats_interval_id, execution_type, wait_category) is not unique. This means that for the same plan, interval and wait_category the view can return up to two rows (one representing in-memory stats and the other on-disk stats), which might be confusing. Steps to reproduce the issue:

Enable QS and set INTERVAL_LENGTH_MINUTES to something long, like the default of 60 (1 h)
Run a resource-consuming query
Flush the stats captured in QueryStore to disk
Rerun the query from step 2
Examine the output of sys.query_store_wait_stats

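	USE [WWI];
	GO
	-- Enable Query Store (= ON) with a long aggregation interval
	ALTER DATABASE CURRENT SET QUERY_STORE = ON (OPERATION_MODE = READ_WRITE,
	QUERY_CAPTURE_MODE = ALL,
	INTERVAL_LENGTH_MINUTES = 60);
	-- Start from an empty store so only this demo's data is returned
	ALTER DATABASE CURRENT SET QUERY_STORE CLEAR;
	GO
	-- First run: sort on an unindexed column, forced serial plan
	SELECT *
	FROM [Sales].[CustomerTransactions]
	ORDER BY [TaxAmount] DESC
	OPTION(MAXDOP 1);
	-- Flush the in-memory Query Store data to disk
	EXEC sp_query_store_flush_db;
	GO
	-- Rerun the same query within the same interval (GO 2 runs the batch twice)
	SELECT *
	FROM [Sales].[CustomerTransactions]
	ORDER BY [TaxAmount] DESC
	OPTION(MAXDOP 1);
	GO 2
	-- Examine the raw, unaggregated wait stats
	SELECT *
	FROM sys.query_store_wait_stats;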


Result set with duplicates:

Duplicates in Wait Stats
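
A quick way to confirm duplicates like these is to count rows per key. The following is a minimal sketch, assuming the key described earlier (plan_id, runtime_stats_interval_id, execution_type, wait_category); any row it returns is a combination that appears more than once in the view.

```sql
-- Sketch: list key combinations that occur more than once.
SELECT w.plan_id,
       w.runtime_stats_interval_id,
       w.execution_type,
       w.wait_category,
       w.wait_category_desc,
       COUNT(*) AS row_count
FROM sys.query_store_wait_stats AS w
GROUP BY w.plan_id,
         w.runtime_stats_interval_id,
         w.execution_type,
         w.wait_category,
         w.wait_category_desc
HAVING COUNT(*) > 1
ORDER BY w.plan_id,
         w.runtime_stats_interval_id;
```

An empty result means no key combination is currently duplicated in the view.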

In the example above I used the WideWorldImporters database and a query that causes TempDB spills (ORDER BY on an unindexed column with forced serial execution). That can also be tracked using the new ‘Temp DB Memory Used (KB)’ metric or the ‘Other Disk IO’ wait category (which contains IO_COMPLETION). There’s more than one way to skin a cat. Finally, my recommended way of getting data from sys.query_store_wait_stats:

	-- Collapse in-memory and on-disk rows into one row per key
	SELECT w.plan_id,
	w.runtime_stats_interval_id,
	w.execution_type,
	w.wait_category,
	w.wait_category_desc,
	SUM(w.total_query_wait_time_ms) AS total_query_wait_time_ms,
	-- unweighted mean of the per-row averages
	AVG(w.avg_query_wait_time_ms) AS avg_query_wait_time_ms,
	MIN(w.min_query_wait_time_ms) AS min_query_wait_time_ms,
	MAX(w.max_query_wait_time_ms) AS max_query_wait_time_ms
	FROM sys.query_store_wait_stats AS w
	GROUP BY w.plan_id,
	w.runtime_stats_interval_id,
	w.execution_type,
	w.wait_category,
	w.wait_category_desc;
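
The two ways of spotting the TempDB spill mentioned above can also be put side by side. This is only a sketch: the CTE names and output aliases are made up for illustration, it assumes the tempdb columns of sys.query_store_runtime_stats (SQL Server 2017 or Azure SQL Database, values in 8 KB pages), and it groups both views defensively before joining them.

```sql
-- Sketch: tempdb usage next to 'Other Disk IO' waits, per plan and interval.
WITH waits AS (
    SELECT w.plan_id,
           w.runtime_stats_interval_id,
           w.execution_type,
           SUM(w.total_query_wait_time_ms) AS other_disk_io_wait_ms
    FROM sys.query_store_wait_stats AS w
    WHERE w.wait_category_desc = N'Other Disk IO'
    GROUP BY w.plan_id,
             w.runtime_stats_interval_id,
             w.execution_type
),
tempdb_use AS (
    SELECT rs.plan_id,
           rs.runtime_stats_interval_id,
           rs.execution_type,
           MAX(rs.max_tempdb_space_used) AS max_tempdb_pages
    FROM sys.query_store_runtime_stats AS rs
    GROUP BY rs.plan_id,
             rs.runtime_stats_interval_id,
             rs.execution_type
)
SELECT t.plan_id,
       t.runtime_stats_interval_id,
       t.execution_type,
       t.max_tempdb_pages * 8 AS max_tempdb_kb,   -- pages to KB
       COALESCE(w.other_disk_io_wait_ms, 0) AS other_disk_io_wait_ms
FROM tempdb_use AS t
LEFT JOIN waits AS w
       ON w.plan_id = t.plan_id
      AND w.runtime_stats_interval_id = t.runtime_stats_interval_id
      AND w.execution_type = t.execution_type
WHERE t.max_tempdb_pages > 0
ORDER BY t.max_tempdb_pages DESC;
```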


Wait categories

Below are the wait categories mapped to wait types (based on BOL). They might be useful for demos and presentations.

	SELECT *
	FROM (VALUES (0,'Unknown','Unknown'),
	(1,'CPU','SOS_SCHEDULER_YIELD'),
	(2,'Worker Thread','THREADPOOL'),
	(3,'Lock','LCK_M_%'),
	(4,'Latch','LATCH_%'),
	(5,'Buffer Latch','PAGELATCH_%'),
	(6,'Buffer IO','PAGEIOLATCH_%'),
	(7,'Compilation*','RESOURCE_SEMAPHORE_QUERY_COMPILE'),
	(8,'SQL CLR','CLR%, SQLCLR%'),
	(9,'Mirroring','DBMIRROR%'),
	(10,'Transaction','XACT%, DTC%, TRAN_MARKLATCH_%, MSQL_XACT_%, TRANSACTION_MUTEX'),
	(11,'Idle','SLEEP_%, LAZYWRITER_SLEEP, SQLTRACE_BUFFER_FLUSH, SQLTRACE_INCREMENTAL_FLUSH_SLEEP, SQLTRACE_WAIT_ENTRIES, FT_IFTS_SCHEDULER_IDLE_WAIT, XE_DISPATCHER_WAIT, REQUEST_FOR_DEADLOCK_SEARCH, LOGMGR_QUEUE, ONDEMAND_TASK_QUEUE, CHECKPOINT_QUEUE, XE_TIMER_EVENT'),
	(12,'Preemptive','PREEMPTIVE_%'),
	(13,'Service Broker','BROKER_% (but not BROKER_RECEIVE_WAITFOR)'),
	(14,'Tran Log IO','LOGMGR, LOGBUFFER, LOGMGR_RESERVE_APPEND, LOGMGR_FLUSH, LOGMGR_PMM_LOG, CHKPT, WRITELOG'),
	(15,'Network IO','ASYNC_NETWORK_IO, NET_WAITFOR_PACKET, PROXY_NETWORK_IO, EXTERNAL_SCRIPT_NETWORK_IOF'),
	(16,'Parallelism','CXPACKET, EXCHANGE'),
	(17,'Memory','RESOURCE_SEMAPHORE, CMEMTHREAD, CMEMPARTITIONED, EE_PMOLOCK, MEMORY_ALLOCATION_EXT, RESERVED_MEMORY_ALLOCATION_EXT, MEMORY_GRANT_UPDATE'),
	(18,'User Wait','WAITFOR, WAIT_FOR_RESULTS, BROKER_RECEIVE_WAITFOR'),
	(19,'Tracing','TRACEWRITE, SQLTRACE_LOCK, SQLTRACE_FILE_BUFFER, SQLTRACE_FILE_WRITE_IO_COMPLETION, SQLTRACE_FILE_READ_IO_COMPLETION, SQLTRACE_PENDING_BUFFER_WRITERS, SQLTRACE_SHUTDOWN, QUERY_TRACEOUT, TRACE_EVTNOTIFF'),
	(20,'Full Text Search','FT_RESTART_CRAWL, FULLTEXT GATHERER, MSSEARCH, FT_METADATA_MUTEX, FT_IFTSHC_MUTEX, FT_IFTSISM_MUTEX, FT_IFTS_RWLOCK, FT_COMPROWSET_RWLOCK, FT_MASTER_MERGE, FT_PROPERTYLIST_CACHE, FT_MASTER_MERGE_COORDINATOR, PWAIT_RESOURCE_SEMAPHORE_FT_PARALLEL_QUERY_SYNC'),
	(21,'Other Disk IO','ASYNC_IO_COMPLETION, IO_COMPLETION, BACKUPIO, WRITE_COMPLETION, IO_QUEUE_LIMIT, IO_RETRY'),
	(22,'Replication','SE_REPL_%, REPL_%, HADR_% (but not HADR_THROTTLE_LOG_RATE_GOVERNOR), PWAIT_HADR_%, REPLICA_WRITES, FCB_REPLICA_WRITE, FCB_REPLICA_READ, PWAIT_HADRSIM'),
	(23,'Log Rate Governor','LOG_RATE_GOVERNOR, POOL_LOG_RATE_GOVERNOR, HADR_THROTTLE_LOG_RATE_GOVERNOR, INSTANCE_LOG_RATE_GOVERNOR')
	) T(Id, Wait_Category, Waits)
	ORDER BY Wait_Category
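
For a demo, the mapping above could be used to classify instance-level waits from sys.dm_os_wait_stats. The following is a rough sketch under several assumptions: it uses only a handful of rows from the list, skips entries with "(but not ...)" exceptions, relies on STRING_SPLIT (compatibility level 130 or higher), and needs VIEW SERVER STATE permission. The names `categories` and `patterns` are hypothetical.

```sql
-- Sketch: map wait types to Query Store wait categories via LIKE patterns.
WITH categories AS (
    SELECT *
    FROM (VALUES (3,  'Lock',          'LCK_M_%'),
                 (5,  'Buffer Latch',  'PAGELATCH_%'),
                 (6,  'Buffer IO',     'PAGEIOLATCH_%'),
                 (16, 'Parallelism',   'CXPACKET, EXCHANGE'),
                 (21, 'Other Disk IO', 'ASYNC_IO_COMPLETION, IO_COMPLETION, BACKUPIO, WRITE_COMPLETION, IO_QUEUE_LIMIT, IO_RETRY')
         ) AS T(Id, Wait_Category, Waits)
),
patterns AS (
    -- One row per pattern; trim the space that follows each comma
    SELECT c.Id,
           c.Wait_Category,
           LTRIM(RTRIM(s.value)) AS Pattern
    FROM categories AS c
    CROSS APPLY STRING_SPLIT(c.Waits, ',') AS s
)
SELECT p.Wait_Category,
       ws.wait_type,
       ws.waiting_tasks_count,
       ws.wait_time_ms
FROM sys.dm_os_wait_stats AS ws
JOIN patterns AS p
  -- Treat '_' literally; only '%' should act as a wildcard
  ON ws.wait_type LIKE REPLACE(p.Pattern, '_', '[_]')
ORDER BY ws.wait_time_ms DESC;
```

Escaping the underscore matters because LIKE otherwise treats it as a single-character wildcard, which could match unrelated wait types.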


Data [sic!]

Aggregator for Jan Rokicki's online interactions and articles
