484 #sqlserver batch mode on rowstore queries. DOP 8. run consecutively by a single session. each query is a fullscan of a different single table, grouping by a CASE expression on a non-indexed INT/BIGINT/NUMERIC column, producing one or two result rows of 1 column each.
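each of the 484 queries looks roughly like this; table and column names here are made-up stand-ins, not the real test objects:

```sql
-- minimal sketch of the query shape: fullscan, GROUP BY a CASE expression
-- on a non-indexed INT column, up to 2 result rows of one BIGINT column each.
-- dbo.some_table and some_int_col are hypothetical.
SELECT COUNT_BIG(*) AS cnt
FROM dbo.some_table
GROUP BY CASE WHEN some_int_col % 2 = 0 THEN 0 ELSE 1 END
OPTION (MAXDOP 8);
```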
based on observations from the first experiment with these queries, in which the queries were ordered alphabetically by the name of the only table in the FROM clause, i reordered the queries by decreasing pages:rows ratio. that gives the evident decreasing pages/sec trend :-)
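the ratio itself can be pulled from a DMV; a minimal sketch, assuming each table is a simple rowstore heap or clustered index:

```sql
-- compute each table's pages:rows ratio from sys.dm_db_partition_stats
-- and order the run list by it, descending.
SELECT t.name AS table_name,
       SUM(ps.used_page_count) AS total_pages,
       SUM(ps.row_count)       AS total_rows,
       SUM(ps.used_page_count) * 1.0
         / NULLIF(SUM(ps.row_count), 0) AS pages_to_rows
FROM sys.dm_db_partition_stats AS ps
JOIN sys.tables AS t
  ON t.object_id = ps.object_id
WHERE ps.index_id IN (0, 1)   -- heap or clustered index only
GROUP BY t.name
ORDER BY pages_to_rows DESC;  -- decreasing pages:rows ratio
```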
a closeup of the first 10 minutes shows the considerable variation even though the downward trend is evident.
those are some sharp peaks and valleys.
but for the 5 second capture interval (i usually use 30 seconds) i'd have missed it.
Yesterday i had a solid theory about the cause for the variation. No kidding; when i woke up today i couldn't remember it 😂😂😂
i remembered my theory about 10 minutes after remembering i had a theory 😂🤣🙃
i think it's htbuild/htdelete waits.
if there's no correlation between CPU utilization and page reads/sec, i have to let that theory go.
if there *is* a correlation... still doesn't mean i'm right.
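a quick way to check: snapshot the batch mode hash table wait types before and after a run and diff them.

```sql
-- growth in HTBUILD/HTDELETE (and friends) during the run
-- would support the theory.
SELECT wait_type, waiting_tasks_count, wait_time_ms, signal_wait_time_ms
FROM sys.dm_os_wait_stats
WHERE wait_type IN (N'HTBUILD', N'HTDELETE', N'HTMEMO',
                    N'HTREINIT', N'HTREPARTITION')
ORDER BY wait_time_ms DESC;
```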
do you see what i see?
LoL
my theory lives on for now.
this is a *super* important graph and i'll hafta try to explain why later.
i just had a super-scaling idea that needs to stay top-secret for now. maybe i'll send myself a dm so i remember it.
talk a little about this graph with pages/sec in blue area and cpu utilization as the red dotted line.
48 vcpu system. so for the DOP 8 BMoR queries, 8/48 = 1/6 ~ 16.6% is the max cpu utilization.
what about execution context ID 0 for the DOP 8 queries? doesn't that make max cpu utilization higher than 8/48 = 16.6%, since there are 9 worker threads for the dop 8 queries?
Nope. Placement of the execution context ID 0 thread relative to the parallel workers isn't pre-determined on this 48 vcpu system. it might be on one of the 8 schedulers that also have a parallel worker for the query.
in which case it doesn't matter *what* ecid 0 does; 16.6% max.
*if* ecid 0 is on a different scheduler than any of the 8 parallel workers in a DOP 8 query with a single zone (this will commonly be the case on a 48 vcpu vm), then the questions are: 1) what does ecid 0 do 2) *when* does ecid 0 do it relative to work of the 8 parallel workers
what does ecid 0 do?
in these queries, ecid 0 compiles the plan, sends results to the client, and takes care of housekeeping to go from one table/query to the next.
compiling the plan takes place when there *are* no parallel workers for that ecid 0. so does the housekeeping to go from one table/query to the next.
there are only up to 2 result rows from this aggregation query - a single BIGINT column in each row.
so sending results to the client - which happens at the end of the parallel processing - is a minimal task.
so... 16.6% CPU utilization is the maximum CPU consumption for these DOP 8 queries on this 48 vcpu vm.
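and to see where ecid 0 actually lands for a given query, something like this from a second session while the query runs (session_id 55 is just a placeholder):

```sql
-- shows the scheduler placement of every execution context for the query;
-- compare ecid 0's scheduler_id to those of the 8 parallel workers.
SELECT exec_context_id, scheduler_id, task_state
FROM sys.dm_os_tasks
WHERE session_id = 55
ORDER BY exec_context_id;
```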
a shortcut to showing that the valleys in pages/sec are largely due to waits (rather than a varying pace of work per % CPU utilized) is to "normalize" or "embiggen" the pages/sec number to the value it would have if CPU were at 16.6% at the time.
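the "embiggen" step is just arithmetic; a sketch for one capture interval, with hypothetical observed values:

```sql
-- scale observed pages/sec up to what it would be at the 16.6% CPU ceiling.
DECLARE @cpu_pct       decimal(5,2)  = 9.3,     -- observed CPU % (hypothetical)
        @pages_per_sec decimal(18,2) = 41000;   -- observed page reads/sec (hypothetical)
SELECT @pages_per_sec * (100.0 / 6.0) / @cpu_pct AS normalized_pages_per_sec;
```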
dang it.
whaddya say now, smartie pants?
it was a solid idea. And it smooths the curve a bit.
i was hoping for more.
you know. something amazing, i guess.
this means there's at least one other contributor to the curve, other than waits.
maybe 484 plan compiles and housekeeping at irregular points along the way in ~37 minutes was a bigger confounding variable than i expected.
power runs at DOP 12 and DOP 4, with graphs adjusting page reads/sec to 25% and 8.3% CPU utilized respectively (12/48 and 4/48), compared to what i have at DOP 8, may indicate how much the DOP 1 "nonquery processing" work confounds the results.
as a minor consolation to myself, and a sign of the progress i've made understanding what's happening so far, here is #sqlserver page reads/s vs cpu utilization.
the top graph is the same 484 queries, in alphabetical order of their table names.
bottom graph, sorted by pages:rows ratio.
there's something happening here
what it is ain't exactly clear...
if i were to run two sessions at a time with DOP 8 queries, with the goal of getting through this list of 484 queries as fast as possible, this is how I'd do it.
one session starts at the high end of the pages:rows ratio, the other session starts at the low end; meet in middle.
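a sketch of the split, assuming a hypothetical #table_ratios work table holding the 484 table names and their pages:rows ratios:

```sql
-- session A burns down the high-ratio end, session B the low-ratio end;
-- they meet in the middle.
SELECT table_name,
       CASE WHEN rn_desc <= rn_asc THEN 'session A' ELSE 'session B' END AS assigned_to
FROM (SELECT table_name,
             ROW_NUMBER() OVER (ORDER BY pages_to_rows DESC) AS rn_desc,
             ROW_NUMBER() OVER (ORDER BY pages_to_rows ASC)  AS rn_asc
      FROM #table_ratios) AS r
ORDER BY rn_desc;
```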
*at least one other contributor to /variance from/ the curve, other than waits.
the sole contributor to the curve is pages:rows ratio, since the plan for every query is the same.
just like that, i'm probably done thinking about scalability strategies for #sqlserver, and back to investigating latch timeouts on very busy systems.
here's the thing. the number of mdmp files for a given large-memory system can be pretty small, especially if an Availability Group is involved.
while a dump is in progress, #sqlserver memory must remain in a consistent state. the more memory, the longer the dump may take (there are other factors i won't enumerate here).
but, Availability Group lease timeout can only be set as high as 100 seconds currently.
wow
~~
Oracle in Talks to Buy Cerner
An agreement, which could potentially be worth $30 billion, would rank as biggest ever for software giant
2021 December 16 wsj.com/articles/oracl…
i don't know that i've ever heard a story about Oracle in talks to buy a company with an ending other than Oracle *buying* that company.
oh. i forgot about the TikTok debacle, mentioned by the Financial Times here.
so that's *one* story about Oracle not buying.
~~
Oracle nears deal to buy health IT company Cerner for $30bn
2021 December 16 ft.com/content/9bf806…
i'll nitpick two details, though :-)
the 512k upper bound for disk IO size isn't properly a #sqlserver limit, although it *is* the limit on most systems. it's the upper bound of the disk adapter.
some disk adapters - most fibre channel HBAs and some others, too - allow the maximum disk transfer size to be configured higher than 512k.
indubitably
~~
"To conduct tactical monitoring, we must also consider the complexities of virtualization, consolidation and concurrency alongside the activities of multiple CPU cores."
"... for capacity planning, this assumption is mostly harmless.
However, this assumption is reckless for tactical monitoring..."
Startling?
~~
We invited an AI to debate its own ethics in the Oxford Union – what it said was startling
2021 December 10 theconversation.com/we-invited-an-…
Startlingly... ridiculous!
"The good news is that you don’t have to build your own AI team. You can outsource your AI work to experts in the field, which can help you make the most of technology and ensure that you don’t fall victim to the same AI problems as your competitors."