SQL Deadlock ChangeNotificationHeartBeat

Don't miss out Virtual Happy Hour this Friday (April 26).

Try our conversational search powered by Generative AI!

AI OnAI Off

Home / Forums / Developers (Main Products) Forum / CMS 7.5 - 11 /

SQL Deadlock ChangeNotificationHeartBeat

A Baillie

Vote:

We are seeing deadlocks in SQL seemingly caused by ChangeNotificationHeartBeat which in turn calls ChangeNotificationAccessConnectionWorker which is updating values in tblChangeNotificationConnection. This race condition is being triggered seemingly in a load balanced environment as both servers are triggering the ChangeNotificationHeartBeat procedure every 5 seconds and sometimes overlap.

Can someone confirm if this is a known issue and/or there is a work around or if this is caused by the misconfiguration of a load balanced environment and point me at the appropriate documentation. From re-reading http://world.episerver.com/documentation/Items/Developers-Guide/EPiServer-CMS/8/Deployment/Deployment-scenarios/Deploying-to-Windows-Servers/ I cannot see a step that would prevent this issue.

Whilst we are running a commerce setup this is posting here as my understanding is that this notification system is part of the wider CMS framework

Thanks,Andrew

#123132

Jun 25, 2015 13:34

Mads Gosvig

Vote:

Hi Andrew,

We are experiencing the exact same thing.

We also have a load balanced environment, and can reproduce it by setting the "Maximum Worker Processes" to more than 1 (simulate load balanced environment) in our IIS applicataion pool.

(We have modified our local host files, and set up an IIS site with application pool on our local developer machines)

We have not found a solution, but are talking to Episerver to figure this out.

Do you have any steps on how to reproduce this maybe with the Quicksilver project?

If anyone have experinced this it would be cool if you would reply to this so we can figure this out! :)

#164180

Oct 24, 2016 11:17

Mads Gosvig

Vote:

For anyone who wants to reproduce this issue (I also send this EPiServer and will post an update once I get an answer)

I managed to recreate the problem in Episervers Quicksilver project.

Here are the steps I did to recreate the problem:

Download the Quicksilver project https://github.com/episerver/Quicksilver
Load up the project in Visual Studio
Locate the file connectionString.config in the EPiServer.Reference.Commerce.Site project
Outcomment EPiServerAzureBlops and EPiServerAzureEvents lines (line 5 & 6)
Build the project (restoring packages requires episerver nuget feed)
Run the file SetupDatabases.cmd in the folder Setup.
Start the project in debug-mode and do the final migration step (should be automatic) (login is admin/store)

Then we must setup the project in IIS (Internet Information Services):

Open your localhost file located in C:\Windows\System32\drivers\etc\hosts
Add the line:
127.0.0.1 dev.quicksilver.com
at the end of the file.
Open IIS and create a new website.
Fill out the following:
Site name: dev.quicksilver.com
physical path: path to EPiServer.Reference.Commerce-Site folder
Host name: dev.quicksilver.com
Now go to application pools in IIS – Right click the dev.quicksilver.com pool and go to advanced settings.
Change the value of the “Maximum Worker Processes” to for example 10 (2 should be fine but the error will happen more frequently if you change it to a high value)

Now we must start multiple instances of the site:

Start your browser and go to dev.quicksilver.com and the site will start up (styles might be missing but we don’t care about that)
Start an incognito browser and browse to dev.quicksilver.com – If the site takes 5-10 seconds to start it means you successfully started a new instance in the IIS.
Close the incognito browser and repeat step 2 until you have seen at least 4-5 long loads (meaning you have around 5 instances of the IIS running)
This can also be checked in the task manager – Check the processes “IIS Worker Process” if the use “a lot of memory” it means they are running instances of the site.

Now we need to capture the deadlocks in the SQL Profiler:

Start SQL Profiler.
Start a new trace using the template TSQL_Locks.
Go to “Events selection” and tick off (as in we do not care about these - keep the others "clicked") Lock:Cancel, SP:StmtCompleted, SP:StmtStarting, SQL:StmtCompleted & SQL:StmtStarting.

Now that the profiler is running is time to wait.

For me it took some time (30 minutes until the first deadlock happened)

#164402

Edited, Oct 24, 2016 13:49

Vincent

Vote:

Disable the scheduler on other servers from web.config, make sure there is only one server running the schedule job.

#170728

Oct 27, 2016 14:18

Please login to post a reply