Channel: SQLServerCentral » SQL Server 2005 » SS2K5 Replication » Latest topics

Queue Reader Agent error

Hi,

I've been searching for a long time for a way to get rid of this error:

[quote]The replication agent has not logged a progress message in 10 minutes. This might indicate an unresponsive agent or high system activity. Verify that records are being replicated to the destination and that connections to the Subscriber, Publisher, and Distributor are still active.[/quote]

This issue has been raised a few times here and many times elsewhere, but with no solutions:

[url]http://www.sqlservercentral.com/Forums/Topic445184-291-1.aspx[/url]
[url]http://www.sqlservercentral.com/Forums/Topic1168869-391-1.aspx[/url]
[url]http://www.sqlservercentral.com/Forums/Topic1173906-291-1.aspx[/url]
[url]http://www.sqlservercentral.com/Forums/Topic1038860-1550-1.aspx[/url]

Replication seems to be working fine. The big red circle with a cross in Replication Monitor makes us nervous, though.

The publisher that shows the cross has 8 publications with a total of 9 subscriptions (1 each, except for one with 2 subscriptions).

Environment: Windows 2003 (clustered), SQL 2005 ==>> 2000 / 2005 / 2008.

I have been comparing dbo.syspublications, dbo.syssubscriptions and dbo.sysarticles to those on a similar box with similar replications.
I can't find any differences in values that would point to a possible configuration difference.

I have tried all sorts of things to resolve this.

[code="sql"]
USE master
EXEC sp_changedistributor_property
    @property = N'heartbeat_interval',
    @value = 5;
[/code]

A bigger or smaller interval only changes the number in the error message.

dibbydibby posted a query in one of the above links to see whether the distributor is overloaded.

[code="sql"]
-- Commands per article
select ag.name Publication, s.undelivcmdsindistdb, s.delivcmdsindistdb
from msdistribution_status s, msdistribution_agents ag, msarticles ar
where s.agent_id = ag.id
  and s.article_id = ar.article_id
order by undelivcmdsindistdb desc

-- Commands per publication
select ag.name Publication,
       COUNT(*) Articles,
       SUM(s.undelivcmdsindistdb) undelivcmdsindistdb,
       SUM(s.delivcmdsindistdb) delivcmdsindistdb
from msdistribution_status s, msdistribution_agents ag, msarticles ar
where s.agent_id = ag.id
  and s.article_id = ar.article_id
GROUP BY ag.name
order by undelivcmdsindistdb desc
[/code]

From this I gleaned that retrieving all 84 rows from msdistribution_status takes 15-30 seconds. This is due to msrepl_commands having 7 million rows. Is this a normal size? The troubled publisher only occasionally has 20+ in the undelivcmdsindistdb column. If anything, it looks like another server is having problems, with very high undelivcmdsindistdb values, but it's not.

I have also cleared out [Distribution].[dbo].msqreader_history and then manually started the job "Replication agents checkup". All agent_ids are writing to this table.

Somewhere someone posted this query.
Though they didn't say what you should be looking for.

[code="sql"]
select 'dbcc inputbuffer(' + CAST(spid AS Varchar(5)) + ')'
from sys.sysprocesses
where program_name like '<Publisher>%'

dbcc inputbuffer(73)
[/code]

Another suggestion was to run

[code="sql"]
EXEC sp_browsereplcmds
[/code]

which returns:

[quote]Msg 6513, Level 16, State 27, Procedure sp_browsereplcmds, Line 110
Failed to initialize the Common Language Runtime (CLR) v2.0.50727 due to memory pressure. Please restart SQL server in Address Windowing Extensions (AWE) mode to use CLR integration features.[/quote]

I suppose this may or may not be the cause. We've got 16 cores / 16 GB RAM on this clustered box, with a 99.98% buffer cache hit ratio. There is no other reason to believe memory is an issue.

The following shows the unioned msqreader_history results from the troublesome publisher and the comparable publisher, Svr1 being the naughty one.

[code="sql"]
SvrName | agent_id | publication_id | runstatus | start_time | time | duration | comments | transaction_id | transaction_status | transactions_processed | commands_processed | delivery_rate | transaction_rate | subscriber | subscriberdb | error_id | timestamp
Svr1 | 1 | NULL | 1 | 2013-03-26 13:29:59.297 | 2013-03-26 13:29:59.297 | 0 | Starting agent. | NULL | 0 | 0 | 0 | 0 | 0 | NULL | NULL | 0 | 0x0000000005AFB7AE
Svr1 | 1 | NULL | 6 | 2013-03-26 13:29:59.297 | 2013-03-26 13:35:00.453 | 301 | The replication agent has not logged a progress message in 5 minutes. This might indicate an unresponsive agent or high system activity. Verify that records are being replicated to the destination and that connections to the Subscriber, Publisher, and Distributor are still active. | NULL | 0 | 0 | 0 | 0 | 0 | NULL | NULL | 0 | 0x0000000005AFBA7D
Svr2 | 1 | NULL | 4 | 2013-03-10 18:12:06.160 | 2013-03-26 17:16:44.530 | 1379078 | No queued transaction available | NULL | 0 | 0 | 0 | 0 | 0 | NULL | NULL | 0 | 0x0000000005A157F1
[/code]

Could use some help on this one.

Cheers.

P.S. Sorry for the long post.
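
For anyone wanting to reproduce the msqreader_history comparison, something along these lines could pull it together. This is only a sketch under assumptions: it is run on Svr1's distributor, [Svr2] is a hypothetical linked server pointing at the other distributor, and both distribution databases are named Distribution. Only a subset of the columns is selected to keep it short.

[code="sql"]
-- Sketch only: [Svr2] is an assumed linked server name, not a real object.
SELECT 'Svr1' AS SvrName,
       agent_id, publication_id, runstatus,
       start_time, [time], duration, comments, error_id
FROM [Distribution].dbo.MSqreader_history

UNION ALL

SELECT 'Svr2' AS SvrName,
       agent_id, publication_id, runstatus,
       start_time, [time], duration, comments, error_id
FROM [Svr2].[Distribution].dbo.MSqreader_history

-- Sort so each server's history reads oldest to newest.
ORDER BY SvrName, [time];
[/code]

Tagging each branch with a literal server name keeps the rows distinguishable after the union, since agent_id is 1 on both boxes.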
