Intrusion Prevention
Highlighted
Intrusion Prevention

trying and failing to get DB Replication running

‎07-15-2013 03:27 AM

All,

 

I've spent the last weekend trying to get two managers to replicate the database from the Primary to the Secondary. I've checked the comms (I can ssh from one to the other using nsm and not get a password prompt). I have tried recreating the database - no luck - I have tried fixing the issue as detailed in KB10465 and no improvement at all.

 

tailing the guiDaemon.0 log file gives me this message:

 

[07/15/2013 11:15:49.949] [Error] [3062122176-xdb.cpp:70] Unable to start Xdb due to error: Exception [5]: Failed to create container "category_self" according to spec

StackTrace:
136535982: /usr/netscreen/GuiSvr/bin/.guiSvrManager(_ZN9ExceptionC2ERKSt9exceptionRKSsi+0xae) [0x8235fae]
135946676: /usr/netscreen/GuiSvr/bin/.guiSvrManager(_ZN16XmlContainerImpl30createContainerAccordingToSpecEv+0x484) [0x81a61b4]
136241348: /usr/netscreen/GuiSvr/bin/.guiSvrManager(_ZN21AbstractContainerImpl4initEPK13ContainerSpecP7XdbImpl+0x104) [0x81ee0c4]
136416671: /usr/netscreen/GuiSvr/bin/.guiSvrManager(_ZN7XdbImpl28initContainerAccordingToSpecEv+0xaf) [0x8218d9f]
136417444: /usr/netscreen/GuiSvr/bin/.guiSvrManager(_ZN7XdbImpl5startEPKci+0x174) [0x82190a4]
136343466: /usr/netscreen/GuiSvr/bin/.guiSvrManager(_ZN3Xdb5startEPKci+0xda) [0x8206faa]
136523100: /usr/netscreen/GuiSvr/bin/.guiSvrManager(nsXdbStart+0x5c) [0x8232d5c]
136534626: /usr/netscreen/GuiSvr/bin/.guiSvrManager(xdbStartup+0x62) [0x8235a62]
136531478: /usr/netscreen/GuiSvr/bin/.guiSvrManager(xdbLinkSupport_init+0x156) [0x8234e16]
137385422: /usr/netscreen/GuiSvr/bin/.guiSvrManager(setDbMgrLinkSupport_init+0xce) [0x83055ce]
Previous exception ============>
Error: Cannot allocate memory
[07/15/2013 11:15:49.950] [Error] [3062122176-startstop.c:28] Failed to start the Xdb due to error!
[07/15/2013 11:15:49.951] [Fatality] [3062122176-xdbLibSupport.c:88] xdbStartup() failed

 

haStatus gives me this message:

 

=======================================================
H/A process status
=======================================================


Retrieving status...
highAvail (pid 29183)..............................ON
highAvailSvr (pid 30407)...........................ON


=======================================================
State of the local and peer H/A server
=======================================================


Local Server:
172.16.0.119 error network-up db-repl:dirty

Peer Server:
172.16.0.51 starting services network-updb-repl:dirty


=======================================================
Reasons that the local H/A server went into error mode
=======================================================


[07/15/2013 11:24:27.753] [Notice] [1121664-highAvail.c:1532] State-transition: both servers running, both db dirty, both servers should enter error mode

 

I am at my wits end with this time vampire. Does anyone know what I can do to fix this issue?

1 REPLY
Intrusion Prevention

Re: trying and failing to get DB Replication running

[ Edited ]
‎06-27-2018 10:06 AM

Were you ever able to figure out the problem? I'm having a similar problem with 2012. I've forced replication  by clearing data directory and it starts replicating but hangs half way and then restart every few minutes/hours. Both devices are up but showing 'dirty' but only the secondary is working properly.