Restoring config servers, particularly if you have had some sort of catastrophic event, is tricky but not impossible. But before we go any further, a big bold caveat:
BACK UP EVERYTHING
That means taking a backup of all three config servers. I am going to give you some advice, and it is generally correct, but please, please take a backup of every current config server instance before you overwrite or replace anything.
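As a minimal sketch (the first hostname is from your setup; the other two names and the backup paths are placeholders to substitute with your own), mongodump can take that backup from each config server:

```
# Dump the config database from each config server (hosts and paths are examples)
mongodump --host mongocfg1.testing.com --port 27000 --db config --out /backup/mongocfg1
mongodump --host mongocfg2.testing.com --port 27000 --db config --out /backup/mongocfg2
mongodump --host mongocfg3.testing.com --port 27000 --db config --out /backup/mongocfg3
```

A filesystem copy of each server's dbpath (with the process stopped) works just as well if you prefer.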
As a quick explanation, config servers are not configured as a replica set - each config server instance is supposed to be identical (at least for all the collections that matter) to the others. Hence, any healthy config server can be used to replace an unhealthy one, and you can then follow the tutorial you mentioned to get back to a good config.
The key to recovery is to identify the healthy config server and then use that to replace the others - you then end up with 3 identical config servers.
There is more than one way to do this; the approaches basically fall into three categories:
1) Use the error message
The error message that is printed out actually lets you know which config server it believes is healthy, though that is not obvious from the messaging. Here's how to read it generically:
ERROR: config servers not in sync! config servers <healthy-server> and <out-of-sync-server> differ
Basically, the first one in the list is the healthy one; in your case that would be mongocfg1.testing.com:27000. That is our first candidate for a healthy config database.
2) Use dbhash to compare all three and pick the ones that agree
On each config server, switch to the config database using use config, run db.runCommand("dbhash"), and compare the hashes for the collections below:
- chunks
- databases
- settings
- shards
- version
You are looking for two servers that agree; use that agreement as the basis to determine that the version of the config database on those hosts is trustworthy and can be used to seed the rest.
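As a sketch of that comparison (the exact output shape can vary by version), run this on each of the three config servers:

```
use config
db.runCommand("dbhash")
// Compare the per-collection hashes in the "collections" field, e.g.:
// "collections" : { "chunks" : "...", "databases" : "...",
//                   "settings" : "...", "shards" : "...", "version" : "..." }
```

Two servers whose hashes match for those collections are your trustworthy pair.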
3) Manually inspect the collections in the config database
Finally, take a look at the config database, and pay attention to the collections listed in the second option above. This is a straight judgement call based on your familiarity with your data.
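For example (a rough sketch; what "looks right" depends entirely on your cluster), you might eyeball the key collections like this:

```
use config
db.shards.find()      // should list every shard you expect
db.databases.find()   // sharded databases and their primary shards
db.chunks.count()     // compare the chunk count across all three servers
```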
Hopefully all three methods point you at the same host (or hosts). That config server should be used to seed the other two (after you have taken backups so you can go back). That is basically your best bet. Should that fail, then you may want to try one of the other versions (from the backups) - always making sure that when you start them, all three are identical.
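One way to do that seeding, as a hedged sketch (the hostnames for the second and third servers and the /data/configdb path are assumptions; adjust for your environment), is to stop all three config servers and copy the healthy server's data files over the other two:

```
# 1. Stop all three config server mongod processes first
# 2. From the healthy server, copy its dbpath over the other two
rsync -a /data/configdb/ mongocfg2.testing.com:/data/configdb/
rsync -a /data/configdb/ mongocfg3.testing.com:/data/configdb/
# 3. Restart all three config servers, then the mongos processes
```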
Finally, always ensure that all mongos processes are using the same config server string, and that all 3 servers are always listed in the same order on every process - not doing so across all mongos processes can lead to (very) odd results.
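To illustrate (the first hostname is from your setup, the other two are assumed), every mongos should be started with an identical --configdb string:

```
# Same hosts, same order, on every mongos in the cluster
mongos --configdb mongocfg1.testing.com:27000,mongocfg2.testing.com:27000,mongocfg3.testing.com:27000
```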
Just a shot in the dark: could it be a problem with the primary servicing too many connections simultaneously?
This information comes from the MongoDB 202 training.
If a primary can no longer service requests, there will be:
- socket exceptions
- timeouts
- the other servers being unable to ping the primary, possibly causing an election
Limiting connections to mongos with --maxConns can insulate the mongod processes from getting into unpredictable states.
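As a sketch (the number is illustrative, not a recommendation), the cap is set when starting mongos:

```
# Cap simultaneous incoming connections on this mongos (10000 is just an example)
mongos --configdb <config server string> --maxConns 10000
```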
From MongoDB best practices:
Connections: MongoDB drivers implement connection pooling to facilitate efficient use of resources. Each connection consumes 1MB of RAM, so be careful to monitor the total number of connections so they do not overwhelm the available RAM and reduce the available memory for the working set. This typically happens when client applications do not properly close their connections, or with Java in particular, which relies on garbage collection to close the connections.
If there are too many connections open on the primary at any given time (from secondaries, administrative tools, clients, etc.), then your primary could also be running out of memory.
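As a quick worked check (the 1MB-per-connection figure above is approximate), you can read the current connection count from serverStatus and estimate the memory cost:

```
db.serverStatus().connections
// e.g. { "current" : 5000, "available" : 15000 }
// at roughly 1MB per connection, 5000 open connections ≈ 5GB of RAM
```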
Consider an example layout with connections coming from drivers and secondaries, to and from the different processes - this can all quickly add up depending on the number of clients and machines in your infrastructure!
Best Answer
As I can see in your error log file, you have an error such as:
Fetcher stopped querying remote oplog with error: InvalidSyncSource: sync source
As per the MongoDB JIRA documentation here, it seems that the common cause for the remote replSetUpdatePosition command failing with the error codes NodeNotFound, NotMasterOrSecondary, and InvalidReplicaSetConfig would be a change in config. Config changes currently cause the oplog fetcher to fail, because it detects a change in the config version contained in the oplog query metadata. Subsequently, after exiting the oplog fetcher, we go back to the sync source resolver, which has its own blacklisting logic. For your further reference, see here.
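If you want to confirm that a reconfig coincided with those errors, comparing the replica set config version across members is a reasonable first check (a sketch using the standard shell helpers):

```
// Run on each member; a version bump around the time of the errors points at a reconfig
rs.conf().version
// cross-check member health while you are at it
rs.status().members.map(function (m) { return { name: m.name, state: m.stateStr }; })
```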