MySQL Replication query – Primary key need

Tags: mysql, mysql-5.6, percona, primary-key, replication

I have a few doubts regarding MySQL replication (Master-Slave).
Is it mandatory for a table to have a primary key for replication to function properly?

I am referring to this Percona link for the above question. It mentions: "If there is no primary key or unique key defined then it’s even worse because INSERT may be re-executed and you will get multiple rows with the same data – which again means you’ve got inconsistent data with the master."

But with InnoDB as the storage engine, if a table does not have a primary or unique key defined, the engine itself creates a hidden clustered index on a synthetic column containing row ID values, as per Jeremy Cole's blog.

So even when no primary or unique key is present, replication should not be affected, because InnoDB itself creates a clustered index, which should ensure that replication runs smoothly. Is that correct? I'm not sure about this part.

It would be great if someone could shed some light on the need for a primary key in a Master-Slave replication setup.

Best Answer

Your assumption is incorrect.

The hidden clustered index does nothing to help here, because the server can't use that index to find rows in the way that replication needs.

This causes some problems when the master is logging in STATEMENT mode, different problems in ROW mode, and all of the problems, combined, in MIXED mode. Here's the tl;dr on the biggest pitfall:

Every replication event against a table without a primary or unique key whose value is accessible through the SQL interface has the potential to require a full table scan for each row replicated.

The inaccessibility of the hidden clustered index to the SQL layer excludes it from use... and it wouldn't help even if it were not hidden, since its value is not deterministic from server to server, and so it is never written to the binlog.
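To make the cost concrete, here is a toy Python model. It is not MySQL's applier code, only a sketch under the assumption that a replicated row change identifies its target by column values alone. The table layout and the names `find_by_scan` and `find_by_key` are invented for illustration.

```python
from typing import Dict, List, Optional, Tuple

Row = Tuple[int, str]  # (id, name): a hypothetical two-column table


def find_by_scan(rows: List[Row], before_image: Row) -> Tuple[Optional[int], int]:
    """Locate a row by comparing full column values, which is all a replica
    can do when no usable key exists. Returns (position, rows_examined)."""
    examined = 0
    for pos, row in enumerate(rows):
        examined += 1
        if row == before_image:
            return pos, examined
    return None, examined


def find_by_key(index: Dict[int, int], before_image: Row) -> Tuple[Optional[int], int]:
    """Locate a row through a primary-key index (id -> position).
    Returns (position, index_lookups)."""
    return index.get(before_image[0]), 1


rows = [(i, f"name-{i}") for i in range(100_000)]
pk_index = {row[0]: pos for pos, row in enumerate(rows)}
target = rows[-1]  # worst case for the scan: the last row in the table

scan_pos, scan_cost = find_by_scan(rows, target)
key_pos, key_cost = find_by_key(pk_index, target)
print(scan_cost, key_cost)  # prints: 100000 1
```

Both lookups find the same row, but the scan examines every row to get there, and that cost is paid again for each replicated row.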

Simply enough... you should not create tables without accessible primary keys. It's bad design.

I would argue that a valid table cannot have two identical rows -- every relation has at least one candidate key, by definition, and candidate keys must be unique, also by definition. Choose one and make it primary, or create a surrogate (auto-increment) so that it's visible. If you have unique constraints, you need to define them in the schema, not just impose them in code.

Related Solutions

MySQL Replication – Fixing Replication Errors

OPTION 1

Skip the error, wait 5 seconds, and view the Slave Status. Here are the 5 steps for Skipping an Error:

STOP SLAVE;
SET GLOBAL SQL_SLAVE_SKIP_COUNTER = 1;
START SLAVE;
SELECT SLEEP(5);
SHOW SLAVE STATUS\G

When you view the Slave Status, here is what to expect

If Seconds_Behind_Master is NULL
- Replication is Broken : Look for Tell-Tale Signs
- If Error Number is 1062 again, Repeat the 5 steps for Skipping an Error
If Seconds_Behind_Master is a Number
- Replication is running
- When Seconds_Behind_Master > 0, Replication is Catching Up.
- When Seconds_Behind_Master = 0, Replication is Fully Caught Up.
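The decision list above can be sketched as a small Python helper. This is only an illustration: it assumes the `SHOW SLAVE STATUS` row has already been fetched into a dict keyed by column name (for example by a client library's dictionary cursor), and the function name `describe_replication` is hypothetical.

```python
from typing import Any, Mapping


def describe_replication(status: Mapping[str, Any]) -> str:
    """Classify one row of SHOW SLAVE STATUS output, passed in as a dict."""
    lag = status.get("Seconds_Behind_Master")
    if lag is None:
        # NULL lag means replication is not running.
        errno = int(status.get("Last_Errno") or 0)
        if errno == 1062:
            return "broken: duplicate-key error 1062 again"
        return "broken: look for tell-tale signs"
    if int(lag) > 0:
        return "running: catching up"
    return "running: fully caught up"


print(describe_replication({"Seconds_Behind_Master": None, "Last_Errno": 1062}))
# prints: broken: duplicate-key error 1062 again
print(describe_replication({"Seconds_Behind_Master": 42, "Last_Errno": 0}))
# prints: running: catching up
print(describe_replication({"Seconds_Behind_Master": 0, "Last_Errno": 0}))
# prints: running: fully caught up
```

Values are passed through `int()` because some clients return these columns as strings.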

OPTION 2

Remove the row to allow replication to continue

Delete the row from the table on the Slave, then run the following 4 steps. No skip counter is needed this time, because the conflicting row is gone:

STOP SLAVE;
START SLAVE;
SELECT SLEEP(5);
SHOW SLAVE STATUS\G

At the risk of sounding redundant...

When you view the Slave Status, here is what to expect

If Seconds_Behind_Master is NULL
- Replication is Broken : Look for Tell-Tale Signs
- If Error Number is 1062 again, delete the row and repeat the 4 steps
If Seconds_Behind_Master is a Number
- Replication is running
- When Seconds_Behind_Master > 0, Replication is Catching Up.
- When Seconds_Behind_Master = 0, Replication is Fully Caught Up.

What if there are just too many duplicate key issues? My earlier posts cover how to use Maatkit's mk-table-checksum and mk-table-sync, or their Percona Toolkit successors pt-table-checksum and pt-table-sync, in that situation.

Percona MySQL – Unique Key is Duplicated

This looks like you might have hit a bug logged against Percona Server 5.5:
Concurrent duplicate inserts can violate a unique key constraint in InnoDB tables.

There is no fix and no reproducible test case for this bug yet. It has only been observed in a production environment.

The pattern described is:

1. INSERT a value into a column with a unique constraint.
2. DELETE that row.
3. Two concurrent sessions INSERT new rows that have the same value as in the deleted row.
4. Both sessions commit, and both of their INSERTs succeed.

The root cause might be related to unfinished purging of the deleted row. In InnoDB, deleting an entry from an index is a multi-step process. First, the entry is "delete-marked" which leaves the entry in the index so as to postpone the physical removal from the index. Then later, the purge thread performs the final removal, which may include some rebalancing of the B-tree.

If you try to insert the same value as one that is delete-marked, InnoDB simply removes the delete mark and associates the value with the new row you insert.

Based on the bug report, it seems that while the deleted entry is merely delete-marked, but not yet purged, two concurrent sessions can insert the same value. This probably happens all the time on non-unique indexes, and it's no problem. But of course this is a problem if the index is a unique index.
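The delete-mark lifecycle described above can be pictured with a toy Python model. This is a single-threaded sketch of the idea, not InnoDB's implementation, and it does not reproduce the race in the bug report. The class `ToyUniqueIndex` and its methods are invented for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Entry:
    row_id: int
    delete_marked: bool = False


class ToyUniqueIndex:
    """Toy unique index with delete-marking and a later purge step."""

    def __init__(self) -> None:
        self.entries: Dict[str, Entry] = {}

    def insert(self, value: str, row_id: int) -> None:
        entry = self.entries.get(value)
        if entry is None:
            self.entries[value] = Entry(row_id)
        elif entry.delete_marked:
            # Reuse the delete-marked entry: clear the mark, point at the new row.
            entry.row_id = row_id
            entry.delete_marked = False
        else:
            raise ValueError(f"Duplicate entry {value!r}")

    def delete(self, value: str) -> None:
        # First step of removal: mark only, the entry stays in the index.
        self.entries[value].delete_marked = True

    def purge(self) -> List[str]:
        # Later step: physically remove entries that are still delete-marked.
        purged = [v for v, e in self.entries.items() if e.delete_marked]
        for v in purged:
            del self.entries[v]
        return purged


idx = ToyUniqueIndex()
idx.insert("a@example.com", 1)
idx.delete("a@example.com")      # delete-marked, not yet purged
idx.insert("a@example.com", 2)   # reuses the marked entry
try:
    idx.insert("a@example.com", 3)
except ValueError as exc:
    print(exc)  # prints: Duplicate entry 'a@example.com'
```

In this serial model the third insert is rejected, which is the behaviour a unique index should always show. The bug report describes two concurrent sessions both getting past that check while the entry is still delete-marked.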

Sorry, there's no resolution to this bug yet. I encourage you to log into launchpad and register that this bug affects you. If you can post additional information about how the bug occurs in your environment, that would be helpful too. Best of all is if you can help create a reproducible test case!

Also, this might be related to a bug against stock MySQL, Bug #69979 "columns with unique key gets intermittent duplicate values!", although some details are different. That MySQL bug was closed as "not a bug" because the developers apparently concluded that in InnoDB's MVCC architecture, it's acceptable for some conflicts to occur and produce invalid results based on race conditions. IMHO, this should earn them a resounding "WTF?!"
