Percona Tools – Difference Between Xtrabackup with and without –safe-slave-backup

backuppercona-toolsxtrabackup

Currently I'm studying Percona's xtrabackup. In the manual in the paragraph "Taking Backups in Replication Environments" it says that using the --safe-slave-backup
option is always recommended and I understand the reasons behind it.

I'm just wondering now, if there is actually a difference in the result when I do not use this option. After applying the logs on a backup, I don't see why there should be a difference between a backup taken with or without this option.

I'm asking, because in our production environment we do not use this option. The backup runs at midnight, but it failed this night. The backup is taken on a slave and I'd have a bad feeling about stopping the SQL thread now to take the backup.

Best Answer

As the manual says:

this option stops the slave SQL thread and wait to start backing up until Slave_open_temp_tables in SHOW STATUS is zero. [...] the SQL thread will be started and stopped until there are no open temporary tables

The reason for this is because Percona Xtrabackup basically mimics a controlled crash/shutdown of the server, and temporary tables can make the slave inconsistent, as you can see on the MySQL manual. This is not a problem of consistency per se (buckups will be consistent with the given timestamp/binlog), but it may make a slave lose some transactions when resynchronized with a master (typical usage- cloning a slave for creating another).

This does not happen if you use ROW-based replication, so I would recommend you using that. But some people cannot or do not want to use it, so this is the way to be 100% sure that new slaves work well. On a typical replication scenario, using --safe-slave-backup may not be very problematic, assuming there are not many temporary tables being created, but that is the workaround (and typically these options are added because someone had a problem in the past).

What I would recommend you is to use always --slave-info unless you are using GTID replication.

As the own manual says, using pt-table-checksum to test the backups is a good piece of advise.

Related Solutions

Mysql – Problem with Error: InnoDB: page … log sequence … number is in the future!

Nobody seems to have a real solution for this issue, at least i was not able to dig anything useful up. But apparently it helps, to just let it run and cleanup the logs on a regular basis, so your server does not get filled up with logs ... at least, since some days i do not have anymore log entries ...

this is all very odd -- in my humble opinion.

Postgresql – Taking hot backup of slave node in postgres (master-slave config with repmgr)

At least on 9.3, you can run pg_basebackup against a hot standby replica, and that's the approach I would recommend. To do this, you must have postgresql.conf set appropriately:

max_wal_senders = 2
hot_standby = on

You didn't specify your version. If you're on a version that can't do a pg_basebackup from a standby you're pretty much out of luck - you need to do a logical dump, dump from the master, or stop the replica for the backup.

Best Answer

Related Solutions

Mysql – Problem with Error: InnoDB: page … log sequence … number is in the future!

Postgresql – Taking hot backup of slave node in postgres (master-slave config with repmgr)

Related Question