This is a good question. You have several solutions but your table is quite big so none will be without pain :)
You have three solutions to "shrink" InnoDB tables:
1. OPTIMIZE TABLE
You can use OPTIMIZE TABLE
as you mentionned it but you should care about the innodb_file_per_table
variable :
mysql> show variables like "innodb_file_per_table";
+-----------------------+-------+
| Variable_name | Value |
+-----------------------+-------+
| innodb_file_per_table | ON |
+-----------------------+-------+
1 row in set (0.00 sec)
Let me explain:
The OPTIMIZE TABLE
with InnoDB tables, locks the table, copies the data in a new clean table (that's why the result is smaller), drops the original table and renames the new table with the original name. That why you should make sure to have twice the space of the original table available on your disk (You'll probably need less, since the optimized table will be smaller, but it's better to be safe than sorry).
innodb_file_per_table = ON
: In this mode, all tables have their own data file. The OPTIMIZE
statement will then create a new data file with optimized space usage. When the operation is finished, MySQL will drop the original one and replace it with the optimized version (so at the end the 700GB -- probably less because it will be optimized -- of data generated during the operation will be released)
innodb_file_per_table = OFF
: In this mode, all data is contained in one data file: ibdata. This mode has a big drawback since it cannot be optimized. So during the OPTIMIZE
process, your new table will be created (near 700GB), but even after the drop and renaming operation (and the end of OPTIMIZE
phase) your ibdata will not released the ~700GB, so you wanted to free some data, instead you have 700GB more, cool isn't it?
2. ALTER TABLE
You can also use an ALTER TABLE
statement, the ALTER TABLE
will work in the same way as OPTIMIZE TABLE
. You can just use:
ALTER TABLE myTable ENGINE=InnoDB;
3. ALTER TABLE (ONLINE)
The problem of OPTIMIZE
and ALTER TABLE
is, that it locks the table during operation. You can use the Percona tool : pt-online-schema-change (from Percona Toolkit : link
). pt-online-schema... provide mechanisms to optimize the table, while keeping the original table available for read and writes. I use this tool in production for ALTER
statements on big tables and it's pretty cool.
Note that any FOREIGN KEY
s referencing your table might complicate things, since locks might lead to locks on other tables and so on. To check this, simply query:
mysql> SELECT COUNT(*) FROM information_schema.REFERENTIAL_CONSTRAINTS WHERE REFERENCED_TABLE_NAME = "myTable";
+----------+
| COUNT(*) |
+----------+
| 0 |
+----------+
1 row in set (0.04 sec)
Here is how I use pt-online-schema-change:
pt-online-schema-change --alter "ENGINE=InnoDB" D=myBase,t=myTable --user --ask-pass
Note that my note on innodb_file_per_table
is true also for this solution.
4. mysqldump
The last solution is to recreate all databases from a dump. It takes forever, but it's extremely efficient. Note that this is the only solution to optimize your ibdata file, if innodb_file_per_table
is OFF
Max.
Best Answer
You will have to take the table offline to prevent any reads or writes that would get locked and probably have to roll back. If the table is called
mydb.mytable
, here is what you must do:Any queries attempting to read from or write to
mytable
will fail immediately rather than hang on a lock fromOPTIMIZE TABLE
.Give it a Try !!!
UPDATE 2013-08-26 15:24 EDT
Look back at the code I suggested. Please note the line
That line takes the table offline with respect to the application.
To get a good idea how long it should take to optimize run the following:
However long this takes, is how long it will actually take to run.