Mysql – 4m Rows in MySQL, ~5 Minutes to Run Query

MySQL, optimization, performance

I have 2 tables, a users table and a log table which literally logs each time a user signs in.

What I'm trying to do is grab a list of all users with a certain rank who have not logged in for over a year. That said, I have a working query, but it's taking ~5 minutes to execute.

Here's what I'm currently working with (also on SQLFiddle):

SELECT u.usrid, u.username, u.rank, u.extras, l.dtime 
FROM users AS u
JOIN (
    SELECT userid, MAX(datetime) dtime 
    FROM log 
    GROUP BY userid
) AS l
ON u.usrid = l.userid 
WHERE (u.rank = 'P' OR u.extras LIKE '%W%')
AND l.dtime < DATE_SUB(NOW(), INTERVAL 1 YEAR)
ORDER BY l.dtime DESC

It's part of a very old, soon-to-be-updated system, but until then I'm trying to make the best of what I've been given to work with. I don't have access to alter the database schema, so I'm hoping there is a way to optimize the query itself.

Furthermore, the users table has ~500k rows while the log table has ~4m rows. With that many rows, I'm also wondering if it's a size issue. If so, what suggestions could I pass on to the DBA to improve its ability to scale? I know I could simply store the last login date somewhere separate for this particular use case, but we use that log data for a lot of things, so we need a way to search it efficiently.

Best Answer

SELECT  u.usrid, u.username, u.rank, u.extras, x.dtime
    FROM  
      ( SELECT  userid, MAX(datetime) dtime
            FROM  log
            GROUP BY  userid
            HAVING  dtime < NOW() - INTERVAL 1 YEAR 
      ) AS x
    JOIN  users u ON u.usrid = x.userid
    WHERE  ( u.rank = 'P'
              OR  u.extras LIKE '%W%' 
           )
    ORDER BY  x.dtime DESC

Plus

 INDEX(userid, datetime) -- in `log`

(I assume usrid is the PRIMARY KEY in users.)
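Since the asker cannot alter the schema, the index is something to pass on to the DBA. A minimal sketch of the statement, assuming the column in `log` is named `userid` (as in the question's query) and using a made-up index name:

```sql
-- Hypothetical index name; the columns come from the question's query.
-- userid first so each user's rows are grouped together in the index,
-- datetime second so MAX(datetime) per user sits at the end of each group.
ALTER TABLE log
    ADD INDEX idx_log_userid_datetime (userid, datetime);
```

The column order matters here: with `datetime` first, the index would not help the `GROUP BY userid`.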

Is size an issue? Yes and no. With the right indexes and queries, etc, billion-row tables can work fine. In your case, it is dubious... The subquery will probably produce about 500K rows from its GROUP BY, one per distinct user. Then the HAVING will shrink that number. What's left needs to be looked up in users and filtered further. Finally, there is another sort (ORDER BY) before the results are delivered.

I think that newer versions of MySQL will leapfrog through the INDEX I suggested, landing on only 500K index rows, not all 4M.
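The "leapfrog" behavior is what MySQL calls a loose index scan. One way to check whether a given server actually uses it, assuming an index on `log (userid, datetime)` exists, is to run EXPLAIN on the grouping subquery alone:

```sql
-- Run after the (userid, datetime) index has been added to log.
EXPLAIN
SELECT userid, MAX(datetime) AS dtime
FROM log
GROUP BY userid;
```

If the Extra column of the output contains "Using index for group-by", the optimizer is jumping from one userid to the next inside the index instead of reading every row. If it shows only "Using index", the whole index is still being scanned, which is cheaper than scanning the table but touches all ~4M entries.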

If you think the rank and extras do a better filtering than MAX(datetime), then a different approach might be faster.
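One possible shape for that different approach, offered as a sketch rather than a measured improvement: filter `users` first, then look up each remaining user's latest login with a correlated subquery. It assumes the same `log (userid, datetime)` index, so each lookup is a short index probe.

```sql
SELECT t.usrid, t.username, t.`rank`, t.extras, t.dtime
FROM (
    SELECT u.usrid, u.username, u.`rank`, u.extras,
           -- Latest login for this user only; NULL if the user has no log rows.
           ( SELECT MAX(l.datetime)
               FROM log AS l
              WHERE l.userid = u.usrid ) AS dtime
    FROM users AS u
    WHERE u.`rank` = 'P'
       OR u.extras LIKE '%W%'
) AS t
-- NULL dtime fails this comparison, so users with no log rows are dropped,
-- matching the inner JOIN in the original query.
WHERE t.dtime < NOW() - INTERVAL 1 YEAR
ORDER BY t.dtime DESC;
```

Whether this wins depends on how many users match the rank/extras condition. The `LIKE '%W%'` test cannot use an index, so `users` is still scanned once, but the log lookups are limited to the users that survive that filter. `rank` is backticked because it is a reserved word from MySQL 8.0 onward.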

Related Solutions

Sql-server – Best way to rank all the columns in a table and store those ranks in another table

You would need an index on each column in order to avoid a sort of every row and column during the ranking process. That would of course introduce significant overhead as scores are updated continuously. Probably not an option unless you have a high-end hardware configuration.

The ranking processes could be offloaded onto a read-only copy maintained via log shipping on a different box. That would avoid concurrency issues with hitting the live database system of record and provide more server resources for both the intensive batch process and live database. If you are running Enterprise Edition, you could use a database snapshot on the same box as the source if your hardware is sufficiently sized.

You mention truncate and insert. Just want to mention that you could also use SWITCH such that you truncate and insert into a staging table. Once the staging load is completed, truncate Table_Ranks and then SWITCH from the staging table into Table_Ranks in a single transaction. That way, Table_Ranks would never be empty.
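A rough T-SQL sketch of that staging-and-SWITCH pattern follows. `dbo.Table_Ranks_Staging` and `dbo.vw_ComputedRanks` are hypothetical names used only for illustration; the staging table is assumed to have exactly the same columns, indexes, and filegroup as `Table_Ranks`, which SWITCH requires.

```sql
-- 1. Rebuild the ranks in the staging table while readers keep using Table_Ranks.
TRUNCATE TABLE dbo.Table_Ranks_Staging;

INSERT INTO dbo.Table_Ranks_Staging
SELECT *
FROM dbo.vw_ComputedRanks;   -- hypothetical source of the freshly computed ranks

-- 2. Swap the new data in. The SWITCH target must be empty, hence the TRUNCATE.
BEGIN TRANSACTION;
    TRUNCATE TABLE dbo.Table_Ranks;
    ALTER TABLE dbo.Table_Ranks_Staging SWITCH TO dbo.Table_Ranks;
COMMIT TRANSACTION;
```

Both statements inside the transaction are metadata operations, so the swap itself is quick regardless of row count.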

Mysql – SQL Query; Admin Log System; Foreign Keys; Retrieving multiple rows from different tables at once

The question is all over the place but I think this is what you are looking for

select a.name, u.name, b.note, b.date
  from table_B as b
  join table_A as u
        on u.ID = b.ID
  join table_A as a
        on a.ID = b.admin_id
 where b.ID = @userID

You can build a single line with
select concat(a.name, ' took action on ', u.name) ...
