How can I do a contiguous group by in MySQL?

https://stackoverflow.com/questions/1610599

05-07-2019
|

Question

How can I return what would effectively be a "contiguous" GROUP BY in MySQL. In other words a GROUP BY that respects the order of the recordset?

For example, SELECT MIN(col1), col2, COUNT(*) FROM table GROUP BY col2 ORDER BY col1 from the following table where col1 is a unique ordered index:

returns:

1    a    4
3    b    2

but I need to return the following:

1    a    2
3    b    2
5    a    2

Solution

Use:

   SELECT MIN(t.id) 'mi', 
          t.val, 
          COUNT(*)
     FROM (SELECT x.id, 
                 x.val, 
                 CASE 
                   WHEN xt.val IS NULL OR xt.val != x.val THEN 
                     @rownum := @rownum+1 
                   ELSE 
                     @rownum 
                 END AS grp
            FROM TABLE x
            JOIN (SELECT @rownum := 0) r
       LEFT JOIN (SELECT t.id +1 'id',
                         t.val
                    FROM TABLE t) xt ON xt.id = x.id) t
 GROUP BY t.val, t.grp
 ORDER BY mi

The key here was to create an artificial value that would allow for grouping.

Previously, corrected Guffa's answer:

   SELECT t.id, t.val
     FROM TABLE t
LEFT JOIN TABLE t2 on t2.id + 1 = t.id
    WHERE t2.val IS NULL 
       OR t.val <> t2.val

OTHER TIPS

If the numbers in col1 are contiguous, you can do like this:

select x.col1, x.col2
from table x
left join table y on x.col1 = y.col1 + 1
where x.col2 <> isnull(y.col2, '')

It works like this:

-x-  -y-  out
1 a  - -  1 a
2 a  1 a
3 b  2 a  3 b
4 b  3 b
5 a  4 b  5 a
6 a  5 a

same logic as rexem, but works on any windowing-capable RDBMS (won't work on MySQL yet):

CREATE TABLE tbl
(
id INT,
val VARCHAR(1)
);

INSERT INTO tbl(id,val) 
VALUES(1,'a'),(2,'a'),(3,'a'),(4,'a'),(5,'b'),(6,'b'),(7,'a'),(8,'a'),(9,'a');

source:

1 a
2 a
3 a
4 a
5 b
6 b
7 a
8 a
9 a

Windowing-style query: (works on windowing-capable rdbms):

WITH grouped_result AS
(
    SELECT x.id, x.val, 
        COUNT(CASE WHEN y.val IS NULL OR y.val <> x.val THEN 1 END) 
        OVER (ORDER BY x.id) AS grp
    FROM tbl x LEFT JOIN tbl y ON y.id + 1 = x.id
) 

SELECT MIN(id) mi, val, COUNT(*)
FROM grouped_result 
GROUP BY val, grp
ORDER BY mi

Output:

1  a  4
5  b  2
7  a  3

BTW, this is the result of the grouped_result without GROUP BY:

Feels good rewriting mysqlism-query to ANSI-conforming one :-) For now, while mysql don't have windowing capabality yet, rexem's answer is the best one. Rexem, that's a good mysql technique(JOIN (SELECT @rownum := 0)) there, and afaik MSSQL and PostgreSQL don't support implicitly declared variable, kudos! :-)

This won't work:

SELECT min_col1 = MIN(col1), col2
FROM table
GROUP BY col2
ORDER BY min_col1

Perhaps this?

SELECT min_col1, col2
FROM ( SELECT min_col1 = MIN(col1), col2
       FROM table
       GROUP BY col2 ) x
ORDER BY min_col1

Here is a lengthier description of essentially the same (I think) solution offered by omg-ponies - "create an artificial value that would allow for grouping".

I know this question was asked two and a half year ago (and I don't expect any upvote), but I just encountered the exact same problem, except that 'table' was already a very complicated SQL statement, so I couldn't make any join without copy-pasting it

So I had another idea: order by col2 and substract the current row number to the value of col1

SELECT *, col1-(@rownum:=@rownum+1) FROM (SELECT * FROM table JOIN (SELECT @rownum:=0) AS i ORDER BY col2) AS t

Which gives a result like this:

1    a    0
2    a    0
5    a    2
6    a    2
3    b    -2
4    b    -2

Now you just need to group by the value of the last column

SELECT MIN(col1) AS mi, col2, COUNT(*) FROM 
    (SELECT *, col1-(@rownum:=@rownum+1) AS grp FROM (SELECT * FROM table JOIN (SELECT @rownum:=0) AS i ORDER BY col2) AS t) AS x
GROUP BY grp ORDER BY mi

Licensed under: CC-BY-SA with attribution

Not affiliated with StackOverflow