Consultas lentas encontradas en el registro de la base de datos

Peper · 27 Mayo, 2020 20:43

Ejecuto las pruebas en Discourse y detecto algunas consultas ineficientes en el registro de consultas, como se muestra a continuación:

DISTINCT innecesario que ralentiza la consulta site_settings_controller.rb#L165:

SELECT 
  DISTINCT users.id 
FROM 
  "users" CROSS 
  JOIN categories c 
  LEFT JOIN category_users cu ON users.id = cu.user_id 
  AND c.id = cu.category_id 
WHERE 
  (
    c.id = '3613'
    AND cu.notification_level IS NULL
  )

Cuando categories.id y notification_level tienen valores específicos, debido a la restricción UNIQUE (category_id, user_id) en category_users y la PRIMARY KEY(id) en categories, ni el CROSS JOIN ni el LEFT JOIN generarán registros duplicados. Esto significa que podemos eliminar DISTINCT para acelerar la consulta, como se muestra a continuación:
Esta consulta optimizada tarda 4532166 nanosegundos (mejora del 30 %).

DISTINCT innecesario en una subconsulta que ralentiza la consulta search.rb#L523:

SELECT 
  "posts".* 
FROM 
  "posts" 
  JOIN (
    SELECT 
      *, 
      row_number() over() row_number 
    FROM 
      (
        SELECT 
          topics.id, 
          min(posts.post_number) post_number 
        FROM 
          "posts" 
          INNER JOIN "post_search_data" ON "post_search_data"."post_id" = "posts"."id" 
          INNER JOIN "topics" ON "topics"."id" = "posts"."topic_id" 
          AND ("topics"."deleted_at" IS NULL) 
          LEFT JOIN categories ON categories.id = topics.category_id 
        WHERE 
          ("posts"."deleted_at" IS NULL) 
          AND "posts"."post_type" IN (1, 2, 3) 
          AND (topics.visible) 
          AND (
            topics.archetype <> 'private_message'
          ) 
          AND (
            topics.id IN (
              SELECT 
                DISTINCT(tt.topic_id) 
              FROM 
                topic_tags tt 
              WHERE 
                tt.tag_id in (
                  SELECT 
                    tag_id 
                  FROM 
                    tag_group_memberships 
                  WHERE 
                    tag_group_id = 504
                )
            )
          ) 
          AND (
            categories.id NOT IN (
              SELECT 
                categories.id 
              WHERE 
                categories.search_priority = 1
            )
          ) 
          AND (
            (categories.id IS NULL) 
            OR (NOT categories.read_restricted)
          ) 
        GROUP BY 
          topics.id 
        ORDER BY 
          MAX(posts.created_at) DESC 
        LIMIT 
          6 OFFSET 0
      ) xxx
  ) x ON x.id = posts.topic_id 
  AND x.post_number = posts.post_number 
WHERE 
  ("posts"."deleted_at" IS NULL) 
ORDER BY 
  row_number

DISTINCT(tt.topic_id) es redundante y podemos eliminarlo para acelerar la consulta, como se muestra a continuación:
Esto mejora el rendimiento de esta consulta de 12655768 a 5005154 (mejora del 60 %).

DISTINCT innecesario en una subconsulta que ralentiza la consulta [search.rb#L642]:

SELECT 
  "posts".* 
FROM 
  "posts" 
  JOIN (
    SELECT 
      *, 
      row_number() over() row_number 
    FROM 
      (
        SELECT 
          topics.id, 
          posts.post_number 
        FROM 
          "posts" 
          INNER JOIN "post_search_data" ON "post_search_data"."post_id" = "posts"."id" 
          INNER JOIN "topics" ON "topics"."id" = "posts"."topic_id" 
          AND ("topics"."deleted_at" IS NULL) 
          LEFT JOIN categories ON categories.id = topics.category_id 
        WHERE 
          ("posts"."deleted_at" IS NULL) 
          AND "posts"."post_type" IN (1, 2, 3) 
          AND (topics.visible) 
          AND (
            topics.archetype <> 'private_message'
          ) 
          AND (
            topics.category_id IN (3715)
          ) 
          AND (
            topics.id IN (
              SELECT 
                DISTINCT(tt.topic_id) 
              FROM 
                topic_tags tt, 
                tags 
              WHERE 
                tt.tag_id = tags.id 
                AND lower(tags.name) IN ('lunch')
            )
          ) 
          AND (
            (categories.id IS NULL) 
            OR (NOT categories.read_restricted)
          ) 
        ORDER BY 
          posts.like_count DESC 
        LIMIT 
          6 OFFSET 0
      ) xxx
  ) x ON x.id = posts.topic_id 
  AND x.post_number = posts.post_number 
WHERE 
  ("posts"."deleted_at" IS NULL) 
ORDER BY 
  row_number

Al igual que en el caso anterior (código fuente diferente), DISTINCT(tt.topic_id) es redundante y podemos eliminarlo para acelerar la consulta:
Esto mejora el rendimiento de esta consulta de 23659762 a 21030593 (mejora del 10 %).

Tema		Respuestas	Vistas
Slow queries in Discourse Development	9	703	25 Mayo 2020
Slow SQL query causes homepage to load in 2-4 sec Support	15	1709	7 Febrero 2018
App/models/topic_tracking_state - `report` - long response times? Development slow-sql	6	1818	26 Febrero 2016
Some queries that don't finish Support	16	999	5 Abril 2019
Long-Running Sidekiq Jobs Feature	21	1825	24 Diciembre 2020

Consultas lentas encontradas en el registro de la base de datos

Temas relacionados