Code-level performance testing

angus · 20.Октябрь.2023 03:44:49

I’ve been looking at the performance of the ActivityPub plugin recently and considering the best ways to reliably test, and prove, performance for the purpose of such a project. Here’s a few initial pieces of context:

The plugin is an open source project with multiple parties involved (i.e. Discourse, who owns the plugin, and Pavilion who is currently building it).
Different parties may have different internal performance testing tools / systems.
Multi-party open source projects benefit from commonly available methods of testing, and proving, that something works, or in this case performs, reliably.
Discourse (laudably) cares about performance.

Contributing to Discourse development

At Discourse we believe performance is a feature. We welcome pull requests that improve either client or server side performance.
Currently, the only commonly available method of testing / proving server-side performance in Discourse (that I’m aware of) is track_sql_queries, which is typically (but not exclusively) used in request tests.
While the query count is one indicator of performance, it’s not the only one (some queries are bigger than others).

For a recent example of the use of track_sql_queries see

github.com/discourse/discourse-activity-pub

PERF: Improve topic serialization performance

main ← angusmcleod:topic_serialization_perf

merged 06:30PM - 19 Oct 23 UTC

angusmcleod

+220 -14

@pmusaraj This is the first of a few performance related PRs. This one is focuse…d on topics. The next one will likely be focused on categories, and will come after https://github.com/discourse/discourse/pull/23969 is merged. I'm going to circle back to topics again after this is and the first categories performance PR is handled to see what additional gains can be made. Note that all the actual performance gains here are being made in the topic `show` action. The AP plugin seems to have little discernible impact on topic lists (category lists are a different story) and I've added a spec to both (partially) demonstrate that and also as a guard against regression. With respect to topic `show` I'll let the specs speak for themselves, but just to illustrate the gain being made there (and as a sanity check) these are the different profiles of a topic on my local. The topic has: - Full Topic publication type (when AP is enabled) - 7 posts from 4 different actors (users) ##### No AP plugin (vanilla discourse) <img width="741" alt="Screenshot 2023-10-19 at 14 16 28" src="https://github.com/discourse/discourse-activity-pub/assets/5931623/bd3be390-56f1-4d2e-a0dd-68365647552f"> ##### AP plugin on this branch <img width="730" alt="Screenshot 2023-10-19 at 14 12 32" src="https://github.com/discourse/discourse-activity-pub/assets/5931623/45555532-fcbd-4b0b-a24a-26e330ff0d59"> ##### AP plugin on main <img width="733" alt="Screenshot 2023-10-19 at 14 14 21" src="https://github.com/discourse/discourse-activity-pub/assets/5931623/fbcd3b7b-2027-42f0-ac50-d3e8089e3763">

If you’re familiar with this area, you probably know something like

The rule of thumb is that unit tests need speed and performance tests need time.

(quote from this decent explainer)

Which, on the face of it, can make performance testing (beyond query counting) somewhat hard to integrate into an rspec (or similar) suite. That said, some people try

I’m curious what other practical methods, suggestions or ideas folks may have to add more commonly available performance testing, and performance proofs, to the Discourse ecosystem. Or if there are methods or approaches I haven’t mentioned here. I would emphasise the words “practical” and “commonly available” there.

One thought that occurs is that it could be possible to use MiniProfiler in a spec, i.e. something like Rack::MiniProfiler.profile_singleton_method. But I have neither tried that, or know whether that’d be a good idea.

sam · 23.Октябрь.2023 23:29:59

My general recommendation is to avoid performance testing in specs.

We have some examples where we try to monitor for N+1s in a spec, but they all tend to be pretty fragile.

Its a very very tough problem with no obvious solution, all solutions come with compromises so we generally avoid this and just monitor production for this kind of stuff.

Тема		Ответов	Просм.
Any performance-related areas you'd like investigated? Development	1	1479	16.08.2016
Using Google's 'tachometer' to measure JS performance changes in Discourse Developer Guides code	0	946	05.10.2023
Forum Performance Indicators Community Building performance	5	799	28.06.2022
Discourse server performance or load testing Development	0	640	31.12.2020
Does Discourse have more testing except automated ones & how to ensure no bugs? (Question when learning software engineering & testing) Development	0	413	07.05.2021

Code-level performance testing

Связанные темы