Add a lockfree SPSC queue by sakertooth · Pull Request #8280 · LMMS/lmms

sakertooth · 2026-02-24T17:43:16Z

Adds a lockfree SPSC (single producer, single consumer) queue meant for sending data (e.g. audio or messages) in real time from one thread to another. It uses std::atomic with relaxed memory orderings where applicable as well as alignment via alignas(std::hardware_destructive_interference_size) for optimal performance.

That being said, I'm not too confident this (mainly the performance) is any better or worse than some of the more battletested SPSC queues like the ones from boost and moodycamel, but if we are not too interested in outsourcing another library for this, this should work and fit our needs well enough.

The queue replaces the old LocklessRingBuffer, and as such is currently being used in the AudioEngine for submitting new PlayHandle* objects, the currently unused Lv2Worker code, and in SpectrumAnalyzer and Vectorscope for sending audio data across two separate threads.

…f minimum size

…tion

…::analyze (might change later, might not)

…tch overloads

Changing the indicies while waiting to -1 does work but makes the queue unreusable

…pscQueue

…pBlocking

…ta in its own function, remove space available flag (unused)

…not calculate contiguous region size in reserveContiguous functions

…with 64

messmerd · 2026-02-24T18:19:29Z

To fix the macOS failure, we need to bump our x86_64 macOS minimum deployment target to macOS 11.0.

This should be done explicitly by setting the -DCMAKE_OSX_DEPLOYMENT_TARGET CMake flag rather than using our mac-os.env script. See #8118 for how it should be done.

We will also need to do this to enable std::filesystem support on x86_64 macOS.

messmerd · 2026-02-24T18:29:56Z

For std::hardware_destructive_interference_size, check out what @rubiefawn did in #8132 with the new Hardware.h header. And just so you're aware, there might be a warning about potential ABI changes that needs to be disabled when using std::hardware_destructive_interference_size.

rubiefawn · 2026-02-24T18:33:15Z

The work in #8132 should also be able to be generalized into a lock free MPSC ring buffer, which may be useful in other places as well.

Edit: Don't know why I said "may", I've already done this, it's just not tested outside of Lb302, and I haven't meaningfully worked on it in months. Only the MPMC queue is ~~implemented~~ at the moment, but the other types can be derived from it by removing things (such as the MPSC queue used in Lb302).

sakertooth · 2026-02-24T18:56:25Z

To fix the macOS failure, we need to bump our x86_64 macOS minimum deployment target to macOS 11.0.
This should be done explicitly by setting the -DCMAKE_OSX_DEPLOYMENT_TARGET CMake flag rather than using our mac-os.env script. See #8118 for how it should be done.
We will also need to do this to enable std::filesystem support on x86_64 macOS

Should this PR wait then?

For std::hardware_destructive_interference_size, check out what @rubiefawn did in #8132 with the new Hardware.h header. And just so you're aware, there might be a warning about potential ABI changes that needs to be disabled when using std::hardware_destructive_interference_size.

I guess I'll wait to avoid doing the same thing. I'm also not sure because MinGW complains about std::hardware_destructive_interference_size being unstable, rather than it being not defined, so in the event we eventually switch to using it, MinGW might still be broken (AFAICT). Though this complaint from MinGW is actually a warning, so disabling it might resolve it.

The work in #8132 should also be able to be generalized into a lock free MPSC ring buffer, which may be useful in other places as well.

The only place I can think of right now that might use a queue whereby there is multiple threads on either side is with the audio thread and its worker threads, but even then in that situation I think separate SPSC queues might scale better (though, a MPSC/SPMC/MPMC queue might work fine when using an atomic fetch_add with a slot based system, though I designed this queue with contiguous reservations of the buffer in mind, which I don't think you can do with those queues, could be wrong). That all being said though, if we have a use case for those queues, I would be fine adding them as needed.

I'm curious why a MPSC queue would be needed for 8132 though, (are the multiple audio worker threads each sending voices or something?)

rubiefawn · 2026-02-24T19:01:58Z

are the multiple audio worker threads each sending voices or something?

This is the case. I had each thread print out its id when performing an enqueue operation and got several different ids.

JohannesLorenz · 2026-03-08T22:18:45Z

I have 2 questions:

The queue replaces the old LocklessRingBuffer

What was wrong with LocklessRingBuffer? What is the advantage that this PR introduces?

the currently unused Lv2Worker code

What code is unused in Lv2Worker?

sakertooth · 2026-03-09T00:32:21Z

@JohannesLorenz,

What was wrong with LocklessRingBuffer? What is the advantage that this PR introduces?

LocklessRingBuffer brought with it some concerns that discouraged my use of it, most notably in #7705 (see #7705 (comment)). From further inspection of the ringbuffer submodule however, it seems to be a lockfree SPMC queue, so it seems to be more real-time safe than I had initially thought.

In 7705 I was wary of the use of QWaitCondition, since in my mind its still a heavy OS construct, most likely heavier than a simple std::atomic_flag::wait, which is what my PR uses. I also didn't like the split reader/writer API, where reading has to be done with a separate class. For SPSC queue uses this seemed a bit much.

Also wasn't sure if we really had to outsource another library (the ringbuffer submodule). I initially was planning to slowly get rid of it, but come to see its actually a SPMC queue and solves somewhat of a different problem, it might be needed (though seeing that you wrote it I don't know if you would've been okay with that regardless).

What code is unused in Lv2Worker?

Correct me if I'm wrong, but the code in Lv2Worker.cpp doesn't seem to be in use anywhere else in the codebase currently. I assumed that this was because the Lv2 implementation was being worked in incrementally rather than being complete in one go.

JohannesLorenz · 2026-03-09T22:28:56Z

Correct me if I'm wrong, but the code in Lv2Worker.cpp doesn't seem to be in use anywhere else in the codebase currently.

$ git grep Lv2Worker
include/Lv2Proc.h:#include "Lv2Worker.h"
include/Lv2Proc.h:      std::optional<Lv2Worker> m_worker;

You can continue with git grep m_worker -- src/core/lv2/Lv2Proc.cpp. I really wonder why you cannot see this.

I also didn't like the split reader/writer API, where reading has to be done with a separate class. For SPSC queue uses this seemed a bit much.

Indeed it is a bit much, but restricts you to SPSC (vs SPMC).

--

If I put it all together, for me it looks a bit like this:

Advantages of this PR:

Follows our style guide
No submodule (some users find them difficult)
No need for Reader AND writer (but no SPMC)
Uses atomic_flag (though LocklessRingbuffer could be extended by it)

Advandages of keeping ringbuffer:

No merge conflicts (this PR will conflict at least Real time safe recording with ring buffer stage one #7903 and Lv2 UI - Testing #7201)
No need to delete the submodule (can get ugly)
SPMC (but you need 2 classes)
mlock (though this PR could be extended by it)
Proven in use, has extensive tests

I value any effort. However, this PR here looks to me to only have stylistic advantages, and has functional disadvantages (yet).

sakertooth · 2026-03-09T23:11:29Z

You can continue with git grep m_worker -- src/core/lv2/Lv2Proc.cpp. I really wonder why you cannot see this.

Ah, I see it now. I skipped over this by accident.

I value any effort. However, this PR here looks to me to only have stylistic advantages, and has functional disadvantages (yet).

If the API for LocklessRingBuffer was simpler instead of split, I would be more receptive of using it. I would also rename it to reflect if its a SPSC or SPMC queue, so we know in what contexts it can be used. And I was confused why we needed a LocklessRingBuffer because I thought the underlying structure was already a lockfree SPMC queue? If the appropriate changes are made I guess I can close this work out and use what's already there.

Maybe have a simple interface for SPSC uses, and a more general interface for SPMC uses, though both can use the underlying library.

I'm also not sure of its capabilities/how this library actually works. I wasn't even sure if ringbuffer was lockfree (I thought it was just a regular ring buffer 😵‍💫)

Note

I never really like these situations because of the attachment to ones work and the conflict that brings. I usually am willing to let my work be disbanded, but just need confirmation that the original solution will be improved to be more convenient and easy to use. Nevertheless, I should've asked more questions about this than starting from scratch.

TLDR: Didn't really understand what ringbuffer was, wrote my own SPSC solution, should've asked around, see what could've been done about the code already there.

messmerd · 2026-03-09T23:27:19Z

I think I'm honestly more in favor of using a widely-used, extensively-tested, well-documented, high-performance 3rd party library than trying to roll our own homemade implementation. Concurrency is hard to do correctly, so I'd feel more at ease using something that thousands of other people depend on.

The problems with this PR as I see it:

Homemade (can I trust it to always work correctly and be free of UB? I have no idea.)
No testing
No performance benchmarks

The problems with the current ringbuffer as I see it:

Homemade (can I trust it to always work correctly and be free of UB? Definitely more than this PR, but I'm still not fully confident)
There are a couple tests, but not much
No performance benchmarks
Confusing API

The problems with a 3rd party library:

Requires a new dependency (or just a different dependency, since it would replace the ringbuffer submodule)
That's it

sakertooth · 2026-03-09T23:31:48Z

Homemade (can I trust it to always work correctly and be free of UB? I have no idea.)
No testing
No performance benchmarks

Unfortunately yes you're right, I think something like moodycamel would be a good bet, off the top of my head. Might not have enough time to commit to thorough testing and benchmarks for this PR, though it's in our best interest to have them.

I just need to make sure the library has capabilities we need, like reserving contiguous regions on either side for SPSC queues at a minimum (edit: it has bulk enqueue/dequeue, so it should be fine).

sakertooth added 30 commits February 20, 2026 18:49

Add CircularBuffer.h

ee935f9

Replace use of LocklessList with LockfreeSpscQueue in the audio engine

35b51c2

Remove LocklessList

dd754fb

Remove LocklessAllocator

7bf6931

Refactor CircularBuffer load/read index mechanism

4de9acf

Add block push/pop overloads

d6068de

Use move iterators for batch push/pop overloads, remove calculation o…

dc0aadb

…f minimum size

Only reserve contiguous space within the buffer

8c8dd8c

Remove use of move iterators, fix reserveReadRegion available calcula…

3b9238e

…tion

Add dynamic support to CircularBuffer

09fe5e0

Add include guards within CircularBuffer.h

62492bb

Use new circular buffer SPSC implementation for spectrum analyzer

440b3a7

Use policy based design for circular buffer, yield CPU in SaProcessor…

b63429d

…::analyze (might change later, might not)

Add commitRegion, try to push/pop as many elements in the push/pop ba…

7b896e0

…tch overloads

Implement waiting for SPSC

a2b14b1

Refactor waiting mechanism to be recoverable

09650de

Changing the indicies while waiting to -1 does work but makes the queue unreusable

Replace LocklessRingBuffer with LockfreeSpscQueue in Vectorscope

2fc4378

Rename push/pop to tryPush/truPop

bb8cd2d

Hold off on CircularBuffer implementation for now, focus on LockfreeS…

d2a6775

…pscQueue

Remove unused LocklessRingBuffer include

49f149d

Make tryPush/tryPop "all or nothing"

02e9625

Use LockfreeSpscQueue in Lv2Worker

7afc4f1

Fix bugs, add helper functions

7fbf194

Remove LocklessRingBuffer

792c6b5

Rename nonblocking functions to push/pop, blocking to pushBlocking/po…

73fd828

…pBlocking

Use reinterpret_cast in Lv2Worker

77021af

Inline DynamicSpscQueueSize, rename it

b8addd1

Add separate reserve/reserveContiguous functions, move waiting for da…

3a5c7a3

…ta in its own function, remove space available flag (unused)

Add trailing return types

2750fa4

Take 'requested' into consideration for write space reservations, do …

09d2fbe

…not calculate contiguous region size in reserveContiguous functions

sakertooth requested a review from messmerd February 24, 2026 17:44

sakertooth added 2 commits February 24, 2026 12:52

CI did not like std::hardware_destructive_interference_size, replace …

a66cd18

…with 64

Include bit header

ef74caf

sakertooth added 2 commits February 25, 2026 05:37

Use notify_one() instead of notify_all()

57fc35d

Remove unused variable p

7ee850d

rubiefawn mentioned this pull request Feb 25, 2026

Lb302 cleanup #8132

Open

17 tasks

Introduce RAII wrapper LockfreeSpscQueueRegion

75e3411

JohannesLorenz mentioned this pull request Mar 8, 2026

Add Oscilloscope Effect #7937

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add a lockfree SPSC queue#8280

Add a lockfree SPSC queue#8280
sakertooth wants to merge 35 commits intoLMMS:masterfrom
sakertooth:add-lockfree-spsc-queue

sakertooth commented Feb 24, 2026 •

edited

Loading

Uh oh!

messmerd commented Feb 24, 2026 •

edited

Loading

Uh oh!

messmerd commented Feb 24, 2026 •

edited

Loading

Uh oh!

rubiefawn commented Feb 24, 2026 •

edited

Loading

Uh oh!

sakertooth commented Feb 24, 2026 •

edited

Loading

Uh oh!

rubiefawn commented Feb 24, 2026

Uh oh!

JohannesLorenz commented Mar 8, 2026

Uh oh!

sakertooth commented Mar 9, 2026 •

edited

Loading

Uh oh!

JohannesLorenz commented Mar 9, 2026

Uh oh!

sakertooth commented Mar 9, 2026 •

edited

Loading

Uh oh!

messmerd commented Mar 9, 2026 •

edited

Loading

Uh oh!

sakertooth commented Mar 9, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

sakertooth commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

messmerd commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

messmerd commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rubiefawn commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sakertooth commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rubiefawn commented Feb 24, 2026

Uh oh!

JohannesLorenz commented Mar 8, 2026

Uh oh!

sakertooth commented Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JohannesLorenz commented Mar 9, 2026

Uh oh!

sakertooth commented Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

messmerd commented Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sakertooth commented Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

sakertooth commented Feb 24, 2026 •

edited

Loading

messmerd commented Feb 24, 2026 •

edited

Loading

messmerd commented Feb 24, 2026 •

edited

Loading

rubiefawn commented Feb 24, 2026 •

edited

Loading

sakertooth commented Feb 24, 2026 •

edited

Loading

sakertooth commented Mar 9, 2026 •

edited

Loading

sakertooth commented Mar 9, 2026 •

edited

Loading

messmerd commented Mar 9, 2026 •

edited

Loading

sakertooth commented Mar 9, 2026 •

edited

Loading