
WIP: Add Rust implementation for DABA #24


Open · wants to merge 5 commits into master
Conversation


@segeljakt segeljakt commented Jul 18, 2020

This PR builds on #23 and adds a Rust implementation of DABA. It does not, however, yet add an implementation of the chunked array queue; instead it uses Rust's std::collections::VecDeque. I will take a shot at implementing the Chunked Array Queue soon. It will be interesting to see the performance difference.


scotts commented Jul 20, 2020

Something to keep in mind is that (I believe) std::collections::VecDeque has worst-case O(n) operations, which means that DABA would no longer be worst-case O(1). That probably wouldn't show up in the throughput (although I'm curious!) but I'm confident it would show up in the latency.

Comparing the latency and throughput of VecDeque and the chunked-array queue should be very interesting. We can't do it easily in our C++ implementations because operations on std::deque invalidate iterators.

We have compared the performance of std::deque with our implementation of the chunked array queue for Two-Stacks, and the throughput difference was negligible. We did not, however, look at latency.

{
    // ith oldest value in FIFO order stored at vi = vals[i]
    vals: VecDeque<Value>,
    aggs: VecDeque<Value>,
scotts (Collaborator):

We implemented the vals and aggs using a single underlying queue. We created an internal struct that wrapped both a value and a partial aggregate, and then made the queue contain that struct. I think that will probably have better performance, both because of locality and by just doing less total work.
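For illustration, a minimal sketch of the single-queue layout described above might look like the following. The `Item` struct and method names are hypothetical, not taken from the PR, and `Value` is assumed to be a simple numeric type:

```rust
use std::collections::VecDeque;

// Illustrative value type; the real implementation is generic.
type Value = i64;

// Each slot pairs the raw value with its partial aggregate, so one
// push/pop touches a single deque and the two fields sit next to each
// other in memory (better locality than two parallel deques).
#[derive(Debug, Clone, Copy)]
struct Item {
    val: Value, // i-th oldest value in FIFO order
    agg: Value, // partial aggregate associated with that slot
}

struct Queue {
    q: VecDeque<Item>,
}

impl Queue {
    fn new() -> Self {
        Queue { q: VecDeque::new() }
    }
    fn push(&mut self, val: Value, agg: Value) {
        self.q.push_back(Item { val, agg });
    }
    fn pop(&mut self) -> Option<Item> {
        self.q.pop_front()
    }
}

fn main() {
    let mut d = Queue::new();
    d.push(1, 1);
    d.push(2, 3);
    let front = d.pop().unwrap();
    println!("popped val={} agg={}", front.val, front.agg);
}
```

A single `Item` deque also halves the bookkeeping: one push and one pop per window operation instead of two of each.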

segeljakt (Contributor Author):

Good point, that's something I should definitely fix.

@scotts scotts Jul 20, 2020

I just loaded up our 2017 paper, and we presented it with parallel queues. :) So I understand why you implemented it that way. Our soon-to-be-submitted journal article presents it in a way closer to our C++ implementation.

@segeljakt segeljakt Jul 20, 2020

I still do not think I fully get this algorithm, but thankfully it was pretty easy to implement 😅

Looking forward to seeing the paper.

fn pop(&mut self) {
    if self.vals.pop_front().is_some() {
        self.aggs.pop_front();
        self.l -= 1;
scotts (Collaborator):

I'm not sure decrementing is the right thing here - it may be, but it should be carefully considered.

You've implemented l, r, a and b as indices, which is something we're just now grappling with: we implemented them as iterators, but our presentation in the 2017 paper was kinda loose about whether they are iterators/pointers or indices. When they're indices, f==0 and e==vals.len() are always true, so we don't need to represent them explicitly. I think you've correctly done that. But I'm unsure about this decrementing - you're following our published algorithms for fixup(), which did not assume decrementing on eviction. It's also possible there's a subtlety I'm missing in the semantics of VecDeque.

@segeljakt segeljakt Jul 20, 2020

In Rust it's possible to overload the indexing operator. The indexing for VecDeque is an alias for get, which indexes from the front (accounting for the internal offset). I decrement all indices when removing the front element to shift them left. I believe it should work, but I might be mistaken. If the VecDeque were replaced with a regular Vec, whose indexing does not involve an offset, then I would not decrement.

scotts (Collaborator):

Yes, now I see what you mean. Calling pop_front() on a VecDeque effectively shifts the entire deque to the left, so if you had an index for location i, accessing q[i] before and after the pop will yield different values. You're decrementing i so that it continues to point at the same value in the deque.

I was wondering if it would be easier to just use an iterator, and I don't think it is. From what I can tell, there's no easy way to "decrement" a standard Rust iterator as is required in the shrink case. So, in effect, you have to do your own iterator management.
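The index shift under discussion can be demonstrated directly. This is a standalone sketch, not code from the PR:

```rust
use std::collections::VecDeque;

fn main() {
    // A deque of five values: [10, 11, 12, 13, 14]
    let mut q: VecDeque<i32> = (10..15).collect();
    let mut i = 3; // q[i] == 13

    assert_eq!(q[i], 13);
    q.pop_front(); // q is now [11, 12, 13, 14]

    // The same index now refers to the next value:
    assert_eq!(q[i], 14);

    // Decrementing the index, as the PR does on eviction,
    // makes it track the same value again:
    i -= 1;
    assert_eq!(q[i], 13);
}
```

Because `VecDeque` indexing is always relative to the current front, every stored index must move in lockstep with `pop_front()`.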

segeljakt (Contributor Author):

I think what we want is a linked list of arrays with "cursors".

scotts (Collaborator):

Yes, the concept of a cursor is essentially what C++ style iterators are, and how we ended up presenting the algorithm. The chunked array queue is a linked list, it just has many elements in each node.


scotts commented Jul 24, 2020

The implementation of ChunkedArrayQueue looks promising - the only part I can't reason through yet is the unsafe operations. But that's due to my lack of understanding of Rust. I think some simple unit tests that push a bunch of chunks and then drain them completely, checking the full contents after every operation, could help a lot.

By the way, I assume that when you're confident the implementation is correct you'll remove the "WIP"?

@segeljakt (Contributor Author)

> The implementation of ChunkedArrayQueue looks promising - the only part I can't reason through yet are the unsafe operations. But that's my lack of understanding Rust. I think some simple unit tests which push a bunch of chunks, and then drain them completely, checking full contents after every operation could help a lot.

The implementation is getting closer to working, but there are still some small problems I need to fix. The unsafe operations are mostly needed for maintaining multiple cursors simultaneously, each of which can mutate the queue. In safe Rust there can only be one mutable reference (&mut) to a memory location at a time. Safe Rust also requires all memory to be initialised before it is used. I'm using unsafe in some places to get around this, so chunks can be allocated without initialising their elements to some default value.

> By the way, I assume that when you're confident the implementation is correct you'll remove the "WIP"?

Sure, will do 👍
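For readers unfamiliar with this corner of Rust, the uninitialised-chunk technique described above can be sketched with `std::mem::MaybeUninit`. The names, chunk size, and layout here are illustrative only, not the PR's actual ChunkedArrayQueue:

```rust
use std::mem::MaybeUninit;

// Illustrative chunk size; the real queue would use something larger.
const CHUNK: usize = 4;

// A fixed-size chunk whose slots start out uninitialised, so
// constructing a chunk never pays for T::default() on every slot.
struct Chunk<T> {
    slots: [MaybeUninit<T>; CHUNK],
    len: usize, // number of initialised slots, from the front
}

impl<T> Chunk<T> {
    fn new() -> Self {
        Chunk {
            // An array of MaybeUninit is allowed to be "initialised"
            // as uninitialised memory; this is the documented pattern.
            slots: unsafe { MaybeUninit::uninit().assume_init() },
            len: 0,
        }
    }

    fn push(&mut self, v: T) -> bool {
        if self.len == CHUNK {
            return false; // chunk full; caller links a new chunk
        }
        self.slots[self.len].write(v);
        self.len += 1;
        true
    }

    fn get(&self, i: usize) -> &T {
        assert!(i < self.len);
        // Sound because slot i was written by push() above.
        unsafe { self.slots[i].assume_init_ref() }
    }
}

fn main() {
    let mut c: Chunk<String> = Chunk::new();
    assert!(c.push("a".to_string()));
    assert!(c.push("b".to_string()));
    println!("slot 1 holds {:?}", c.get(1));
}
```

Note that a complete implementation would also need a `Drop` impl that drops only the initialised slots; this sketch omits it, which merely leaks in the `String` example rather than causing undefined behaviour.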

@segeljakt (Contributor Author)

If it is ok, I will initially create two separate implementations, one for VecDeque and one for ChunkedArrayQueue; I believe their interfaces have very slight differences.
