use BufReader to read archive index files #1530

syphar · 2021-10-26T14:36:13Z

I ran test with an index with 17k files:

it took ~250ms to load the index
after this change it only takes 25ms
for the problematic crate ( big crates are slow to use with new archive-storage #1528 ) it still takes ~250ms

Since the average archive-size for docs is just around 600 I think we should merge and deploy this first.
Using mmap made it only slightly faster, which is why I didn't add the additional dependency.

The next optimization is probably not to load the whole index into memory but to answer the find-file requests while streaming the content of the index. But I would like to gather more data on archive-sizes first and benchmark it.

jyn514 · 2021-10-26T15:41:18Z

for the problematic crate ( big crates are slow to use with new archive-storage #1528 ) it still takes ~250ms

What was it before this change?

The next optimization is probably not to load the whole index into memory but to answer the find-file requests while streaming the content of the index. But I would like to gather more data on archive-sizes first and benchmark it.

👍

jyn514 · 2021-10-26T15:41:48Z

src/storage/archive_index.rs

@@ -28,11 +28,14 @@ pub(crate) struct Index {

 impl Index {
    pub(crate) fn load(reader: impl io::Read) -> Result<Index> {


Can we take BufReader here instead, to avoid having a BufReader<BufReader>?

while changing this I saw an issue with that:
sometimes we are using load with a slice. In this case I also used an unnecessary BufReader.

Also, the BufWriter on save doesn't help, because we're saving into a in-memory buffer anyways...

I updated the code, even less now

src/storage/archive_index.rs

syphar · 2021-10-26T16:08:57Z

for the problematic crate ( big crates are slow to use with new archive-storage #1528 ) it still takes ~250ms

What was it before this change?

roughly the same ratio, so over 20 seconds

src/storage/mod.rs

syphar self-assigned this Oct 26, 2021

syphar added the S-waiting-on-review Status: This pull request has been implemented and needs to be reviewed label Oct 26, 2021

jyn514 suggested changes Oct 26, 2021

View reviewed changes

jyn514 added S-waiting-on-author Status: This PR is incomplete or needs to address review comments and removed S-waiting-on-review Status: This pull request has been implemented and needs to be reviewed labels Oct 26, 2021

syphar force-pushed the archive-index-quickfix branch from f2d80f8 to 8c76fa2 Compare October 26, 2021 16:26

syphar added S-waiting-on-review Status: This pull request has been implemented and needs to be reviewed and removed S-waiting-on-author Status: This PR is incomplete or needs to address review comments labels Oct 26, 2021

syphar requested a review from jyn514 October 26, 2021 16:26

syphar changed the title ~~use BufReader and BufWriter to read/write archive index files~~ use BufReader to read archive index files Oct 26, 2021

syphar force-pushed the archive-index-quickfix branch 2 times, most recently from fcbac39 to c8a1f35 Compare October 26, 2021 16:47

jyn514 reviewed Oct 26, 2021

View reviewed changes

src/storage/mod.rs Outdated Show resolved Hide resolved

use BufReader to read archive index files

c4cbe31

syphar force-pushed the archive-index-quickfix branch from c8a1f35 to c4cbe31 Compare October 26, 2021 16:49

jyn514 approved these changes Oct 26, 2021

View reviewed changes

jyn514 added S-waiting-on-deploy This PR is ready to be merged, but is waiting for an admin to have time to deploy it and removed S-waiting-on-review Status: This pull request has been implemented and needs to be reviewed labels Oct 26, 2021

jyn514 merged commit f0f19a6 into rust-lang:master Oct 26, 2021

syphar deleted the archive-index-quickfix branch October 26, 2021 17:46

syphar removed the S-waiting-on-deploy This PR is ready to be merged, but is waiting for an admin to have time to deploy it label Nov 30, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

use BufReader to read archive index files #1530

use BufReader to read archive index files #1530

Uh oh!

syphar commented Oct 26, 2021

Uh oh!

jyn514 commented Oct 26, 2021

Uh oh!

jyn514 Oct 26, 2021

Uh oh!

syphar Oct 26, 2021

Uh oh!

Uh oh!

syphar commented Oct 26, 2021

Uh oh!

Uh oh!

Uh oh!

		@@ -28,11 +28,14 @@ pub(crate) struct Index {

		impl Index {
		pub(crate) fn load(reader: impl io::Read) -> Result<Index> {

use BufReader to read archive index files #1530

use BufReader to read archive index files #1530

Uh oh!

Conversation

syphar commented Oct 26, 2021

Uh oh!

jyn514 commented Oct 26, 2021

Uh oh!

jyn514 Oct 26, 2021

Choose a reason for hiding this comment

Uh oh!

syphar Oct 26, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

syphar commented Oct 26, 2021

Uh oh!

Uh oh!

Uh oh!