
hg-core: avoid memory allocation (D8958#inline-14990 followup)
Abandoned · Public

Authored by acezar on Sep 28 2020, 9:48 AM.

Details

Reviewers
Alphare
Group Reviewers
hg-reviewers

Diff Detail

Repository
rHG Mercurial
Branch
default
Lint
No Linters Available
Unit
No Unit Test Coverage

Event Timeline

acezar created this revision. Sep 28 2020, 9:48 AM
Alphare accepted this revision.

I personally find this harder to read, and the use of usize before bit shifting makes this technically platform-dependent, which makes me a bit uneasy. I can't easily compare the compiler output, since godbolt does not support crates and I'd have to find out how to do it by hand, but maybe you or @martinvonz did?

> I personally find this harder to read

I find it easier to read :)

> and the use of usize before bit shifting making this technically platform-dependent makes me a bit uneasy.

It probably doesn't matter if we're truncating (on small platforms) the result to usize anyway? I suppose we can use usize::try_from() to check (whether or not we make the change in this patch).
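A std-only sketch of that check (the helper name here is hypothetical, not from the patch): the value is assembled in u64 so the 32-bit shift is well-defined on every platform, and usize::try_from() reports truncation instead of silently dropping high bits.

```rust
use std::convert::TryFrom;

// Hypothetical helper, not from the patch: read a 6-byte big-endian
// offset (2-byte high part, 4-byte low part) into u64, then check the
// conversion to usize instead of truncating.
fn read_offset(bytes: &[u8; 6]) -> Option<usize> {
    let hi = u64::from(u16::from_be_bytes([bytes[0], bytes[1]]));
    let lo = u64::from(u32::from_be_bytes([bytes[2], bytes[3], bytes[4], bytes[5]]));
    usize::try_from((hi << 32) | lo).ok() // None if usize can't hold the value
}

fn main() {
    // 0x0001_00000002 = (1 << 32) + 2
    assert_eq!(read_offset(&[0, 1, 0, 0, 0, 2]), Some((1 << 32) + 2));
    assert_eq!(read_offset(&[0, 0, 0, 0, 0, 42]), Some(42));
}
```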

> I can't easily compare the compiler output since godbolt does not support crates and I'd have to find out how to do it by hand, but maybe you or @martinvonz did?

I hadn't, but I just tried it using the criterion benchmarking crate. This is the code I used:

// crates used: byteorder (BigEndian::read_u16/read_u32) and rand
use byteorder::{BigEndian, ByteOrder};

let input: Vec<u8> = (0..60000).map(|_| rand::random::<u8>()).collect();
let routine = || {
    for i in 0..input.len() / 6 {
        let slice = &input[6 * i..6 * i + 6];
        let val = ((BigEndian::read_u16(&slice[0..2]) as usize) << 32)
            + (BigEndian::read_u32(&slice[2..6]) as usize);
    }
};

Criterion says that it takes around 5.43 µs both before and after this patch. So I guess the compiler is smart enough to eliminate the allocation, or maybe I messed something up.
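One caveat with the loop above: val is never used, so a release build is in principle free to delete the whole loop as dead code; criterion wraps values in its black_box for exactly this reason. A std-only sketch of the guarding idea (the function name and deterministic input are made up here, this is not the benchmark that was actually run):

```rust
use std::hint::black_box; // std's optimizer barrier (criterion provides its own)

// Hypothetical function: fold all 6-byte big-endian records into a sum
// so the computed values are actually consumed.
fn checksum(input: &[u8]) -> u64 {
    let mut sum: u64 = 0;
    for chunk in input.chunks_exact(6) {
        let hi = u16::from_be_bytes([chunk[0], chunk[1]]) as u64;
        let lo = u32::from_be_bytes([chunk[2], chunk[3], chunk[4], chunk[5]]) as u64;
        sum = sum.wrapping_add((hi << 32) + lo);
    }
    sum
}

fn main() {
    // Deterministic stand-in for the random input above.
    let input: Vec<u8> = (0..60000u32).map(|i| (i % 251) as u8).collect();
    // black_box on input and result keeps the optimizer from removing the work.
    let sum = checksum(black_box(&input));
    println!("{}", black_box(sum));
}
```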

> I find it easier to read :)

heh

> Criterion says that it takes around 5.43 µs both before and after this patch.

Seems fine indeed.

> Seems fine indeed.

To check that I hadn't messed something up, I ran the benchmark on a debug build and there this patch speeds it up from 1.51ms to 1.38ms (which is still ~240x as slow as the release build! :) ).

> To check that I hadn't messed something up, I ran the benchmark on a debug build and there this patch speeds it up from 1.51ms to 1.38ms (which is still ~240x as slow as the release build! :) ).

Yeah, debug builds notoriously have little to no optimization, so that makes sense.
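For reference, that gap comes from Cargo's default profiles: a plain cargo build uses the dev profile at opt-level 0, while --release uses opt-level 3. The defaults are equivalent to this Cargo.toml fragment:

```toml
[profile.dev]
opt-level = 0   # debug builds: no optimization

[profile.release]
opt-level = 3   # release builds: full optimization
```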


@acezar said on chat that they find the old version clearer, and the compiler is apparently smart enough to optimize out the allocation, so I'll skip this patch.

acezar updated this revision to Diff 22909. Sep 29 2020, 4:19 AM
acezar abandoned this revision. Sep 29 2020, 11:59 AM