copies-rust: send PyBytes values back be dropped ino the parent thread
ClosedPublic

Authored by SimonSapin on Jan 6 2021, 9:12 AM.

Download Raw Diff

Details

Reviewers

Group Reviewers

Commits

rHG8d20abed6a1e: copies-rust: send PyBytes values back be dropped ino the parent thread
rHGd66a1fe24f1b: copies-rust: send PyBytes values back be dropped ino the parent thread

Summary

… instead of acquiring the GIL in the Rust thread in the Drop impl

This commit is based on the premise that crossbeam-channel
with unbounded send and non-blocking receive is faster than
a contended GIL, but that remains to be measured.

Diff Detail

Repository

rHG Mercurial

Lint

Automatic diff as part of commit; lint not applicable.

Unit

Automatic diff as part of commit; unit tests not applicable.

Event Timeline

SimonSapin created this revision.Jan 6 2021, 9:12 AM

Herald added a reviewer: hg-reviewers. · View Herald TranscriptJan 6 2021, 9:12 AM

Herald added a subscriber: mercurial-patches. · View Herald Transcript

Alphare accepted this revision.Jan 22 2021, 5:15 AM

✅ refresh by Heptapod after a successful CI run (🐙 💚)

SimonSapin added a commit: rHGd66a1fe24f1b: copies-rust: send PyBytes values back be dropped ino the parent thread.Feb 24 2021, 11:04 AM

This revision was not accepted when it landed; it landed in state Needs Review.

Closed by commit rHGd66a1fe24f1b: copies-rust: send PyBytes values back be dropped ino the parent thread (authored by SimonSapin). · Explain Why

This revision was automatically updated to reflect the committed changes.

SimonSapin added a commit: rHG8d20abed6a1e: copies-rust: send PyBytes values back be dropped ino the parent thread.Feb 24 2021, 12:11 PM

Revision Contents
Changeset List

			Path	Packages
M			rust/hg-cpython/src/copy_tracing.rs (47 lines)

Diff	ID	Description	Created	Lint	Unit
Base		Base
Diff 1	24619		Jan 6 2021, 9:12 AM	★	★
Diff 2	25706		Feb 22 2021, 9:25 AM	★	★
Diff 3	25743		Feb 22 2021, 11:00 AM	★	★
Diff 4	25788		Feb 22 2021, 3:47 PM	★	★
Diff 5	25865	rHGd66a1fe24f1b45ae08c74563511f391f60bb54ca	Jan 5 2021, 3:46 PM	★	★

Status	Author	Revision
Closed	marmoute	D9591 copies: rename value/other variable to minor/major for clarity
Closed	marmoute	D9590 copies: extract value comparison in the python copy tracing
Closed	marmoute	D9607 hghave: add some official category for known-bad and missing-good output
Closed	marmoute	D9608 copies: stop attempt to avoid extra dict copies around branching
Closed	marmoute	D9592 copies: deal with the "same revision" special case earlier
Closed	marmoute	D9589 copies-tests: update to null in test-copies-chain-merge.t
Closed	marmoute	D9588 copies-tests: add a summary of all cases created in test-copies-chain-merge.t
Closed	SimonSapin	D9686 copies-rust: send PyBytes values back be dropped ino the parent thread
Closed	SimonSapin	D9685 copies-rust: introduce PyBytesWithData to reduce GIL requirement
Closed	SimonSapin	D9684 copies-rust: move CPU-heavy Rust processing into a child thread
Closed	SimonSapin	D9683 copies-rust: split up combine_changeset_copies function into a struct
Closed	SimonSapin	D9682 copies-rust: extract generic map merge logic from merge_copies_dict
Closed	marmoute	D9656 copies-rust: use imrs::OrdSet instead of imrs::HashSet
Closed	marmoute	D9655 copies-rust: use simpler overwrite when value on both side are identical
Closed	marmoute	D9654 copies-rust: make more use of the new comparison property
Closed	marmoute	D9653 copies-rust: implement PartialEqual manually
Closed	marmoute	D9652 copies-rust: record "overwritten" information from both side on delete
Closed	marmoute	D9651 copies-rust: refactor the "deletion" case
Closed	marmoute	D9650 copies-rust: process copy information of both parent at the same time
Closed	marmoute	D9649 copies-rust: yield both p1 and p2 copies in `ChangedFiles.actions()`
Closed	marmoute	D9648 copies-rust: extract the processing of a single copy information
Closed	marmoute	D9647 copies-rust: use matching to select the final copies information
Closed	marmoute	D9646 copies-rust: get the parents' copies earlier
Closed	marmoute	D9645 copies-rust: remove the ancestor Oracle logic
Closed	marmoute	D9644 copies-rust: track "overwrites" directly within CopySource
Closed	marmoute	D9643 copies-rust: add methods to build and update CopySource
Closed	marmoute	D9657 copies-rust: fix reverted argument when merging tiny minor or major
Closed	marmoute	D9642 copies-rust: rename TimeStampedPathCopy to CopySource
Closed	marmoute	D9641 copies-rust: rename TimeStampedPathCopies to InternalPathCopies
Closed	marmoute	D9613 copies: detect case when a merge decision overwrite previous data
Closed	marmoute	D9612 copies: rearrange all value comparison conditional
Closed	marmoute	D10059 test-copies: introduce merge chains test for the P/Q merges
Closed	marmoute	D10058 test-copies: add a case involving the `b` and a new `r` branch
Closed	marmoute	D10057 test-copies: introduce case combining the `p` and `q` branch
Closed	marmoute	D10056 test-copies: add a `q` branch similar to the `e` but on the new files
Closed	marmoute	D10055 test-copies: add a `p` branch similar to the `a` but on the new files
Closed	marmoute	D10054 test-copies: move the new files in the `i` branch
Closed	marmoute	D10053 test-copies: add 3 new files with their own content
Closed	marmoute	D10052 test-copies: introduce merge chaing test for the A/E + change tests
Closed	marmoute	D10051 test-copies: add a "change during merge" variant to the A+E test
Closed	marmoute	D10050 test-copies: filter out the linkrev part of `debugindex`
Closed	marmoute	D10049 test-copies: use "case-id" instead of revision number when listing sidedata
Closed	marmoute	D10048 test-copies: remove revision number from log
Closed	marmoute	D9611 test-copies: add test chaining multiple merge
Closed	marmoute	D9610 test-copies: add test chaining multiple merges
Closed	marmoute	D9609 test-copies: add test chaining multiple merges
Closed	marmoute	D10047 test-copies: add subcase titles for various "conflicting" information variant
Closed	marmoute	D10046 test-copies: improve description of the B+F case
Closed	marmoute	D10045 test-copies: improve description of the C+H case
Closed	marmoute	D10044 test-copies: improve description of the B+C "revert/restore" case
Closed	marmoute	D10043 test-copies: improve description of the G+C case
Closed	marmoute	D10042 test-copies: improve description of the G+F case
Closed	marmoute	D10041 test-copies: improve description of the D+G case
Closed	marmoute	D10040 test-copies: improve description of the A+E case
Closed	marmoute	D10039 test-copies: improve description of the B+D case
Closed	marmoute	D10038 test-copies: improve description of the B+C case
Closed	marmoute	D10037 test-copies: improve description of the A+B case
Closed	marmoute	D10036 test-copies: use intermediate variable some commit descriptions
Closed	marmoute	D10035 test-copies: don't use empty file for "same content" cases
Closed	marmoute	D9587 test-copies: reinstall initial identical (empty) files for chained copied
Closed	marmoute	D9586 copies: explain the "arbitrary" copy source pick in case of conflict
Closed	marmoute	D9585 copies: properly match result during changeset centric copy tracing
Closed	marmoute	D9584 copies: avoid early return in _combine_changeset_copies
Closed	marmoute	D9499 copies-rust: record overwrite when merging
Closed	marmoute	D9498 copies-rust: make the comparison aware of the revision being current merged
Closed	marmoute	D9497 copies-rust: start recording overwrite as they happens
Closed	marmoute	D9496 copies-rust: rename Oracle.is_ancestor to Oracle.is_overwrite
Closed	marmoute	D9495 copies-rust: use the `entry` API for copy information too
Closed	marmoute	D9494 copies-rust: use the entry API to overwrite deleted entry
Closed	marmoute	D9493 copies-rust: tokenize all paths into integer
Closed	marmoute	D9492 copies-rust: pre-introduce a PathToken type and use it where applicable
Closed	marmoute	D9491 copies-rust: add smarter approach for merging small mapping with large mapping
Closed	marmoute	D9426 copies-rust: hide most of the comparison details inside a closure
Closed	marmoute	D9425 copies-rust: move the mapping merging into a else clause
Closed	marmoute	D9424 copies-rust: extract conflicting value comparison in its own function
Closed	marmoute	D9423 copies: no longer cache the ChangedFiles during copy tracing
Closed	marmoute	D9422 copies: iterate over children directly (instead of parents)
Closed	marmoute	D9581 copies: document the current algorithm step

Diff 25865

rust/hg-cpython/src/copy_tracing.rs

	use cpython::ObjectProtocol;			use cpython::ObjectProtocol;
	use cpython::PyBytes;			use cpython::PyBytes;
	use cpython::PyDict;			use cpython::PyDict;
				use cpython::PyDrop;
	use cpython::PyList;			use cpython::PyList;
	use cpython::PyModule;			use cpython::PyModule;
	use cpython::PyObject;			use cpython::PyObject;
	use cpython::PyResult;			use cpython::PyResult;
	use cpython::PyTuple;			use cpython::PyTuple;
	use cpython::Python;			use cpython::Python;

	use hg::copy_tracing::ChangedFiles;			use hg::copy_tracing::ChangedFiles;
	}			}
	}			}

	pub fn data(&self) -> &[u8] {			pub fn data(&self) -> &[u8] {
	// Safety: the raw pointer is valid as long as the PyBytes is still			// Safety: the raw pointer is valid as long as the PyBytes is still
	// alive, and the returned slice borrows `self`.			// alive, and the returned slice borrows `self`.
	unsafe { &*self.data }			unsafe { &*self.data }
	}			}

				pub fn unwrap(self) -> PyBytes {
				self.keep_alive
				}
	}			}
	}			}

	/// Combines copies information contained into revision `revs` to build a copy			/// Combines copies information contained into revision `revs` to build a copy
	/// map.			/// map.
	///			///
	/// See mercurial/copies.py for details			/// See mercurial/copies.py for details
	pub fn combine_changeset_copies_wrapper(			pub fn combine_changeset_copies_wrapper(
	let tuple: PyTuple =			let tuple: PyTuple =
	rev_info.call(py, (rev_py,), None)?.cast_into(py)?;			rev_info.call(py, (rev_py,), None)?.cast_into(py)?;
	let p1 = tuple.get_item(py, 0).extract(py)?;			let p1 = tuple.get_item(py, 0).extract(py)?;
	let p2 = tuple.get_item(py, 1).extract(py)?;			let p2 = tuple.get_item(py, 1).extract(py)?;
	let opt_bytes = tuple.get_item(py, 2).extract(py)?;			let opt_bytes = tuple.get_item(py, 2).extract(py)?;
	Ok((rev, p1, p2, opt_bytes))			Ok((rev, p1, p2, opt_bytes))
	});			});

	let path_copies = if !multi_thread {			let path_copies;
				if !multi_thread {
	let mut combine_changeset_copies =			let mut combine_changeset_copies =
	CombineChangesetCopies::new(children_count);			CombineChangesetCopies::new(children_count);

	for rev_info in revs_info {			for rev_info in revs_info {
	let (rev, p1, p2, opt_bytes) = rev_info?;			let (rev, p1, p2, opt_bytes) = rev_info?;
	let files = match &opt_bytes {			let files = match &opt_bytes {
	Some(bytes) => ChangedFiles::new(bytes.data(py)),			Some(bytes) => ChangedFiles::new(bytes.data(py)),
	// Python None was extracted to Option::None,			// Python None was extracted to Option::None,
	// meaning there was no copy data.			// meaning there was no copy data.
	None => ChangedFiles::new_empty(),			None => ChangedFiles::new_empty(),
	};			};

	combine_changeset_copies.add_revision(rev, p1, p2, files)			combine_changeset_copies.add_revision(rev, p1, p2, files)
	}			}
	combine_changeset_copies.finish(target_rev)			path_copies = combine_changeset_copies.finish(target_rev)
	} else {			} else {
	// Use a bounded channel to provide back-pressure:			// Use a bounded channel to provide back-pressure:
	// if the child thread is slower to process revisions than this thread			// if the child thread is slower to process revisions than this thread
	// is to gather data for them, an unbounded channel would keep			// is to gather data for them, an unbounded channel would keep
	// growing and eat memory.			// growing and eat memory.
	//			//
	// TODO: tweak the bound?			// TODO: tweak the bound?
	let (rev_info_sender, rev_info_receiver) =			let (rev_info_sender, rev_info_receiver) =
	crossbeam_channel::bounded::<RevInfo<PyBytesWithData>>(1000);			crossbeam_channel::bounded::<RevInfo<PyBytesWithData>>(1000);

				// This channel (going the other way around) however is unbounded.
				// If they were both bounded, there might potentially be deadlocks
				// where both channels are full and both threads are waiting on each
				// other.
				let (pybytes_sender, pybytes_receiver) =
				crossbeam_channel::unbounded();

	// Start a thread that does CPU-heavy processing in parallel with the			// Start a thread that does CPU-heavy processing in parallel with the
	// loop below.			// loop below.
	//			//
	// If the parent thread panics, `rev_info_sender` will be dropped and			// If the parent thread panics, `rev_info_sender` will be dropped and
	// “disconnected”. `rev_info_receiver` will be notified of this and			// “disconnected”. `rev_info_receiver` will be notified of this and
	// exit its own loop.			// exit its own loop.
	let thread = std::thread::spawn(move \|\| {			let thread = std::thread::spawn(move \|\| {
	let mut combine_changeset_copies =			let mut combine_changeset_copies =
	CombineChangesetCopies::new(children_count);			CombineChangesetCopies::new(children_count);
	for (rev, p1, p2, opt_bytes) in rev_info_receiver {			for (rev, p1, p2, opt_bytes) in rev_info_receiver {
	let files = match &opt_bytes {			let files = match &opt_bytes {
	Some(raw) => ChangedFiles::new(raw.data()),			Some(raw) => ChangedFiles::new(raw.data()),
	// Python None was extracted to Option::None,			// Python None was extracted to Option::None,
	// meaning there was no copy data.			// meaning there was no copy data.
	None => ChangedFiles::new_empty(),			None => ChangedFiles::new_empty(),
	};			};
	combine_changeset_copies.add_revision(rev, p1, p2, files)			combine_changeset_copies.add_revision(rev, p1, p2, files);

	// The GIL is (still) implicitly acquired here through			// Send `PyBytes` back to the parent thread so the parent
	// `impl Drop for PyBytes`.			// thread can drop it. Otherwise the GIL would be implicitly
				// acquired here through `impl Drop for PyBytes`.
				if let Some(bytes) = opt_bytes {
				if let Err(_) = pybytes_sender.send(bytes.unwrap()) {
				// The channel is disconnected, meaning the parent
				// thread panicked or returned
				// early through
				// `?` to propagate a Python exception.
				break;
				}
				}
	}			}

	combine_changeset_copies.finish(target_rev)			combine_changeset_copies.finish(target_rev)
	});			});

	for rev_info in revs_info {			for rev_info in revs_info {
	let (rev, p1, p2, opt_bytes) = rev_info?;			let (rev, p1, p2, opt_bytes) = rev_info?;
	let opt_bytes = opt_bytes.map(\|b\| PyBytesWithData::new(py, b));			let opt_bytes = opt_bytes.map(\|b\| PyBytesWithData::new(py, b));

	// We’d prefer to avoid the child thread calling into Python code,			// We’d prefer to avoid the child thread calling into Python code,
	// but this avoids a potential deadlock on the GIL if it does:			// but this avoids a potential deadlock on the GIL if it does:
	py.allow_threads(\|\| {			py.allow_threads(\|\| {
	rev_info_sender.send((rev, p1, p2, opt_bytes)).expect(			rev_info_sender.send((rev, p1, p2, opt_bytes)).expect(
	"combine_changeset_copies: channel is disconnected",			"combine_changeset_copies: channel is disconnected",
	);			);
	});			});

				// Drop anything in the channel, without blocking
				for pybytes in pybytes_receiver.try_iter() {
				pybytes.release_ref(py)
				}
	}			}
	// We’d prefer to avoid the child thread calling into Python code,			// We’d prefer to avoid the child thread calling into Python code,
	// but this avoids a potential deadlock on the GIL if it does:			// but this avoids a potential deadlock on the GIL if it does:
	py.allow_threads(\|\| {			path_copies = py.allow_threads(\|\| {
	// Disconnect the channel to signal the child thread to stop:			// Disconnect the channel to signal the child thread to stop:
	// the `for … in rev_info_receiver` loop will end.			// the `for … in rev_info_receiver` loop will end.
	drop(rev_info_sender);			drop(rev_info_sender);

	// Wait for the child thread to stop, and propagate any panic.			// Wait for the child thread to stop, and propagate any panic.
	thread.join().unwrap_or_else(\|panic_payload\| {			thread.join().unwrap_or_else(\|panic_payload\| {
	std::panic::resume_unwind(panic_payload)			std::panic::resume_unwind(panic_payload)
	})			})
	})			});

				// Drop anything left in the channel
				for pybytes in pybytes_receiver.iter() {
				pybytes.release_ref(py)
				}
	};			};

	let out = PyDict::new(py);			let out = PyDict::new(py);
	for (dest, source) in path_copies.into_iter() {			for (dest, source) in path_copies.into_iter() {
	out.set_item(			out.set_item(
	py,			py,
	PyBytes::new(py, &dest.into_vec()),			PyBytes::new(py, &dest.into_vec()),
	PyBytes::new(py, &source.into_vec()),			PyBytes::new(py, &source.into_vec()),

This is an archive of the discontinued Mercurial Phabricator instance.

copies-rust: send PyBytes values back be dropped ino the parent threadClosedPublic