Download Raw Diff

Details

Reviewers

None

Group Reviewers

Commits

rHGa56487081109: store: don't read the whole fncache in memory

Summary

In large repositories with lot of files, the fncache grows more than 100 MB and
reading that whole thing into memory slows things down. Let's not read the whole
thing into memory.

This patch changes fncache loading code to read 1 MB at once. Loading 1 MB at
once saves ~1 sec on perffncacheload for our internal repository. I tried
various values such as 0.5 MB, 5 MB, 10 MB but best results were produced using
1 MB as the chunksize.

On a narrow clone with fncache around 40 MB, this patch saves ~0.04 seconds on
average on perffncacheload.

To test the code, I have coded an extension in test-fncache.t which set
chunksize to 1 byte, and the test passes with that.

Diff Detail

Repository

rHG Mercurial

Lint

Automatic diff as part of commit; lint not applicable.

Unit

Automatic diff as part of commit; unit tests not applicable.

Event Timeline

pulkit created this revision.Nov 22 2018, 7:27 AM

Herald added a reviewer: hg-reviewers. · View Herald TranscriptNov 22 2018, 7:27 AM

Herald added subscribers: mercurial-devel, mjpieters. · View Herald Transcript

pulkit edited the summary of this revision. (Show Details)Nov 22 2018, 7:28 AM

diff --git a/mercurial/store.py b/mercurial/store.py

a/mercurial/store.py

+++ b/mercurial/store.py
@@ -461,13 +461,13 @@

skip nonexistent file self.entries = set() return

self.entries = set(decodedir(fp.read()).splitlines())

if '' in self.entries:

fp.seek(0)

for n, line in enumerate(util.iterfile(fp)):

if not line.rstrip('\n'):

t = _('invalid entry in fncache, line %d') % (n + 1)

raise error.Abort(t)

+ self.entries = set()
+ for n, line in enumerate(util.iterfile(fp)):
+ entry = line.rstrip('\n')
+ if not entry:
+ t = _('invalid entry in fncache, line %d') % (n + 1)
+ raise error.Abort(t)
+ self.entries.add(decodedir(entry))

This goes the opposite direction to 9fca5b056c0a. I guess the current code
would be faster if the fncache is around ~10MB. Can you benchmark it?

In D5296#78834, @yuja wrote:

diff --git a/mercurial/store.py b/mercurial/store.py

a/mercurial/store.py

+++ b/mercurial/store.py
@@ -461,13 +461,13 @@

skip nonexistent file self.entries = set() return

self.entries = set(decodedir(fp.read()).splitlines())

if '' in self.entries:

fp.seek(0)

for n, line in enumerate(util.iterfile(fp)):

if not line.rstrip('\n'):

t = _('invalid entry in fncache, line %d') % (n + 1)

raise error.Abort(t)

+ self.entries = set()
+ for n, line in enumerate(util.iterfile(fp)):
+ entry = line.rstrip('\n')
+ if not entry:
+ t = _('invalid entry in fncache, line %d') % (n + 1)
+ raise error.Abort(t)
+ self.entries.add(decodedir(entry))

This goes the opposite direction to 9fca5b056c0a. I guess the current code
would be faster if the fncache is around ~10MB. Can you benchmark it?

Yeah the current code is much faster if fncache is not large.

Seeing the performance benefit it brings on our repo, I want to try other ways we can do this. Do we like having a conditional which checks the size of fncache and choose one of the approaches depending on how large it is?

If size(fncache) < 50M, use the current approach
else, use the approach described in this patch

@yuja what do you think?

Ping!

I suspect 9fca5b056c0a made things faster because the code before was using 1 I/O operation for every entry. I would also not be surprised if CPython from that era did something very inefficient with regards to line reading.

The current code is pretty bad because it buffers the entire file content in memory! I agree we should change it.

I like this patch as written. If profiling shows it to be slow, I think there is room to optimize util.iterfile() or even to teach the vfs layer how to efficiently open files for line-based I/O. This is something I could help optimize if needed.

While I'm here, the fncache file being a newline delimited list of full file paths is kinda ridiculous. We could do much better by using compression and/or a more complicated data structure. It is kinda silly that we have to load this decoded data structure into memory. So if your file on disk is ~100MB, you are going to have a Python set that also consumes ~100MB. That's really not great.

(resend without the "On ... wrote:" line)

Seeing the performance benefit it brings on our repo, I want to try other ways we can do this. Do we like having a conditional which checks the size of fncache and choose one of the approaches depending on how large it is?
If size(fncache) < 50M, use the current approach
else, use the approach described in this patch

Suppose the current code is fast because it avoids running Python in loop,
I think it can be extended to not use too much memory.

chunk = b''
while True:
    chunk += fp.read(chunk_size)  # maybe ~10MB?
    p = chunk.rindex(b'\n')  # need to handle error
    decodedir(chunk[:p + 1])...
    chunk = chunk[p + 1:]

Just an idea. Not profiled.

pulkit edited the summary of this revision. (Show Details)Feb 27 2019, 9:05 AM

pulkit updated this revision to Diff 14255.

pulkit added a parent revision: D6030: store: move logic to check for invalid entry in fncache to own function.Feb 27 2019, 9:06 AM

In D5296#87892, @yuja wrote:
(resend without the "On ... wrote:" line)
Seeing the performance benefit it brings on our repo, I want to try other ways we can do this. Do we like having a conditional which checks the size of fncache and choose one of the approaches depending on how large it is?
If size(fncache) < 50M, use the current approach
else, use the approach described in this patch
Suppose the current code is fast because it avoids running Python in loop,
I think it can be extended to not use too much memory.
chunk = b''
while True:
    chunk += fp.read(chunk_size)  # maybe ~10MB?
    p = chunk.rindex(b'\n')  # need to handle error
    decodedir(chunk[:p + 1])...
    chunk = chunk[p + 1:]
Just an idea. Not profiled.

I implemented this idea. It saves ~1 sec on perffncacheload for us when chunksize is taken 1 MB.

In D5296#87890, @indygreg wrote:

I suspect 9fca5b056c0a made things faster because the code before was using 1 I/O operation for every entry. I would also not be surprised if CPython from that era did something very inefficient with regards to line reading.
The current code is pretty bad because it buffers the entire file content in memory! I agree we should change it.
I like this patch as written. If profiling shows it to be slow, I think there is room to optimize util.iterfile() or even to teach the vfs layer how to efficiently open files for line-based I/O. This is something I could help optimize if needed.
While I'm here, the fncache file being a newline delimited list of full file paths is kinda ridiculous. We could do much better by using compression and/or a more complicated data structure. It is kinda silly that we have to load this decoded data structure into memory. So if your file on disk is ~100MB, you are going to have a Python set that also consumes ~100MB. That's really not great.

I agree. We should either use compression or use a much complicated data structure here. Right now there is no way to check whether an entry exists or not without parsing the whole fncache. It does not affect much on small repositories but on large repositories such as mozilla, the difference of parsing the whole fncache is noticable. We have perffncacheload taking ~4 seconds :/

pulkit updated this revision to Diff 14256.Feb 27 2019, 12:47 PM

pulkit edited the summary of this revision. (Show Details)Feb 27 2019, 1:05 PM

self.entries = set(decodedir(fp.read()).splitlines())

+
+ self.entries = []
+ totalsize = self.vfs.stat('fncache').st_size

I don't think totalsize has to be stat()ed. We can just loop over until
fp.read() reaches to EOF. It's unreliable to assume that the stat('fncache')
see the identical file to the fp open()ed before.

+ chunksize = (10 ** 6) # 1 MB
+ chunk = b''
+ chunksize = min(totalsize, chunksize)

Do we have any test covering totalsize > chunksize case?

+ totalsize -= chunksize
+ while chunksize:
+ chunk += fp.read(chunksize)
+ try:
+ p = chunk.rindex(b'\n')
+ self.entries.extend(decodedir(chunk[:p + 1]).splitlines())

Any reason to not build a set directly?

+ chunk = chunk[p + 1:]
+ except ValueError:
+ # substring '\n' not found, maybe the entry is bigger than the
+ # chunksize, so let's keep iterating
+ pass
+ chunksize = min(totalsize, chunksize)
+ totalsize -= chunksize
+
+ self.entries = set(self.entries)

pulkit updated this revision to Diff 14529.Mar 16 2019, 8:33 PM

In D5296#88016, @yuja wrote:

self.entries = set(decodedir(fp.read()).splitlines())

+
+ self.entries = []
+ totalsize = self.vfs.stat('fncache').st_size

I don't think totalsize has to be stat()ed. We can just loop over until
fp.read() reaches to EOF. It's unreliable to assume that the stat('fncache')
see the identical file to the fp open()ed before.

+ chunksize = (10 ** 6) # 1 MB
+ chunk = b''
+ chunksize = min(totalsize, chunksize)

Do we have any test covering totalsize > chunksize case?

Right now this patch does not have any tests. How should I add them?

add a debug config option and pass that to fncachestore and then to fncache
have a function which returns the chunk_size, write an extenion in tests which wrap that function and enable that extension in tests

I think 2) is better option here. I am unable to think of any solution expect this two. I think you will have better ideas here. What do you think?

+ for c in iter(functools.partial(fp.read, chunksize), b''):
+ chunk += c
+ try:
+ p = chunk.rindex(b'\n')
+ self.entries |= set(decodedir(chunk[:p + 1]).splitlines())

Nit: you can entries.update(any_iterable) which would be slightly faster
when bucket size changes.

Right now this patch does not have any tests. How should I add them?
1. add a debug config option and pass that to fncachestore and then to fncache
2. have a function which returns the chunk_size, write an extenion in tests which wrap that function and enable that extension in tests

Something like 2. chunksize can be a module constant, which can later be
updated by an extension. That's probably the easiest option.

pulkit edited the summary of this revision. (Show Details)Mar 17 2019, 10:54 AM

pulkit updated this revision to Diff 14535.

pulkit updated this revision to Diff 14541.Mar 18 2019, 7:32 AM

Closed by commit rHGa56487081109: store: don't read the whole fncache in memory (authored by pulkit). · Explain WhyMar 18 2019, 8:29 AM

This revision was automatically updated to reflect the committed changes.

Queued, thanks.

+ self.entries = set()
+ chunk = b''
+ for c in iter(functools.partial(fp.read, fncache_chunksize), b''):
+ chunk += c
+ try:
+ p = chunk.rindex(b'\n')
+ self.entries.update(decodedir(chunk[:p + 1]).splitlines())
+ chunk = chunk[p + 1:]
+ except ValueError:
+ # substring '\n' not found, maybe the entry is bigger than the
+ # chunksize, so let's keep iterating
+ pass

We might want to check if the chunk is fully consumed. If the file doesn't
end with '\n', which I think is invalid though, the last line would be silently
ignored.

			Path	Packages
M			mercurial/store.py (19 lines)
M			tests/test-fncache.t (14 lines)

Diff	ID	Description	Created	Lint	Unit
Base		Base
Diff 1	12583		Nov 22 2018, 7:26 AM	★	★
Diff 2	14255		Feb 27 2019, 9:05 AM	★	★
Diff 3	14256		Feb 27 2019, 12:47 PM	★	★
Diff 4	14529		Mar 16 2019, 8:33 PM	★	★
Diff 5	14535		Mar 17 2019, 10:54 AM	★	★
Diff 6	14541		Mar 18 2019, 7:32 AM	★	★
Diff 7	14542	rHGa564870811098d6721a6be5110cd6982b370c32e	Nov 22 2018, 7:14 AM	★	★

	Status	Author	Revision
	Closed	pulkit	D5296 store: don't read the whole fncache in memory
	Closed	pulkit	D6030 store: move logic to check for invalid entry in fncache to own function

	# store.py - repository store handling for Mercurial			# store.py - repository store handling for Mercurial
	#			#
	# Copyright 2008 Matt Mackall <mpm@selenic.com>			# Copyright 2008 Matt Mackall <mpm@selenic.com>
	#			#
	# This software may be used and distributed according to the terms of the			# This software may be used and distributed according to the terms of the
	# GNU General Public License version 2 or any later version.			# GNU General Public License version 2 or any later version.

	from __future__ import absolute_import			from __future__ import absolute_import

	import errno			import errno
				import functools
	import hashlib			import hashlib
	import os			import os
	import stat			import stat

	from .i18n import _			from .i18n import _
	from . import (			from . import (
	error,			error,
	node,			node,
	policy,			policy,
	pycompat,			pycompat,
	util,			util,
	vfs as vfsmod,			vfs as vfsmod,
	)			)

	parsers = policy.importmod(r'parsers')			parsers = policy.importmod(r'parsers')
				# how much bytes should be read from fncache in one read
				# It is done to prevent loading large fncache files into memory
				fncache_chunksize = 10 ** 6

	def _matchtrackedpath(path, matcher):			def _matchtrackedpath(path, matcher):
	"""parses a fncache entry and returns whether the entry is tracking a path			"""parses a fncache entry and returns whether the entry is tracking a path
	matched by matcher or not.			matched by matcher or not.

	If matcher is None, returns True"""			If matcher is None, returns True"""

	if matcher is None:			if matcher is None:
	'''fill the entries from the fncache file'''			'''fill the entries from the fncache file'''
	self._dirty = False			self._dirty = False
	try:			try:
	fp = self.vfs('fncache', mode='rb')			fp = self.vfs('fncache', mode='rb')
	except IOError:			except IOError:
	# skip nonexistent file			# skip nonexistent file
	self.entries = set()			self.entries = set()
	return			return
	self.entries = set(decodedir(fp.read()).splitlines())
				self.entries = set()
				chunk = b''
				for c in iter(functools.partial(fp.read, fncache_chunksize), b''):
				chunk += c
				try:
				p = chunk.rindex(b'\n')
				self.entries.update(decodedir(chunk[:p + 1]).splitlines())
				chunk = chunk[p + 1:]
				except ValueError:
				# substring '\n' not found, maybe the entry is bigger than the
				# chunksize, so let's keep iterating
				pass

	self._checkentries(fp)			self._checkentries(fp)
	fp.close()			fp.close()

	def _checkentries(self, fp):			def _checkentries(self, fp):
	""" make sure there is no empty string in entries """			""" make sure there is no empty string in entries """
	if '' in self.entries:			if '' in self.entries:
	fp.seek(0)			fp.seek(0)
	for n, line in enumerate(util.iterfile(fp)):			for n, line in enumerate(util.iterfile(fp)):

	#require repofncache			#require repofncache

				An extension which will set fncache chunksize to 1 byte to make sure that logic
				does not break

				$ cat > chunksize.py <<EOF
				> from __future__ import absolute_import
				> from mercurial import store
				> store.fncache_chunksize = 1
				> EOF

				$ cat >> $HGRCPATH <<EOF
				> [extensions]
				> chunksize = $TESTTMP/chunksize.py
				> EOF

	Init repo1:			Init repo1:

	$ hg init repo1			$ hg init repo1
	$ cd repo1			$ cd repo1
	$ echo "some text" > a			$ echo "some text" > a
	$ hg add			$ hg add
	adding a			adding a
	$ hg ci -m first			$ hg ci -m first

This is an archive of the discontinued Mercurial Phabricator instance.

store: don't read the whole fncache in memory
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents
Changeset List

Diff 14542

mercurial/store.py

tests/test-fncache.t

This is an archive of the discontinued Mercurial Phabricator instance.

store: don't read the whole fncache in memoryClosedPublic

Details

Diff Detail

Event Timeline

Revision ContentsChangeset List

Diff 14542

mercurial/store.py

tests/test-fncache.t

store: don't read the whole fncache in memory
ClosedPublic

Revision Contents
Changeset List