Verifying hashsums of objects we are reading from the ODB may be costly
as we have to perform an additional hashsum calculation on the object.
Especially when reading large objects, the penalty can be as high as
35%, as can be seen when executing the equivalent of `git cat-file` with
and without verification enabled. To mitigate this, we add a global
option for libgit2 which enables developers to turn off the
verification, e.g. when they can be reasonably sure that the objects on
disk won't be corrupted.
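As a rough sketch of how such a switch might be flipped: the following
assumes the option is exposed through `git_libgit2_opts` under a name
like `GIT_OPT_ENABLE_STRICT_HASH_VERIFICATION`; the option name is
illustrative, as this text does not spell it out.

    #include <git2.h>

    int main(void)
    {
        git_libgit2_init();

        /*
         * Illustrative option name: turn off ODB hash verification
         * when the objects on disk are trusted, trading safety for
         * read performance.
         */
        git_libgit2_opts(GIT_OPT_ENABLE_STRICT_HASH_VERIFICATION, 0);

        /* ... object reads skip the extra hashing pass here ... */

        git_libgit2_shutdown();
        return 0;
    }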
The upstream git.git project verifies objects when looking them up from
disk. This avoids scenarios where objects have somehow become corrupt on
disk, e.g. due to hardware failures or bit flips. While our mantra is
usually to follow upstream behavior, we do not do so in this case, as we
never check hashes of objects we have just read from disk.
To fix this, we create a new error code `GIT_EMISMATCH` which denotes
that we have looked up an object with a hashsum mismatch. `odb_read_1`
will then, after having read the object from its backend, hash the
object and compare the resulting hash to the expected hash. If hashes do
not match, it will return an error.
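A sketch of what the check boils down to; the helper name and error
message are illustrative, while `git_odb_hash`, `git_oid_cmp` and the
internal error setter `giterr_set` are existing API and `GIT_EMISMATCH`
is the code introduced here.

    /*
     * After the backend has produced the raw object, re-hash the data
     * and compare it against the OID the caller asked for.
     */
    static int check_object_hash(const git_oid *expected,
        const void *data, size_t len, git_otype type)
    {
        git_oid actual;
        int error;

        if ((error = git_odb_hash(&actual, data, len, type)) < 0)
            return error;

        if (git_oid_cmp(expected, &actual) != 0) {
            giterr_set(GITERR_ODB, "hashsum mismatch in object database");
            return GIT_EMISMATCH;
        }

        return 0;
    }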
This obviously introduces another computation of checksums and could
potentially impact performance. Note though that we usually perform I/O
operations directly before doing this computation, and as such the
actual overhead should be drowned out by I/O. Running our test suite
seems to confirm this guess: on a Linux system with best-of-five
timings, we measured 21.592s with the check enabled and 21.590s with
the check disabled. Note though that our test suite mostly contains
very small blobs; repositories with bigger blobs may see a more
noticeable hit from this check.
In addition to a new test, we also had to change the
odb::backend::nonrefreshing test suite, which now triggers a hashsum
mismatch when looking up the commit "deadbeef...". This is expected, as
the fake backend allocated inside of the test will return an empty
object for the OID "deadbeef...", which will obviously not hash back to
"deadbeef..." again. We can simply adjust the hash to equal the hash of
the empty object here to fix this test.
We currently have no tests which check whether we fail reading corrupted
objects. Add one which modifies the contents of an object stored on
disk and then tries to read the object.
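A sketch of what such a test could look like with libgit2's clar
helpers; the fixture variable `g_repo`, the object path and the OID
are illustrative.

    git_oid oid;
    git_object *obj;
    FILE *fh;

    cl_git_pass(git_oid_fromstr(&oid,
        "1385f264afb75a56a5bec74243be9b367ba4ca08"));

    /* Flip a byte in the loose object's on-disk representation. */
    fh = fopen("testrepo.git/objects/13/85f264afb75a56a5bec74243be9b367ba4ca08", "r+b");
    cl_assert(fh != NULL);
    cl_assert(fseek(fh, -1, SEEK_END) == 0);
    cl_assert(fputc('X', fh) != EOF);
    fclose(fh);

    /* Reading the corrupted object back should now fail. */
    cl_git_fail(git_object_lookup(&obj, g_repo, &oid, GIT_OBJ_ANY));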
The object::lookup tests use the "testrepo.git" repository in a
read-only way, so we do not set up the repository as a sandbox but
simply open it. In a future commit, though, we will want to test
looking up objects which are corrupted in some way, which requires us
to modify the on-disk data. Doing so without a sandbox would modify
the contents of our own libgit2 repository.
Create the repository in a sandbox to avoid this.
In the odb::backend::nonrefreshing test suite, we set up a fake backend
so that we are able to determine if backend functions are called
correctly. During the setup, we also parse an OID which is later used
to read out the pseudo-object. While this procedure works right now, it
will create problems later when we implement hash verification for
looked up objects. The current OID ("deadbeef") will not match the hash
of contents we give back to the ODB layer and thus cannot be verified.
Make the hash configurable so that we can simply switch the returned
OID for individual tests.
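A sketch of how the setup might be parametrized; all names here are
illustrative, not the test suite's actual ones.

    #include <stdlib.h>
    #include <git2.h>
    #include <git2/sys/odb_backend.h>

    typedef struct {
        git_odb_backend parent;
        git_oid oid;   /* the single object this backend "contains" */
    } fake_backend;

    static int build_fake_backend(git_odb_backend **out, const char *hash)
    {
        fake_backend *fake = calloc(1, sizeof(*fake));

        if (fake == NULL)
            return -1;

        fake->parent.version = GIT_ODB_BACKEND_VERSION;
        /* ... wire up the read/exists callbacks here ... */

        if (git_oid_fromstr(&fake->oid, hash) < 0) {
            free(fake);
            return -1;
        }

        *out = &fake->parent;
        return 0;
    }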
The threads::diff test suite has a static variable `_retries`, which is
used on Windows platforms only. As it is unused on other systems, the
compiler throws a warning there. Fix the warning by wrapping the
declaration in an ifdef.
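The fix itself is tiny; assuming the variable is an `int` and using
libgit2's Windows guard macro, it amounts to:

    #ifdef GIT_WIN32
    static int _retries;
    #endif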
In the function `git_filter_list_stream_data`, we initialize, write to
and subsequently close the stream which should receive content
processed by the filter. While we skip writing to the stream if its
initialization failed, we still try to close it unconditionally -- even
though after a failed initialization the stream might not be set at
all, leading us to segfault.
The semantics of this code are not really clear. The function handling
the same logic for files instead of data seems to do the right thing
here in only closing the stream when initialization succeeded. Stepping
back a bit, this is the only reasonable contract: if a stream cannot be
initialized, the caller would not expect it to be closed again. As it
stands, both callers of `stream_list_init` get this wrong in one way or
another: the data streaming function will always close the stream, and
the file streaming function will not close the stream if writing to it
has failed.
The fix is thus two-fold:
- callers of `stream_list_init` now close the stream iff it has been
initialized
- `stream_list_init` now closes the lastly initialized stream if
the current stream in the chain failed to initialize
Add a test which segfaulted prior to these changes.
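An illustrative sketch of the corrected caller logic, using made-up
types and signatures rather than libgit2's actual ones:

    #include <stddef.h>

    typedef struct stream stream_t;
    extern int stream_list_init(stream_t **out);
    extern int stream_write(stream_t *s, const char *data, size_t len);
    extern void stream_close(stream_t *s);

    int stream_data(const char *data, size_t len)
    {
        stream_t *s = NULL;
        int error, initialized = 0;

        if ((error = stream_list_init(&s)) == 0) {
            initialized = 1;
            error = stream_write(s, data, len);
        }

        /*
         * Previously the stream was closed unconditionally here,
         * dereferencing an unset stream after a failed init.
         */
        if (initialized)
            stream_close(s);

        return error;
    }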
Whenever we rename a branch, we update the repository's symbolic HEAD
reference if it currently points to the branch that is to be renamed.
But with the introduction of worktrees, we also have to iterate over all
HEADs of linked worktrees to adjust them. Do so.
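A sketch of the added iteration; error handling is abbreviated, the
helper name is illustrative, and the worktree functions are the ones
introduced by this series.

    #include <string.h>
    #include <git2.h>

    static int retarget_worktree_heads(git_repository *repo,
        const char *old_ref, const char *new_ref)
    {
        git_strarray names = {0};
        size_t i;
        int error;

        if ((error = git_worktree_list(&names, repo)) < 0)
            return error;

        for (i = 0; i < names.count; i++) {
            git_worktree *wt = NULL;
            git_repository *wt_repo = NULL;
            git_reference *head = NULL;

            if (git_worktree_lookup(&wt, repo, names.strings[i]) < 0 ||
                git_repository_open_from_worktree(&wt_repo, wt) < 0)
                goto next;

            /* Repoint HEAD iff it is symbolic and targets the old branch. */
            if (git_reference_lookup(&head, wt_repo, "HEAD") == 0 &&
                git_reference_type(head) == GIT_REF_SYMBOLIC &&
                strcmp(git_reference_symbolic_target(head), old_ref) == 0)
                error = git_repository_set_head(wt_repo, new_ref);

    next:
            git_reference_free(head);
            git_repository_free(wt_repo);
            git_worktree_free(wt);
        }

        git_strarray_free(&names);
        return error;
    }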
POSIX emulation retries should be configurable so that tests can disable
them. In particular, aggressively multithreaded tests may end up
trying to open locked files and needing retries, which would slow
continuous integration runs significantly.
It is possible to specify submodule URLs relative to the repository
location. E.g. having a submodule with URL "../submodule" will look for
the submodule at "repo/../submodule".
With the introduction of worktrees, though, we cannot simply resolve the
URL relative to the repository location itself. If the repository for
which a URL is to be resolved is a working tree, we have to resolve the
URL relative to the parent's repository path. Otherwise, the URL would
change depending on where the working tree is located.
Fix this by special-casing working trees when computing the URL base.
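A sketch of the special case; the helper name is illustrative, while
`git_repository_is_worktree` and `git_repository_commondir` are the
accessors this series makes available.

    #include <git2.h>

    static const char *get_url_base(git_repository *repo)
    {
        /*
         * For a linked working tree, resolve relative to the parent
         * repository's location (the commondir) instead of the
         * worktree's own path.
         */
        if (git_repository_is_worktree(repo))
            return git_repository_commondir(repo);

        return git_repository_workdir(repo) != NULL ?
            git_repository_workdir(repo) :
            git_repository_path(repo);
    }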
References for a repository are usually created inside of its gitdir.
When using worktrees, though, these references are not to be created
inside the worktree gitdir, but instead inside the gitdir of its parent
repository, which is the commondir. This way, branches will still be
available after the worktree itself has been deleted.
The filesystem refdb currently still creates new references inside of
the gitdir. Fix this and have it create references in commondir.
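A sketch of the corrected path construction; the helper name is
illustrative and `git_buf_joinpath` is libgit2's internal path helper.

    /*
     * Build the on-disk path for a new loose reference from the
     * common directory rather than the gitdir. For non-worktree
     * repositories the commondir is the gitdir anyway.
     */
    static int loose_ref_path(git_buf *out, git_repository *repo,
        const char *refname)
    {
        const char *base = git_repository_commondir(repo);

        if (base == NULL)
            base = git_repository_path(repo);

        return git_buf_joinpath(out, base, refname);
    }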
The three link files "worktree/.git", ".git/worktrees/<name>/commondir"
and ".git/worktrees/<name>/gitdir" should always contain absolute and
resolved paths. Adjust the logic creating new worktrees to run the
paths through `git_path_prettify_dir` before writing out these files,
so that they are absolute and resolved.
The working tree's parent path should not point to the parent's
gitdir, but to the parent's working directory. Pointing to the gitdir
would not make much sense, as the parent's gitdir is equal to both
repositories' common directory anyway.
Fix the issue.
While we already provide functionality to open a repository from a
worktree, we cannot do so the other way round: given a repository, we
want to look up its worktree, if one actually exists.
Getting the worktree of a repository is useful when we want to access
meta information like the parent's location or the worktree's locked
status.
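A sketch of the new lookup direction; the function name follows the
series' API style and is an assumption here.

    #include <stdio.h>
    #include <git2.h>

    static void print_worktree_lock_state(git_repository *repo)
    {
        git_worktree *wt = NULL;

        if (git_worktree_open_from_repository(&wt, repo) < 0)
            return;

        /* A positive return value indicates the worktree is locked. */
        printf("locked: %d\n", git_worktree_is_locked(NULL, wt) > 0);
        git_worktree_free(wt);
    }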
The function `diff_parsed_alloc` allocates and initializes a
`git_diff_parsed` structure. This structure also contains diff options.
While we initialize its flags, we fail to do a real initialization of
its values. This bites us when we want to actually use the generated
diff, as we do not set the options' version field, which is required to
operate correctly.
Fix the issue by executing `git_diff_init_options` on the embedded
struct.
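Minimal usage of the initializer at the heart of the fix: it sets the
version field (and sane defaults), which zeroing the struct alone does
not.

    #include <git2.h>

    int init_opts_example(void)
    {
        git_diff_options opts;

        return git_diff_init_options(&opts, GIT_DIFF_OPTIONS_VERSION);
    }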
In a diff, the shortest possible hunk with a modification (as opposed
to a pure deletion) results from a file with a single line containing
only a single character, which is removed. Thus the following hunk
@@ -1 +1 @@
-a
+
is the shortest valid hunk modifying a line. The function parsing the
hunk body, though, assumes that there must always be at least 4 bytes
present to make up a valid hunk, which is obviously wrong in this case.
The absolute minimum number of bytes required for a modification is
actually 2 bytes, that is the "+" and the following newline. Note: if
there is no trailing newline, the assumption will not be violated, as
the diff will have a line "\ No newline at end of file" at its end.
This patch fixes the issue by lowering the number of bytes required.
The current logic of `git_diff_foreach` makes the assumption that all
diffs passed in are actually derived from generated diffs. With these
assumptions we try to derive the actual diff by inspecting either the
working directory files or blobs of a repository. This obviously cannot
work for diffs parsed from a file, where we do not necessarily have a
repository at hand.
Since the introduced split between parsed and generated patches, there
are multiple functions which help us handle patches generically,
regardless of where they stem from. Use these functions and remove the
old logic specific to generated patches. This allows re-using the same
code for invoking the callbacks on the deltas.
An untracked file in a submodule should not prevent a rebase from
starting. Even if the submodule's SHA is changed, and that file would
conflict with a new tracked file, it's still OK to start the rebase
and discover the conflict later.
Signed-off-by: David Turner <dturner@twosigma.com>
When fsync'ing files, fsync the parent directory in the case where we
rename a file into place, or create a new file, to ensure that the
directory entry is flushed correctly.
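A sketch of the technique on POSIX systems; the helper name is
illustrative.

    #include <fcntl.h>
    #include <unistd.h>

    /*
     * After renaming a file into place (or creating one), fsync the
     * containing directory so the directory entry itself reaches
     * stable storage.
     */
    static int fsync_parent_dir(const char *dir_path)
    {
        int fd, error = 0;

        if ((fd = open(dir_path, O_RDONLY)) < 0)
            return -1;

        if (fsync(fd) < 0)
            error = -1;

        close(fd);
        return error;
    }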