libgit2

mirror of https://git.proxmox.com/git/libgit2 synced 2025-05-31 09:43:08 +00:00

Author	SHA1	Message	Date
Edward Thomson	44e8af8f29	Merge pull request #3892 from mitesch/shared_buffer Use a shared buffer in calls of git_treebuilder_write to avoid heap contention	2017-01-21 22:51:50 +00:00
Edward Thomson	909d549436	giterr_set: consistent error messages Error messages should be sentence fragments, and therefore: 1. Should not begin with a capital letter, 2. Should not conclude with punctuation, and 3. Should not end a sentence and begin a new one	2016-12-29 12:26:03 +00:00
Michael Tesch	87aaefe20b	write_tree: use shared buffer for writing trees The function to write trees allocates a new buffer for each tree. This causes problems with performance when performing a lot of actions involving writing trees, e.g. when doing many merges. Fix the issue by instead handing in a shared buffer, which is then re-used across the calls without having to re-allocate between calls.	2016-12-12 10:46:05 -05:00
Carlos Martín Nieto	8977658519	tree: look for conflicts in the new tree when updating We look at whether we're trying to replace a blob with a tree during the update phase, but we fail to look at whether we've just inserted a blob where we're now trying to insert a tree. Update the check to look at both places. The test for this was previously succeeding due to the bug where we did not look at the sorted output.	2016-11-14 12:44:52 +01:00
Carlos Martín Nieto	b85929c523	tree: use the sorted update list in our loop The loop is made with the assumption that the inputs are sorted and not using it leads to bad outputs.	2016-11-14 12:44:01 +01:00
Patrick Steinhardt	901434b00f	common: cast precision specifiers to int	2016-11-14 10:07:55 +01:00
Patrick Steinhardt	4974e3a596	tree: validate filename and OID length when parsing object When parsing tree entries from raw object data, we do not verify that the tree entry actually has a filename as well as a valid object ID. Fix this by asserting that the filename length is non-zero as well as asserting that there are at least `GIT_OID_RAWSZ` bytes left when parsing the OID.	2016-10-07 09:18:55 +02:00
Edward Thomson	fdf14637d5	Merge pull request #3792 from edquist/misc Fix comment for GIT_FILEMODE_LINK	2016-05-26 00:58:43 -05:00
Carlos Martín Nieto	a2cb47130e	tree: handle removal of all entries in the updater When we remove all entries in a tree, we should remove that tree from its parent rather than include the empty tree.	2016-05-24 14:30:43 +02:00
Carlos Martín Nieto	5341230536	tree: plug leaks in the tree updater	2016-05-19 15:29:53 +02:00
Carlos Martín Nieto	6ee08d2cd0	tree: use the basename for the entry removal When we want to remove the file, use the basename as the name of the entry to remove, instead of the full one, which includes the directories we've inserted into the stack.	2016-05-19 15:22:02 +02:00
Carl Edquist	c8fb2e152a	Fix comment for GIT_FILEMODE_LINK 0120000 is symbolic link, not commit	2016-05-18 16:00:01 -05:00
Carlos Martín Nieto	9464f9ebc1	Introduce a function to create a tree based on a different one Instead of going through the usual steps of reading a tree recursively into an index, modifying it and writing it back out as a tree, introduce a function to perform simple updates more efficiently. `git_tree_create_updated` avoids reading trees which are not modified and supports upsert and delete operations. It is not as versatile as modifying the index, but it makes some common operations much more efficient.	2016-05-17 17:41:05 +02:00
Carlos Martín Nieto	f5c874a475	Plug a few leaks	2016-03-31 10:41:33 -04:00
Edward Thomson	e2e4bae9a0	tree: drop the now-unnecessary entries vector Remove the now-unnecessary entries vector. Add `git_array_search` to binary search through an array to accomplish this.	2016-03-22 06:21:13 -07:00
Carlos Martín Nieto	4ed9e939e2	tree: store the entries in a growable array Take advantage of the constant size of tree-owned arrays and store them in an array instead of a pool. This still lets us free them all at once but lets the system allocator do the work of fitting them in.	2016-03-20 12:01:45 +01:00
Carlos Martín Nieto	60a194aa86	tree: re-use the id and filename in the odb object Instead of copying over the data into the individual entries, point to the originals, which are already in a format we can use.	2016-03-20 11:00:12 +01:00
Carlos Martín Nieto	ea5bf6bbce	treebuilder: don't try to verify submodules exist in the odb Submodules don't exist in the objectdb and the code is making us try to look for a blob with its commit id, which is obviously not going to work. Skip the test if the user wants to insert a submodule.	2016-03-04 12:38:28 +01:00
Edward Thomson	2bbc7d3e56	treebuilder: validate tree entries (optionally) When `GIT_OPT_ENABLE_STRICT_OBJECT_CREATION` is turned on, validate the tree and parent ids given to treebuilder insertion.	2016-02-28 12:38:40 -05:00
Edward Thomson	aadad40592	tree: zap warnings around `size_t` vs `uint16_t`	2016-02-16 11:50:44 +00:00
Carlos Martín Nieto	fc43646965	tree: mark a tree as already sorted The trees are sorted on-disk, so we don't have to go over them again. This cuts almost a fifth of time spent parsing trees.	2015-12-06 23:17:19 +01:00
Carlos Martín Nieto	0174f21b0a	tree: use a specialised mode parse function Instead of going out to strtol, which is made to parse generic numbers, copy a parse function from git which is specialised for file modes.	2015-12-02 18:59:58 +01:00
Patrick Steinhardt	9487585ddc	tree: mark cloned tree entries as un-pooled When duplicating a `struct git_tree_entry` with `git_tree_entry_dup` the resulting structure is not allocated inside a memory pool. As we do a 1:1 copy of the original struct, though, we also copy the `pooled` field, which is set to `true` for pooled entries. This results in a huge memory leak as we never free tree entries that were duplicated from a pooled tree entry. Fix this by marking the newly duplicated entry as un-pooled.	2015-12-01 14:25:15 +01:00
Carlos Martín Nieto	95ae3520c5	tree: ensure the entry filename fits in 16 bits Return an error in case the length is too big. Also take this opportunity to have a single allocating function for the size and overflow logic.	2015-11-30 17:32:18 +01:00
Carlos Martín Nieto	ee42bb0e3d	tree: make path len uint16_t and avoid holes This reduces the size of the struct from 32 to 26 bytes, and leaves a single padding byte at the end of the struct (which comes from the zero-length array).	2015-11-28 19:21:52 +01:00
Carlos Martín Nieto	2580077fc2	tree: calculate the filename length once We already know the size due to the `memchr()` so use that information instead of calling `strlen()` on it.	2015-11-28 19:21:52 +01:00
Carlos Martín Nieto	ed970748b6	tree: pool the entry memory allocations These are rather small allocations, so we end up spending a non-trivial amount of time asking the OS for memory. Since these entries are tied to the lifetime of their tree, we can give the tree a pool so we speed up the allocations.	2015-11-28 19:21:51 +01:00
Carlos Martín Nieto	7132150ddf	tree: avoid advancing over the filename multiple times We've already looked at the filename with `memchr()` and then used `strlen()` to allocate the entry. We already know how much we have to advance to get to the object id, so add the filename length instead of looking at each byte again.	2015-11-28 19:21:51 +01:00
Carlos Martín Nieto	84511143fd	tree: add more correct error messages for not found Don't use the full path, as that's not what we are asserting does not exist, but just the subpath we were looking up.	2015-03-17 20:51:14 +01:00
Stefan Widgren	c8e02b8776	Remove extra semicolon outside of a function Without this change, compiling with gcc and pedantic generates warning: ISO C does not allow extra ‘;’ outside of a function.	2015-02-15 21:07:05 +01:00
Edward Thomson	f1453c59b2	Make our overflow check look more like gcc/clang's Make our overflow checking look more like gcc and clang's, so that we can substitute it out with the compiler intrinsics on platforms that support it. This means dropping the ability to pass `NULL` as an out parameter. As a result, the macros also get updated to reflect this as well.	2015-02-13 09:27:33 -05:00
Edward Thomson	2884cc42de	overflow checking: don't make callers set oom Have the ALLOC_OVERFLOW testing macros also simply set_oom in the case where a computation would overflow, so that callers don't need to.	2015-02-12 22:54:47 -05:00
Edward Thomson	392702ee2c	allocations: test for overflow of requested size Introduce some helper macros to test integer overflow from arithmetic and set error message appropriately.	2015-02-12 22:54:46 -05:00
Carlos Martín Nieto	208a2c8aef	treebuilder: rename _create() to _new() This function is a constructor, so let's name it like one and leave _create() for the reference functions, which do create/write the reference.	2014-12-27 12:09:11 +00:00
Edward Thomson	dce7b1a4e7	treebuilder: take a repository for path validation Path validation may be influenced by `core.protectHFS` and `core.protectNTFS` configuration settings, thus treebuilders can take a repository to influence their configuration.	2014-12-17 13:05:27 -05:00
Vicent Marti	62155257d2	tree: Check for `.git` with case insensitivity	2014-12-16 10:08:46 -06:00
Carlos Martín Nieto	7465e87399	index: fill the tree cache on write-tree An obvious place to fill the tree cache is on write-tree, as we're guaranteed to be able to fill in the whole tree cache. The way this commit does this is not the most efficient, as we read the root tree from the odb instead of filling in the cache as we go along, but it fills the cache such that successive operations (and persisting the index to disk) will be able to take advantage of the cache, and it reuses the code we already have for filling the cache. Filling in the cache as we create the trees would require some reallocation of the children vector, which is currently not possible with our pool implementation. A different data structure would likely allow us to perform this operation at a later date.	2014-10-10 19:43:42 +02:00
Carlos Martín Nieto	c2f8b21593	index: write out the tree cache extension Keeping the cache around after read-tree is only one part of the optimisation opportunities. In order to share the cache between program instances, we need to write the TREE extension to the index. Do so, taking the opportunity to rename 'entries' to 'entry_count' to match the name given in the format description. The included test is rather trivial, but works as a sanity check.	2014-10-10 19:43:42 +02:00
Carlos Martín Nieto	966fb20702	tree: free in error conditions As reported by coverity, we would leak some memory in error conditions.	2014-06-25 21:25:44 +02:00
Carlos Martín Nieto	fcc6006607	treentry: no need for manual size book-keeping We can simply ask the hashmap.	2014-06-10 15:14:13 +02:00
Carlos Martín Nieto	978fbb4c34	treebuilder: don't keep removed entries around If the user wants to keep a copy for themselves, they should make a copy. It adds unnecessary complexity to make sure the returned entries are valid until the builder is cleared.	2014-06-10 15:14:13 +02:00
Carlos Martín Nieto	4d3f1f9740	treebuilder: use a map instead of vector to store the entries Finding a filename in a vector means we need to resort it every time we want to read from it, which includes every time we want to write to it as well, as we want to find duplicate keys. A hash-map fits what we want to do much more accurately, as we do not care about sorting, but just the particular filename. We still keep removed entries around, as the interface let you assume they were going to be around until the treebuilder is cleared or freed, but in this case that involves an append to a vector in the filter case, which can now fail. The only time we care about sorting is when we write out the tree, so let's make that the only time we do any sorting.	2014-06-10 15:14:13 +02:00
Carlos Martín Nieto	2c11d2eeb5	treebuilder: insert sorted By inserting in the right position, we can keep the vector sorted, making entry insertion almost twice as fast.	2014-06-10 00:06:21 +02:00
Russell Belfer	882c774271	Convert pqueue to just be a git_vector This updates the git_pqueue to simply be a set of specialized init/insert/pop functions on a git_vector. To preserve the pqueue feature of having a fixed size heap, I converted the "sorted" field in git_vectors to a more general "flags" field so that pqueue could mix in its own flag. This had a bunch of ramifications because a number of places were directly looking at the vector "sorted" field - I added a couple new git_vector helpers (is_sorted, set_sorted) so the specific representation of this information could be abstracted.	2014-02-04 10:01:37 -08:00
Carlos Martín Nieto	f000ee4e5b	tree: remove legacy 'oid' naming Rename git_tree_entry_byoid() to _byid() as per the convention.	2014-01-25 08:15:44 +01:00
Carlos Martín Nieto	d541170c77	index: rename an entry's id to 'id' This was not converted when we converted the rest, so do it now.	2014-01-25 08:15:44 +01:00
Arthur Schreiber	529f342aba	Align git_tree_entry_dup.	2014-01-14 21:33:59 +01:00
Russell Belfer	26c1cb91be	One more rename/cleanup for callback err functions	2013-12-11 10:57:50 -08:00
Russell Belfer	25e0b1576d	Remove converting user error to GIT_EUSER This changes the behavior of callbacks so that the callback error code is not converted into GIT_EUSER and instead we propagate the return value through to the caller. Instead of using the giterr_capture and giterr_restore functions, we now rely on all functions to pass back the return value from a callback. To avoid having a return value with no error message, the user can call the public giterr_set_str or some such function to set an error message. There is a new helper 'giterr_set_callback' that functions can invoke after making a callback which ensures that some error message was set in case the callback did not set one. In places where the sign of the callback return value is meaningful (e.g. positive to skip, negative to abort), only the negative values are returned back to the caller, obviously, since the other values allow for continuing the loop. The hardest parts of this were in the checkout code where positive return values were overloaded as meaningful values for checkout. I fixed this by adding an output parameter to many of the internal checkout functions and removing the overload. This added some code, but it is probably a better implementation. There is some funkiness in the network code where user provided callbacks could be returning a positive or a negative value and we want to rely on that to cancel the loop. There are still a couple places where a user error might get turned into GIT_EUSER there, I think, though none exercised by the tests.	2013-12-11 10:57:49 -08:00
Russell Belfer	dab89f9b68	Further EUSER and error propagation fixes This continues auditing all the places where GIT_EUSER is being returned and making sure to clear any existing error using the new giterr_user_cancel helper. As a result, places that relied on intercepting GIT_EUSER but having the old error preserved also needed to be cleaned up to correctly stash and then retrieve the actual error. Additionally, as I encountered places where error codes were not being propagated correctly, I tried to fix them up. A number of those fixes are included in this commit as well.	2013-12-11 10:57:49 -08:00
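
Several of the tree-parsing commits listed above (4974e3a596, 0174f21b0a, 95ae3520c5, 7132150ddf, 60a194aa86) describe checks and shortcuts applied while reading a raw tree entry. The following is a minimal, standalone sketch of those ideas, not libgit2's actual code: every `sketch_` name is hypothetical, and `SKETCH_OID_RAWSZ` merely stands in for `GIT_OID_RAWSZ`, assumed here to be 20 bytes. It assumes the usual raw entry layout of octal mode, a space, the filename, a NUL byte, then the raw object ID.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define SKETCH_OID_RAWSZ 20 /* assumption: raw SHA-1 size, stands in for GIT_OID_RAWSZ */

/* Hypothetical entry type: points into the caller's buffer instead of copying. */
typedef struct {
    uint32_t mode;
    uint16_t filename_len;
    const char *filename;     /* NUL-terminated inside the raw buffer */
    const unsigned char *oid; /* SKETCH_OID_RAWSZ bytes inside the raw buffer */
} sketch_entry;

/* Parse an octal file mode terminated by a space, without going through strtol. */
static int sketch_parse_mode(uint32_t *mode_out, const char *buf, size_t len,
                             size_t *consumed)
{
    uint32_t mode = 0;
    size_t i = 0;

    while (i < len && buf[i] != ' ') {
        if (buf[i] < '0' || buf[i] > '7' || i >= 7)
            return -1; /* not an octal digit, or implausibly long */
        mode = (mode << 3) + (uint32_t)(buf[i] - '0');
        i++;
    }
    if (i == 0 || i == len)
        return -1; /* empty mode, or no space terminator found */

    *mode_out = mode;
    *consumed = i + 1; /* also skip the space */
    return 0;
}

/* Parse one entry from buf; on success, *used is the number of bytes consumed. */
int sketch_parse_entry(sketch_entry *out, const char *buf, size_t len, size_t *used)
{
    size_t pos = 0, name_len;
    const char *nul;

    assert(out != NULL && buf != NULL && used != NULL);

    if (sketch_parse_mode(&out->mode, buf, len, &pos) < 0)
        return -1;

    /* Locate the end of the filename once and reuse that length below. */
    nul = memchr(buf + pos, '\0', len - pos);
    if (nul == NULL)
        return -1;
    name_len = (size_t)(nul - (buf + pos));

    /* The filename must be non-empty and its length must fit in 16 bits. */
    if (name_len == 0 || name_len > UINT16_MAX)
        return -1;

    out->filename = buf + pos;
    out->filename_len = (uint16_t)name_len;
    pos += name_len + 1; /* advance by the known length, plus the NUL */

    /* There must be a full object ID left in the buffer. */
    if (len - pos < SKETCH_OID_RAWSZ)
        return -1;
    out->oid = (const unsigned char *)(buf + pos);
    pos += SKETCH_OID_RAWSZ;

    *used = pos;
    return 0;
}
```

A caller would invoke `sketch_parse_entry` in a loop, advancing the buffer by `*used` after each successful call and treating `-1` as a corrupt tree. Pointing `filename` and `oid` into the raw buffer only works while that buffer outlives the entries.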


214 Commits