libgit2

mirror of https://git.proxmox.com/git/libgit2 synced 2026-01-07 17:10:47 +00:00

Author	SHA1	Message	Date
Pierre-Olivier Latour	ea66215d87	Removed some useless variable assignments	2014-10-27 09:19:07 -07:00
Vicent Marti	737b505116	hashsig: Export as a `sys` header	2014-10-01 12:03:24 +02:00
Alan Rogers	61bef72dc3	Start adding GIT_DELTA_UNREADABLE and GIT_STATUS_WT_UNREADABLE.	2014-05-20 23:57:40 +10:00
Russell Belfer	240f4af321	Add build option for diff internal statistics	2014-05-02 09:21:32 -07:00
Ben Straub	c8893d1f00	Use a portable cast	2014-02-24 13:39:04 -08:00
Ben Straub	8ac7da796b	Avoid casting warning	2014-02-24 05:32:05 -08:00
Carlos Martín Nieto	9950bb4e8d	diff: rename the file's 'oid' to 'id' In the same vein as the previous commits in this series.	2014-01-25 08:15:44 +01:00
Edward Thomson	410a8e6fed	Sometimes a zero byte file is just a zero byte file Don't go to the ODB to resolve zero byte files in the workdir	2014-01-22 18:31:25 -05:00
Russell Belfer	9cfce2735d	Cleanups, renames, and leak fixes This renames git_vector_free_all to the better git_vector_free_deep and also contains a couple of memory leak fixes based on valgrind checks. The fixes are specifically: failure to free global dir path variables when not compiled with threading on and failure to free filters from the filter registry that had not be initialized fully.	2013-12-12 12:11:38 -08:00
Russell Belfer	7e3ed41959	Fix up some valgrind leaks and warnings	2013-12-11 16:56:17 -08:00
Russell Belfer	fcd324c625	Add git_vector_free_all There are a lot of places that we call git__free on each item in a vector and then call git_vector_free on the vector itself. This just wraps that up into one convenient helper function.	2013-12-11 10:57:49 -08:00
Russell Belfer	9f77b3f6f5	Add config read fns with controlled error behavior This adds `git_config__lookup_entry` which will look up a key in a config and return either the entry or NULL if the key was not present. Optionally, it can either suppress all errors or can return them (although not finding the key is not an error for this function). Unlike other accessors, this does not normalize the config key string, so it must only be used when the key is known to be in normalized form (i.e. all lower-case before the first dot and after the last dot, with no invalid characters). This also adds three high-level helper functions to look up config values with no errors and a fallback value. The three functions are for string, bool, and int values, and will resort to the fallback value for any error that arises. They are: * `git_config__get_string_force` * `git_config__get_bool_force` * `git_config__get_int_force` None of them normalize the config `key` either, so they can only be used for internal cases where the key is known to be in normal format.	2013-12-11 10:57:49 -08:00
Ben Straub	5a52d6be4c	Check version earlier	2013-12-11 06:43:17 -08:00
Ben Straub	7fb4147f1f	Don't clobber whitespace settings	2013-12-06 13:38:59 -08:00
Ben Straub	628e92cdb3	Don't use weird return codes	2013-12-05 14:47:04 -08:00
Ben Straub	c56c6d6945	Implement GIT_DIFF_FIND_BY_CONFIG	2013-12-05 14:13:46 -08:00
Russell Belfer	f62c174d0d	GIT_DIFF_FIND_REMOVE_UNMODIFIED sounds better	2013-12-02 13:49:58 -08:00
Russell Belfer	97ad85b88d	Add GIT_DIFF_FIND_DELETE_UNMODIFIED flag When doing copy detection, it is often necessary to include UNMODIFIED records in the git_diff so they are available as source records for GIT_DIFF_FIND_COPIES_FROM_UNMODIFIED. Yet in the final diff, often you will not want to have these UNMODIFIED records. This adds a flag which marks these UNMODIFIED records for deletion from the diff list so they will be removed after the rename detect phase is over.	2013-12-02 13:30:05 -08:00
Russell Belfer	2123a17f83	Fix bug making split deltas a COPIED targets When FIND_COPIES is used in combination with BREAK_REWRITES for rename detection, there was a bug where the split MODIFIED delta was only used as a target for RENAME records and not for COPIED records. This fixes that, converting the split into a pair of DELETED and COPIED deltas when that circumstance arises.	2013-12-02 13:27:06 -08:00
Russell Belfer	e7c85120ea	More tests and fixed for merging reversed diffs There were a lot more cases to deal with to make sure that our merged (i.e. workdir-to-tree-to-index) diffs were matching the output of core Git.	2013-11-01 11:39:37 -07:00
Russell Belfer	3940310e29	Fix some of the glaring errors in GIT_DIFF_REVERSE These changes fix the basic problem with GIT_DIFF_REVERSE being broken for text diffs. The reversed diff entries were getting added to the git_diff correctly, but some of the metadata was kept incorrectly in a way that prevented the text diffs from being generated correctly. Once I fixed that, it became clear that it was not possible to merge reversed diffs correctly. This has a first pass at fixing that problem. We probably need more tests to make sure that is really fixed thoroughly.	2013-11-01 10:20:51 -07:00
Russell Belfer	74a627f045	Tweak to git_diff_delta structure for nfiles While the base git_diff_delta structure always contains two files, when we introduce conflict data, it will be helpful to have an indicator when an additional file is involved.	2013-10-21 09:07:19 -07:00
Russell Belfer	10672e3e45	Diff API cleanup This lays groundwork for separating formatting options from diff creation options. This groups the formatting flags separately from the diff list creation flags and reorders the options. This also tweaks some APIs to further separate code that uses patches from code that just looks at git_diffs.	2013-10-15 15:10:07 -07:00
Russell Belfer	3ff1d12373	Rename diff objects and split patch.h This makes no functional change to diff but renames a couple of the objects and splits the new git_patch (formerly git_diff_patch) into a new header file.	2013-10-11 14:51:54 -07:00
Vicent Martí	6208bd499b	Merge pull request #1804 from ethomson/rewrites Minor changes for rewrites	2013-09-03 12:29:18 -07:00
Edward Thomson	17c7fbf6d2	Split rewrites, status doesn't return rewrites Ensure that we apply splits to rewrites, even if we're not interested in examining it closely for rename/copy detection. In keeping with core git, status should not display rewrites, it should simply show files as "modified".	2013-08-28 08:30:19 -05:00
Russell Belfer	b6ac07b517	Trying to fix Win32 warnings	2013-08-22 14:45:10 -07:00
Russell Belfer	7edb74d374	Update rename src map for any split src When using a rename source that is actually a to-be-split record, we have to update the best-fit mapping data in both the case where the target is also a split record and the case where the target is a simple added record. Before this commit, we were only doing the update when the target was itself a split record (and even in that case, the test was slightly wrong).	2013-08-04 14:06:13 -07:00
Russell Belfer	d730d3f4f0	Major rename detection changes After doing further profiling, I found that a lot of time was being spent attempting to insert hashes into the file hash signature when using the rolling hash because the rolling hash approach generates a hash per byte of the file instead of one per run/line of data. To optimize this, I decided to convert back to a run-based file signature algorithm which would be more like core Git. After changing this, a number of the existing tests started to fail. In some cases, this appears to have been because the test was coded to be too specific to the particular results of the file similarity metric and in some cases there appear to have been bugs in the core rename detection code where only by the coincidence of the file similarity scoring were the expected results being generated. This renames all the variables in the core rename detection code to be more consistent and hopefully easier to follow which made it a bit easier to reason about the behavior of that code and fix the problems that I was seeing. I think it's in better shape now. There are a couple of tests now that attempt to stress test the rename detection code and they are quite slow. Most of the time is spent setting up the test data on disk and in the index. When we roll out performance improvements for index insertion, it should also speed up these tests I hope.	2013-07-31 16:40:42 -07:00
Russell Belfer	a16e41729d	Fix rename detection to use actual blob size The size data in the index may not reflect the actual size of the blob data from the ODB when content filtering comes into play. This commit fixes rename detection to use the actual blob size when calculating data signatures instead of the value from the index. Because of a misunderstanding on my part, I first converted the git_index_add_bypath API to use the post-filtered blob data size in creating the index entry. I backed that change out, but I kept the overall refactoring of that routine and the new internal git_blob__create_from_paths API because it eliminates an extra stat() call from the code that adds a file to the index. The existing tests actually cover this code path, at least when running on Windows, so at this point I'm not adding new tests to cover the changes.	2013-07-25 12:27:39 -07:00
Russell Belfer	effdbeb323	Make rename detection file size fix better The previous fix for checking file sizes with rename detection always loads the blob. In this version, if the odb backend can get the object header without loading the whole thing into memory, then we'll just use that, so that we can eliminate possible rename sources & targets without loading them.	2013-07-24 17:48:37 -07:00
Russell Belfer	a5140f4dda	Fix rename detection for tree-to-tree diffs The performance improvements I introduced for rename detection were not able to run successfully for tree-to-tree diffs because the blob size was not known early enough and so the file signature always had to be calculated nonetheless. This change separates loading blobs into memory from calculating the signature. I can't avoid having to load the large blobs into memory, but by moving it forward, I'm able to avoid the signature calculation if the blob won't come into play for renames.	2013-07-24 17:11:49 -07:00
Russell Belfer	f5c4d02251	Fix incorrect comment	2013-07-24 13:44:35 -07:00
Russell Belfer	18e9efc425	Don't check rename if file size difference is huge	2013-07-24 13:10:16 -07:00
Edward Thomson	d55bed1a25	don't include ignored as rename candidates	2013-07-17 16:55:00 -05:00
nulltoken	c4ac556ee7	Fix compilation warnings	2013-06-29 13:27:55 +02:00
Russell Belfer	e4acc3ba19	Fix rename looped reference issues This makes the diff rename tracking code more careful about the order in which it processes renames and more thorough in updating the mapping of correct renames when an earlier rename update alters the index of a later matched pair.	2013-06-18 16:14:35 -07:00
Russell Belfer	a1683f28ce	More tests and bug fixes for status with rename This changes the behavior of the status RENAMED flags so that they will be combined with the MODIFIED flags if appropriate. If a file is modified in the index and also renamed, then the status code will have both the GIT_STATUS_INDEX_MODIFIED and INDEX_RENAMED bits set. If it is renamed but the OID has not changed, then just the GIT_STATUS_INDEX_RENAMED bit will be set. Similarly, the flags GIT_STATUS_WT_MODIFIED and GIT_STATUS_WT_RENAMED can both be set independently of one another. This fixes a serious bug where the check for unmodified files that was done at data load time could end up erasing the RENAMED state of a file that was renamed with no changes. Lastly, this contains a bunch of new tests for status with renames, including tests where the only rename changes are case changes. The expected results of these tests have to vary by whether the platform uses a case sensitive filesystem or not, so the expected data covers those platform differences separately.	2013-06-17 10:03:49 -07:00
Russell Belfer	37f66e8263	Fix Windows warnings This fixes problems with missing function prototypes and 64-bit data issues on Windows.	2013-06-12 15:21:21 -07:00
Vicent Martí	88c401bec8	Merge pull request #1643 from ethomson/rename_source Keep data about source of similarity	2013-06-12 14:54:32 -07:00
Edward Thomson	690bf41ce5	keep source similarity in rename detection	2013-06-10 15:18:38 -05:00
Russell Belfer	114f5a6c41	Reorganize diff and add basic diff driver This is a significant reorganization of the diff code to break it into a set of more clearly distinct files and to document the new organization. Hopefully this will make the diff code easier to understand and to extend. This adds a new `git_diff_driver` object that looks of diff driver information from the attributes and the config so that things like function content in diff headers can be provided. The full driver spec is not implemented in the commit - this is focused on the reorganization of the code and putting the driver hooks in place. This also removes a few #includes from src/repository.h that were overbroad, but as a result required extra #includes in a variety of places since including src/repository.h no longer results in pulling in the whole world.	2013-06-10 10:10:39 -07:00
Russell Belfer	49f70f2c37	Fill out diff rename test coverage This extends the rename tests to make sure that every rename scenario in the inner loop of git_diff_find_similar is actually exercised. Also, fixes an incorrect assert that was in one of the clauses that was not previously being exercised.	2013-05-23 15:48:06 -07:00
Russell Belfer	67db583dab	More diff rename tests; better split swap handling This adds a couple more tests of different rename scenarios. Also, this fixes a problem with the case where you have two "split" deltas and the left half of one matches the right half of the other. That case was already being handled, but in the wrong order in a way that could result in bad output. Also, if the swap also happened to put the other two halves into the correct place (i.e. two files exchanged places with each other), then the second delta was left with the SPLIT flag set when it really should be cleared.	2013-05-23 15:06:07 -07:00
Russell Belfer	c68b09dc7a	Fix dereference of freed delta I was accidentally using a value that I had just freed. This moves the clearing of the delta internal flags into a better place.	2013-05-23 11:52:34 -07:00
Russell Belfer	a21cbb12db	Significant rename detection rewrite This flips rename detection around so instead of creating a forward mapping from deltas to possible rename targets, instead it creates a reverse mapping, looking at possible targets and trying to find a source that they could have been renamed or copied from. This is important because each output can only have a single source, but a given source could map to multiple outputs (in the form of COPIED records). Additionally, this makes a couple of tweaks to the public rename detection APIs, mostly renaming a couple of options that control the behavior to make more sense and to be more like core Git. I walked through the tests looking at the exact results and updated the expectations based on what I saw. The new code is different from the old because it cannot give some nonsense results (like A was renamed to both B and C) which were part of the outputs previously.	2013-05-22 10:37:12 -07:00
Russell Belfer	9be5be47fb	More git_diff_find_similar improvements - Add new GIT_DIFF_FIND_EXACT_MATCH_ONLY flag to do similarity matching without using the similarity metric (i.e. only compare the SHA). - Clean up the similarity measurement code to more rigorously distinguish between files that are not similar and files that are not comparable (previously, a 0 could either mean that the files could not be compared or that they were totally different) - When splitting a MODIFIED file into a DELETE/ADD pair, actually make a DELETED/UNTRACKED pair if the right side of the diff is from the working directory. This prevents an odd mix of ADDED and UNTRACKED files on workdir diffs.	2013-05-20 13:37:21 -07:00
Russell Belfer	d958e37a48	Fix issues with git_diff_find_similar There are a number of bugs in the rename code that only were obvious when I started testing it against large old repos with more complex patterns. (The code to do that testing is not ready to merge with libgit2, but I do plan to add more thorough tests.) This contains a significant number of changes and also tweaks the public API slightly to make emulating core git easier. Most notably, this separates the GIT_DIFF_FIND_AND_BREAK_REWRITES flag into FIND_REWRITES (which adds a self-similarity score to every modified file) and BREAK_REWRITES (which splits the modified deltas into add/remove pairs in the diff list). When you do a raw output of core git, rewrites show up as M090 or such, not at A and D output, so I wanted to be able to emulate that. Publicly, this also changes the flags to be uint16_t since we don't need values out of that range. Internally, this contains significant changes from a number of small bug fixes (like using the wrong side of the diff to decide if the object could be found in the ODB vs the workdir) to larger issues about which files can and should be compared and how the various edge cases of similarity scores should be treated. Honestly, I don't think this is the last update that will have to be made to this code, but I think this moves us closer to correct behavior and I tried to document the code so it would be easier to follow..	2013-05-17 17:21:45 -07:00
Vicent Martí	7159620044	Merge pull request #1588 from arrbee/fixes-for-checkout-and-diff Bug fixes for checkout and diff	2013-05-15 15:47:46 -07:00
Russell Belfer	09fae31d45	Improve robustness of diff rename detection Under some strange circumstances, diffs can end up listing files that we can't actually open successfully. Instead of aborting the git_diff_find_similar, this makes it so that those files just won't be considered as valid rename/copy targets instead.	2013-05-15 14:58:26 -07:00

1 2

68 Commits