mirrors/jgit - jgit - source @ dussan.org

Commit Graph

Author	SHA1	Message	Date
Matthias Sohn	8ada9048c5	Replace ExpectedException which was deprecated in junit 4.13 Change-Id: I64b0c057dd0a12aef2f3d56fa0c8a10e3b23fffd Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	4 years ago
Matthias Sohn	5c5f7c6b14	Update EDL 1.0 license headers to new short SPDX compliant format This is the format given by the Eclipse legal doc generator [1]. [1] https://www.eclipse.org/projects/tools/documentation.php?id=technology.jgit Bug: 548298 Change-Id: I8d8cabc998ba1b083e3f0906a8d558d391ffb6c4 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	4 years ago
Carsten Hammer	74bc50125d	Replace chain of if statements with switch and switch over strings where possible. Sometimes if statements are chained and form a series of comparisons against constants. Using switch statements improves readability. Bug: 545856 Change-Id: Iacb78956ee5c20db4d793e6b668508ec67466606 Signed-off-by: Carsten Hammer <carsten.hammer@t-online.de> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	5 years ago
Thomas Wolf	c020848dbb	Close ObjectReaders in tests Change-Id: Ic839af4f85861fb48091d45679e2a614f001d770 Signed-off-by: Thomas Wolf <thomas.wolf@paranor.ch>	4 years ago
Han-Wen Nienhuys	e5880d9531	reftable: pass OutputStream at construction time This makes the intended use of the classes more clear. It also simplifies generic functions that write reftables: they only need a ReftableWriter as argument, as the stream is carried within the ReftableWriter. Change-Id: Idbb06f89ae33100f0c0b562cc38e5b3b026d5181 Signed-off-by: Han-Wen Nienhuys <hanwen@google.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	4 years ago
Matthias Sohn	8f7e851346	[error prone] suppress AmbiguousMethodReference in AnyObjectId Move the implementation of the static equals() method to a new method and suppress the error. Deprecate the old method to signal that we intend to remove it in the next major release. See https://errorprone.info/bugpattern/AmbiguousMethodReference Change-Id: I5e29c97f4db3e11770be589a6ccd785e2c9ac7f2 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	4 years ago
Patrick Hiesel	93a0e50ed0	Introduce RefDatabase#getTipsWithSha1 to list refs pointing to object Add resolveTipSha1, an inverse of exactRef(String ...), to RefDatabase and provide a default implementation that runs in O(n) time where n is the number of refs. For RefTable, provide an implementation that runs in O(log(n)) time. [ifrade@google.com: with tests in InMemoryRepositoryTest to exercise the reftable code path, too] Change-Id: I2811ccd0339cdc1c74b42cce2ea003f07a2ce9e1 Signed-off-by: Patrick Hiesel <hiesel@google.com> Signed-off-by: Ivan Frade <ifrade@google.com>	5 years ago
Ivan Frade	9d97d2404f	DfsRefDatabase: Keep update index when peeling a reference The new references created in the peeling do not receive the update index. In other words, the update index of a reference (if set) is lost in the peeling. Pass-through the update index to the newly created references. Tested via InMemoryRepository, which uses DfsReftableDatabase. Change-Id: I7ff7c737a9c3366fdec296a4d9b2e51d10227957 Signed-off-by: Ivan Frade <ifrade@google.com>	5 years ago
Minh Thai	2b7fa04dad	Fix bug in copyPackBypassCache's skip 'PACK' header logic Bug caused the pack to be 12 bytes short when cold cache. Also added test for copyPackAsIs method. Change-Id: Idf8fb0e50d1215245d4b032e2e00df4b218c115f Signed-off-by: Minh Thai <mthai@google.com>	5 years ago
Ivan Frade	39b27f7c7b	DfsFsck: Check that .gitmodules in the repository have valid contents Previous commits block the addition to the repo of dangerous .gitmodules files, but some could have been committed before those safeguards where in place. Add a check in DfsFsck to validate the .gitmodules files in the repo. Use the same validation than the ReceivePack, translating the results to FsckErrors. Note that all .gitmodules files in the storage will be checked, not only the latest version. Change-Id: I040cf1f31a779419aad0292ba5e6e76eb7f32b66 Signed-off-by: Ivan Frade <ifrade@google.com>	5 years ago
Dave Borowitz	5c02ce52d6	Allow overriding DfsPackDescription comparator for scanning packs Provide a factory for comparators that use the default heuristics except with a different ordering of PackSources. Change-Id: I0809b64deb3d0486040076946fdbdad650d69240	6 years ago
Dave Borowitz	96512f5d3b	Move DfsPackDescription comparators to common location There are several ways of comparing DfsPackDescriptions for different purposes, such as object lookup search order and reftable ordering. Some of these are later compounded into comparators on other objects, so they appear in the code as Comparator<DfsReftable>, for example. Put all the DfsPackDescription comparators in static methods on DfsPackDescription itself. Stop implementing Comparable, to avoid giving the impression that there is always one true and correct way of sorting packs. Change-Id: Ia5ca65249c13373f7ef5b8a5d1ad50a26577706c	6 years ago
Dave Borowitz	e7bacf0a7f	Use Comparators for PackSource Rather than requiring callers to do their own computations based on the package-private "category" number, provide an actual Comparator<PackSource> instance, and explicitly discourage usage of default Enum comparison. Construct the default comparator using a builder pattern based on defining equivalence classes. This gives us the same behavior as the old category field in PackSource, with an abstraction that does not leak the implementation detail of comparing rank numbers. Change-Id: I6757211397ab1bc181d61298e073f88b69dbefc3	6 years ago
David Pursehouse	1f6d43a652	Fix trivial usages of deprecated Repository#getAllRefs Callers of getAllRefs that only iterate over the `values()` of the returned map can be trivially fixed to call getRefDatabase().getRefs() instead. Only fix those where the calling method is already declared to throw IOException, to avoid potential API changes. Change-Id: I2b05f785077a1713953cfd42df7bf915f889f90b Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	6 years ago
Minh Thai	159da6dacc	Break down DfsBlockCache stats by pack file extension. Change-Id: Iaecf0580279b33e3e2439784528cae7b69fb28bc Signed-off-by: Minh Thai <mthai@google.com>	6 years ago
Minh Thai	15a189e4e0	Add flag for keeping ref tombstones in GC reftable A tombstone will prevent a delayed reference update from resurrecting the deleted reference. Change-Id: Id9f4df43d435a299ff16cef614821439edef9b11 Signed-off-by: Minh Thai <mthai@google.com>	6 years ago
Shawn Pearce	7cd5d77ae3	dfs: Switch InMemoryRepository to DfsReftableDatabase This ensure DfsReftableDatabase is tested by the same test suites that use/test InMemoryRepository. It also simplifies the logic of InMemoryRepository and brings its compatibility story closer to any other DFS repository that uses reftables for its reference storage. Change-Id: I881469fd77ed11a9239b477633510b8c482a19ca Signed-off-by: Minh Thai <mthai@google.com> Signed-off-by: Terry Parker <tparker@google.com>	6 years ago
Shawn Pearce	d13dfac9dc	dfs: write reftable from DfsGarbageCollector If a ReftableConfig has been supplied by the caller, write out a reftable as a sibling of the the GC pack, alongside the heads. To bootstrap from a non-reftable system, the refs are read from the DfsRefDatabase if no GC reftables are present. Its assumed the references are fully current, and do not need to be merged with any other reftables. Any non-GC reftables will be pruned at the end of the GC cycle, just like any packs that were replaced. If a GC reftable is present, all existing reftables are compacted, and references from DfsRefDatabase are only used to seed the packer. Its assumed these are consistent with each other. Change-Id: Ie397eb58aaaefb6865c816d9b39de3ac12998019	6 years ago
Zhen Chen	b0695e5b7b	Add commit check for head references Make sure all refs/heads/* point to a commit object. Change-Id: I9c7cf347aaf63d5ef604d520c2383c6cf3043890 Signed-off-by: Zhen Chen <czhen@google.com>	6 years ago
Zhen Chen	673acfc6bd	Add connectivity check from references Make sure all objects referenced by references are reachable. Stop at the first missing object. Change-Id: Ifcd7392c4321b17d9290bd87f038bc62bc10dabb Signed-off-by: Zhen Chen <czhen@google.com>	6 years ago
Zhen Chen	2c2999643f	Add dfs fsck implementation JGit already had some fsck-like classes like ObjectChecker which can check for an individual object. The read-only FsckPackParser which will parse all objects within a pack file and check it with ObjectChecker. It will also check the pack index file against the object information from the pack parser. Change-Id: Ifd8e0d28eb68ff0b8edd2b51b2fa3a50a544c855 Signed-off-by: Zhen Chen <czhen@google.com>	7 years ago
Shawn Pearce	a1e4825c4e	dfs: silence resource warnings in DfsBlockCacheTest Change-Id: Ia934d8578592dc20837944d50acfb8920e260893	6 years ago
Shawn Pearce	f414f7de1f	dfs: Fix DataFormatException: 0 bytes to inflate When a file uses a different block size (e.g. 500) than the cache (e.g. 512), and the DfsPackFile's blockSize field has not been initialized, the cache misaligns block loads. The cache uses its default of 512 to compute the block alignment instead of the file's 500. This causes DfsReader try to set an empty range into an Inflater, resulting in an object being unable to load. Change-Id: I7d6352708225f62ef2f216d1ddcbaa64be113df6	6 years ago
Shawn Pearce	3a072c8a7a	dfs: test for repositories sharing blocks in DfsBlockCache Simple test to verify two DfsRepository instances will reuse the same DfsBlocks in the DfsBlockCache, even though the DfsStreamKey instance is now different between their DfsPackFile instances. Change-Id: I409c109142dea488d189b9ac0d3c319755dce7b4	6 years ago
Shawn Pearce	8d27c480df	dfs: Fix build break caused by DfsStreamKey.of signature change Change-Id: I6c49cf42a04dd0d96cfe0751f500a51f56f0bdb8	6 years ago
Shawn Pearce	07f98a8b71	Derive DfsStreamKey from DfsPackDescription By making this a deterministic function, DfsBlockCache can stop retaining a map of every DfsPackDescription it has ever seen. This fixes a long standing memory leak in DfsBlockCache. This refactoring also simplifies the idea of setting up more lightweight objects around streams. Change-Id: I051e7b96f5454c6b0a0e652d8f4a69c0bed7f6f4	6 years ago
Shawn Pearce	e924de5295	Rename DfsPackKey to DfsStreamKey This renaming supports reusing DfsStreamKey in a future commit to index other PackExt type streams inside of the DfsBlockCache. Change-Id: Ib52d374e47724ccb837f4fbab1fc85c486c5b408	6 years ago
Terry Parker	8dd53135cb	Add a new singlePack option to PackConfig If set, "singlePack" will create a single GC pack file for all objects reachable from refs/. If not set, the GC pack will contain object reachable from refs/heads/ and refs/tags/*, and the GC_REST pack will contain all other reachable objects. Change-Id: I56bcb6a9da2c10a0909c2f940c025db6f3acebcb Signed-off-by: Terry Parker <tparker@google.com>	7 years ago
Shawn Pearce	1513a5632d	Allow DfsReader to be subclassed Necessary if a DFS implementation wants to override close() to record DfsReaderIoStats. Change-Id: I144575f9bf1abf2c1fd72030550c4f0795fcf44d	7 years ago
Thirumala Reddy Mutchukota	5e250e45be	Delete expired garbage even when there is no GC pack present. Delete the condition to check whether the garbage pack creation time is older than the last GC operation, because it's not possible to find the last GC operation time when there is no GC pack. Add additional tests to make sure the contents of the expired garbage packs are considered during the GC operation and any actively referenced objects from the garbage packs are copied successfully into the GC pack before deleting the garbage pack. Change-Id: I09e8b2656de8ba7f9b996724ad1961d908e937b6 Signed-off-by: Thirumala Reddy Mutchukota <thirumala@google.com>	7 years ago
David Pursehouse	3b4448637f	Enable and fix warnings about redundant specification of type arguments Since the introduction of generic type parameter inference in Java 7, it's not necessary to explicitly specify the type of generic parameters. Enable the warning in Eclipse, and fix all occurrences. Change-Id: I9158caf1beca5e4980b6240ac401f3868520aad0 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	7 years ago
Shawn Pearce	d67b183537	Prefer smaller GC files during DFS garbage collection In `8ac65d33ed` PackWriter changed its behavior to always prefer the last object representation presented to it by the ObjectReuseAsIs implementation. This was a fix to avoid delta chain cycles. Unfortunately it can lead to suboptimal compression when concurrent GCs are run on the same repository. One case is automatic GC running (with default settings) in parallel to a manual GC that has disabled delta reuse in order to generate new smaller deltas for the entire history of the repository. Running GC with no-reuse generally requires more CPU time, which also translates to a longer running time. This can lead to a race where the automatic GC completes before the no-reuse GC, leaving the repository in a state such as: no-reuse GC: size 1 GiB, mtime = 18:45 auto GC: size 8 GiB, mtime = 17:30 With the default sort ordering, the smaller no-reuse GC pack is sorted earlier in the pack list, due to its more recent mtime. During object reuse in a future GC, these smaller representations are considered first by PackWriter, but are all discarded when the auto GC file from 17:30 is examined second (due to its older mtime). Work around this in two ways. Well formed DFS repositories should have at most 1 GC pack. If 2 or more GC packs exist, break the sorting tie by selecting the smaller file earlier in the pack list. This allows all normal read code paths to favor the smaller file, which places less pressure on the DfsBlockCache. If any GC race happens, readers serving clone requests will prefer the file that is smaller. During object reuse, flip this ordering so that the smaller file is last. This allows PackWriter to see smaller deltas last, replacing larger representations that were previously considered from other pack files. Change-Id: I0b7dc8bb9711c82abd6bd16643f518cfccc6d31a	7 years ago
Thirumala Reddy Mutchukota	006f4d4d29	Reintroduce garbage pack coalescing when ttl > 0. Disabling the garbage pack coalescing when garbageTtl > 0 can result in lot of garbage packs if they are created within the garbageTtl time. To avoid a large number of garbage packs, re-introducing garbage pack coalescing for the packs that are created within a single calendar day when the garbageTtl is more than one day or one third of the garbageTtl. Change-Id: If969716aeb55fb4fd0ff71d75f41a07638cd5a69 Signed-off-by: Thirumala Reddy Mutchukota <thirumala@google.com>	7 years ago
Thirumala Reddy Mutchukota	c9f55032a2	Record the estimated size of the pack files. The Compacter and Garbage Collector will record the estimated size of the newly going to be created compact, gc or garbage packs. This information can be used by the clients to better make a call on how to actually store the pack based on the approximated expected size. Added a new protected method DfsObjDatabase.newPack(PackSource packSource, long estimatedPackSize), so that the clients can override this method to make use of the estimatedPackSize while creating a new PackDescription object. The default implementation of this method is equivalent to newPack(packSource).setEstimatedPackSize(estimatedPackSize). I didn't make it abstract because that would force all the existing sub classes of DfsObjDatabase to implement this method. Due to this default implementation, the estimatedPackSize is added to DfsPackDescription using a setter instead of a constructor parameter (even though constructor parameter would be a better choice as this value is set only during the object creation). Change-Id: Iade1122633ea774c2e842178a6a6cbb4a57b598b Signed-off-by: Thirumala Reddy Mutchukota <thirumala@google.com>	7 years ago
Philipp Marx	8adbfe4da6	Check that DfsBlockCache#blockSize is a power of 2 In case a value is used which isn’t a power of 2 there will be a high chance of java.lang.ArrayIndexOutBoundsException and org.eclipse.jgit.errors.CorruptObjectException due to a mismatching assumption for the DfsBlockCache#blockSizeShift parameter. Change-Id: Ib348b3704edf10b5f93a3ffab4fa6f09cbbae231 Signed-off-by: Philipp Marx <smigfu@googlemail.com>	7 years ago
Mike Williams	fd527a2cd7	Prune UNREACHABLE_GARBAGE packs when they expire DfsGarbageCollector will now enforce a maximum time to live (TTL) for UNREACHABLE_GARBAGE packs. The default TTL is 1 day, which should be enough time to avoid races with other processes that are inserting data into the repository. Change-Id: Id719e6e2a03cfc9a0c0aef8ed71d261dda14bd0c Signed-off-by: Mike Williams <miwilliams@google.com>	8 years ago
Dave Borowitz	0d6ba84065	DfsInserter: Optionally disable existing object check When using a DfsInserter for high-throughput insertion of many objects (analogous to git-fast-import), we don't necessarily want to do a random object lookup for each. It'll be faster from the inserter's perspective to insert the duplicate objects and let a later GC handle the deduplication. Change-Id: Ic97f5f01657b4525f157e6df66023f1f07fc1851	8 years ago
Dave Borowitz	adff322a69	Expose the ObjectInserter that created an ObjectReader We've found in Gerrit Code Review that it is common to pass around both an ObjectReader (or more commonly a RevWalk wrapping one) and an ObjectInserter. These code paths often assume that the ObjectReader can read back any objects created by the ObjectInserter without flushing. However, we previously had no way to enforce that constraint programmatically, leading to hard-to-spot problems. Provide a solution by exposing the ObjectInserter that created an ObjectReader, when known. Callers can either continue passing both objects and check: reader.getCreatedFromInserter() == inserter or they can just pass around ObjectReader and extract the inserter when it's needed (checking that it's not null at usage time). Change-Id: Ibbf5d1968b506f6b47030ab1b046ffccb47352ea	8 years ago
Matthias Sohn	7ee184acfa	Fix imports in DfsInserterTest - remove unused import of AnyObjectId - auto-sort import statements Change-Id: I1c7cec2734bd58370a7dfae70a6a4ccbe3e304ce Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	8 years ago
Mike Williams	c4d73fb7cc	Insert duplicate objects to prevent race during garbage collection. Prior to this change, DfsInserter would not insert an object into a pack if it already existed in another pack in the repository, even if that pack was unreachable. Consider this sequence of events: - Object FOO is pushed to a repository. - Subsequent ref changes make FOO UNREACHABLE_GARBAGE. - FOO is subsequently re-inserted using a DfsInserter, but skipped due to existing in UNREACHABLE_GARBAGE. - The repository is repacked; FOO will not be written into a new pack because it is not yet reachable from a reference. If the UNREACHABLE_GARBAGE packs are deleted, FOO disappears. - A reference is updated to reference FOO. This reference is now broken as FOO was removed when the repacking process deleted the UNREACHABLE_GARBAGE pack that stored the only copy of FOO. The garbage collector can't safely delete the UNREACHABLE_GARBAGE pack because FOO might be in the middle of being re-inserted/re-packed. This change writes a duplicate copy of an object if it only exists in UNREACHABLE_GARBAGE. This "freshens" the object to give it a chance to survive long enough to be made reachable through a reference. Change-Id: I20f2062230f3af3bccd6f21d3b7342f1152a5532 Signed-off-by: Mike Williams <miwilliams@google.com>	8 years ago
Matthias Sohn	686124bec3	Replace deprecated release() methods by close() See the discussion [1] in the Gerrit mailing list. [1] https://groups.google.com/forum/#!topic/repo-discuss/RRQT_xCqz4o Change-Id: I2c67384309c5c2e8511a7d0d4e088b4e95f819ff Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	9 years ago
Shawn Pearce	8ff08455f6	Fix memory leak in dfs.DeltaBaseCase The LRU chain management code was broken leading to situations where the chain was incomplete. This prevented the cache from removing items when it exceeded its memory target, causing a leak. One case was repeated hit on the head of the chain. moveToHead(e) was invoked linking the head back to itself in a cycle orphaning the rest of the table. Add some unit tests to cover this and a few other paths. Change-Id: Ib27486eaa1b1d2bf1c745a56d0a5832bfb029322	9 years ago
Shawn Pearce	d70419ab00	Revert "Add a method to DfsOutputStream to read as an InputStream" This reverts commit `b646578d89`. openInputStream() is never used in JGit, nor is it used by any known working DFS implementation. The method was added as a utility for reading back from a DfsInserter, but the final implementation of that feature does not requrire this method. Change-Id: I075ad95e40af49c92b554480f8993ef5658f7684	9 years ago
Dave Borowitz	e1856dbf44	Add a method to ObjectInserter to read back inserted objects In the DFS implementation, flushing an inserter writes a new pack to the storage system and is potentially very slow, but was the only way to ensure previously-inserted objects were available. For some tasks, like performing a series of three-way merges, the total size of all inserted objects may be small enough to avoid flushing the in-memory buffered data. DfsOutputStream already provides a read method to read back from the not-yet-flushed data, so use this to provide an ObjectReader in the DFS case. In the file-backed case, objects are written out loosely on the fly, so the implementation can just return the existing WindowCursor. Change-Id: I454fdfb88f4d215e31b7da2b2a069853b197b3dd	11 years ago
Dave Borowitz	b646578d89	Add a method to DfsOutputStream to read as an InputStream Change-Id: I0ec1f17a88bc14f22c10f9bc8d6f5b5118410e3a	11 years ago

45 Commits (8ada9048c5add754c3b34851b1bd501ce28b3321)