* How to prevent Git from compressing certain files? @ 2009-07-22 19:49 Dirk Süsserott 2009-07-22 20:44 ` Jakub Narebski ` (2 more replies) 0 siblings, 3 replies; 5+ messages in thread From: Dirk Süsserott @ 2009-07-22 19:49 UTC (permalink / raw) To: Git Mailing List Hi, I'm (ab)using Git to store my media files, i.e. digicam pictures (*.jpg) and the like. This way I can e.g. comment a series of pictures without installing and learning a special purpose "Photo Archiving" tool. Gitk shows the roadmap! Somewhere I read that Git isn't supposed to efficiently handle binary files. Of course, I don't want to merge my files, just store them with their history and git-push them to some "safe place". I figured that pushing and git gc'ing both try to compress those files (or differences) really hard. Works great for "regular" files, but is pointless with jpegs. Question: Is there a way to prevent Git from trying to compress certain files based on their extension? -- Dirk ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: How to prevent Git from compressing certain files? 2009-07-22 19:49 How to prevent Git from compressing certain files? Dirk Süsserott @ 2009-07-22 20:44 ` Jakub Narebski 2009-07-22 20:46 ` Jakub Narebski 2009-07-23 10:12 ` Jeff King 2 siblings, 0 replies; 5+ messages in thread From: Jakub Narebski @ 2009-07-22 20:44 UTC (permalink / raw) To: Dirk Süsserott; +Cc: Git Mailing List, Junio C Hamano Dirk Süsserott <newsletter@dirk.my1.cc> writes: > I'm (ab)using Git to store my media files, i.e. digicam pictures (*.jpg) > and the like. This way I can e.g. comment a series of pictures without > installing and learning a special purpose "Photo Archiving" tool. Gitk > shows the roadmap! > > Somewhere I read that Git isn't supposed to efficiently handle binary > files. Of course, I don't want to merge my files, just store them with > their history and git-push them to some "safe place". > > I figured that pushing and git gc'ing both try to compress those files > (or differences) really hard. Works great for "regular" files, but is > pointless with jpegs. > > Question: Is there a way to prevent Git from trying to compress certain > files based on their extension? There is _undocumented_ "delta" gitattribute, introduced in a74db82 (Teach "delta" attribute to pack-objects., 2007-05-19), which you have to unset. Like this: == .gitattributes == *.jpg -delta -- Jakub Narebski Poland ShadeHawk on #git ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: How to prevent Git from compressing certain files? 2009-07-22 19:49 How to prevent Git from compressing certain files? Dirk Süsserott 2009-07-22 20:44 ` Jakub Narebski @ 2009-07-22 20:46 ` Jakub Narebski 2009-07-23 10:12 ` Jeff King 2 siblings, 0 replies; 5+ messages in thread From: Jakub Narebski @ 2009-07-22 20:46 UTC (permalink / raw) To: Dirk Süsserott; +Cc: Git Mailing List, Junio C Hamano Dirk Süsserott <newsletter@dirk.my1.cc> writes: > I'm (ab)using Git to store my media files, i.e. digicam pictures (*.jpg) > and the like. This way I can e.g. comment a series of pictures without > installing and learning a special purpose "Photo Archiving" tool. Gitk > shows the roadmap! > > Somewhere I read that Git isn't supposed to efficiently handle binary > files. Of course, I don't want to merge my files, just store them with > their history and git-push them to some "safe place". > > I figured that pushing and git gc'ing both try to compress those files > (or differences) really hard. Works great for "regular" files, but is > pointless with jpegs. > > Question: Is there a way to prevent Git from trying to compress certain > files based on their extension? There is _undocumented_ "delta" gitattribute, introduced by Junio C Hamano in a74db82 (Teach "delta" attribute to pack-objects., 2007-05-19), which you would have to unset. Like this: == .gitattributes == *.jpg -delta -- Jakub Narebski Poland ShadeHawk on #git ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: How to prevent Git from compressing certain files? 2009-07-22 19:49 How to prevent Git from compressing certain files? Dirk Süsserott 2009-07-22 20:44 ` Jakub Narebski 2009-07-22 20:46 ` Jakub Narebski @ 2009-07-23 10:12 ` Jeff King 2009-07-23 18:51 ` Dirk Süsserott 2 siblings, 1 reply; 5+ messages in thread From: Jeff King @ 2009-07-23 10:12 UTC (permalink / raw) To: Dirk Süsserott; +Cc: Git Mailing List On Wed, Jul 22, 2009 at 09:49:31PM +0200, Dirk Süsserott wrote: > Somewhere I read that Git isn't supposed to efficiently handle binary > files. Of course, I don't want to merge my files, just store them with > their history and git-push them to some "safe place". Git handles binary files better than many systems. The downsides are: - you can't do file-level diffing and merging very well, for obvious reasons (though actually, git is better than most; it makes it easy to look at both sides individually and pick the one you want). - really _big_ files can give lousy performance. Git assumes single files can fit into memory, which means files in the gigabyte range (or hundreds of megabytes if your machine is old :) ) can be awful. It also means that things like inexact rename detection and finding delta candidates can be slow. > I figured that pushing and git gc'ing both try to compress those files > (or differences) really hard. Works great for "regular" files, but is > pointless with jpegs. > > Question: Is there a way to prevent Git from trying to compress certain > files based on their extension? There are actually two types of compression that git uses: delta compression between similar objects in packs, and zlib compression of loose objects and objects within packs. You almost certainly don't want zlib compression on your jpegs, as they are already compressed. You can turn off zlib compression entirely by setting core.compression to 0. Unfortunately, this turns off compression for _all_ objects, which means in a mixed-use repo you won't be compressing your text (and even in a photos-only repo, you are not compressing your commit messages). Delta compression between two jpegs, or between two versions of a jpeg where the image data itself was modified, is unlikely to be useful. However, if you use EXIF metadata in the file, then you will save a lot of space between versions with the same image data, but different metadata. So it's worth leaving delta compression on in that case, and probably turning it off otherwise. As Jakub mentioned, you can use the delta gitattribute for just your jpegs. You can also turn off deltas entirely by setting pack.window to 0, though you may be losing some benefit on your non-blob objects. -Peff ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: How to prevent Git from compressing certain files? 2009-07-23 10:12 ` Jeff King @ 2009-07-23 18:51 ` Dirk Süsserott 0 siblings, 0 replies; 5+ messages in thread From: Dirk Süsserott @ 2009-07-23 18:51 UTC (permalink / raw) To: Git Mailing List; +Cc: Jeff King, Jakub Narebski Jeff, Jakub, thanks for the answers and explanation of how and when Git compresses data. Setting both core.compression and pack.window to 0 sped up my git-gc :-) Dirk ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2009-07-23 18:52 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2009-07-22 19:49 How to prevent Git from compressing certain files? Dirk Süsserott 2009-07-22 20:44 ` Jakub Narebski 2009-07-22 20:46 ` Jakub Narebski 2009-07-23 10:12 ` Jeff King 2009-07-23 18:51 ` Dirk Süsserott
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).