* Workflow: split repository? @ 2007-10-12 12:21 Jan Wielemaker 2007-10-12 14:30 ` Jan Hudec 0 siblings, 1 reply; 3+ messages in thread From: Jan Wielemaker @ 2007-10-12 12:21 UTC (permalink / raw) To: Git Mailing List Hi, I've got a big active project, until yesterday managed using CVS. As with any distributed academic research project the repository has become a nice mess where most files are in the wrong place and there are several almost independent sub-projects living in directories. The plan is/was to * Convert everything to GIT (done, through cvs2svn) * Everyone keeps hacking on their bits, while one is starting to reorganise the structure by moving files and directories and changing import headers, and other file references in a GIT branch. * Now we merge the continued work and the reorganisation to end up with a nice clean hierarchy :-) * Split the big project into multiple projects. One of the reasons is that we want to make part of them public. Others we cannot publish as they contain copyrighted data. I understand we can reunite them using GIT sub modules. Does this make sense? While splitting we want to *loose* history information for some of the projects. That is easy: simply create a new repository from the current files. For some however we would like to *preserve* the history. This means we would like to pick a hierarchy with its history. After quite a bit of reading, I get the impression this cannot be done. Am I right? Is the only way to create a GIT repositiory right away from a subset of the CVS for which we want to preserve the history? Thanks --- Jan ^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Workflow: split repository? 2007-10-12 12:21 Workflow: split repository? Jan Wielemaker @ 2007-10-12 14:30 ` Jan Hudec 2007-10-12 14:57 ` Jan Wielemaker 0 siblings, 1 reply; 3+ messages in thread From: Jan Hudec @ 2007-10-12 14:30 UTC (permalink / raw) To: Jan Wielemaker; +Cc: Git Mailing List [-- Attachment #1: Type: text/plain, Size: 2633 bytes --] On Fri, Oct 12, 2007 at 14:21:39 +0200, Jan Wielemaker wrote: > Hi, > > I've got a big active project, until yesterday managed using CVS. As > with any distributed academic research project the repository has become > a nice mess where most files are in the wrong place and there are > several almost independent sub-projects living in directories. > > The plan is/was to > > * Convert everything to GIT (done, through cvs2svn) > * Everyone keeps hacking on their bits, while one is starting > to reorganise the structure by moving files and directories > and changing import headers, and other file references in > a GIT branch. > * Now we merge the continued work and the reorganisation to > end up with a nice clean hierarchy :-) > * Split the big project into multiple projects. One of the > reasons is that we want to make part of them public. Others > we cannot publish as they contain copyrighted data. I understand > we can reunite them using GIT sub modules. > > Does this make sense? It might make more sense to convert bit by bit, to separate git repositories. Would save you some git-filter-branch work. > While splitting we want to *loose* history information for some of the > projects. That is easy: simply create a new repository from the current > files. For some however we would like to *preserve* the history. This > means we would like to pick a hierarchy with its history. After quite > a bit of reading, I get the impression this cannot be done. Am I right? It can, but you have to be aware of the pitfalls. Git allows you to create a new history, which is defined modification of the original history. There is git-filter-branch command, that can create a repository with just a subtree and such. But it's a new, independent, history. You can't merge between the old and new one (but you can rebase the few commits someone made while you were converting) and anyone who has the old history in his repo will still have it. > Is the only way to create a GIT repositiory right away from a subset of > the CVS for which we want to preserve the history? No, it's not. It will save you work if you can do as much splitting as possible during the conversion, ie. convert the bits you know will be separate separately (and combine them using submodules as appropriate). But if you have bits that will take a lot of work to factor out, you can convert to git, make the other code ready to use a submodule and than use git-filter-branch to extract the right bits of history for the submodule. -- Jan 'Bulb' Hudec <bulb@ucw.cz> [-- Attachment #2: Digital signature --] [-- Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Workflow: split repository? 2007-10-12 14:30 ` Jan Hudec @ 2007-10-12 14:57 ` Jan Wielemaker 0 siblings, 0 replies; 3+ messages in thread From: Jan Wielemaker @ 2007-10-12 14:57 UTC (permalink / raw) To: Jan Hudec; +Cc: Git Mailing List Hi Jan, On Friday 12 October 2007 16:30, Jan Hudec wrote: > On Fri, Oct 12, 2007 at 14:21:39 +0200, Jan Wielemaker wrote: > > Does this make sense? > > It might make more sense to convert bit by bit, to separate git > repositories. Would save you some git-filter-branch work. > > Is the only way to create a GIT repositiory right away from a subset of > > the CVS for which we want to preserve the history? > > No, it's not. It will save you work if you can do as much splitting as > possible during the conversion, ie. convert the bits you know will be > separate separately (and combine them using submodules as appropriate). > > But if you have bits that will take a lot of work to factor out, you can > convert to git, make the other code ready to use a submodule and than use > git-filter-branch to extract the right bits of history for the submodule. Thanks! git-filter-branch looks a bit overwhelming, but I think I can manage :-) I'll do the simple things in separate conversions. Cheers --- Jan ^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2007-10-12 15:03 UTC | newest] Thread overview: 3+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2007-10-12 12:21 Workflow: split repository? Jan Wielemaker 2007-10-12 14:30 ` Jan Hudec 2007-10-12 14:57 ` Jan Wielemaker
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).