* Workflow: split repository?
@ 2007-10-12 12:21 Jan Wielemaker
2007-10-12 14:30 ` Jan Hudec
0 siblings, 1 reply; 3+ messages in thread
From: Jan Wielemaker @ 2007-10-12 12:21 UTC (permalink / raw)
To: Git Mailing List
Hi,
I've got a big active project, until yesterday managed using CVS. As
with any distributed academic research project the repository has become
a nice mess where most files are in the wrong place and there are
several almost independent sub-projects living in directories.
The plan is/was to
* Convert everything to GIT (done, through cvs2svn)
* Everyone keeps hacking on their bits, while one is starting
to reorganise the structure by moving files and directories
and changing import headers, and other file references in
a GIT branch.
* Now we merge the continued work and the reorganisation to
end up with a nice clean hierarchy :-)
* Split the big project into multiple projects. One of the
reasons is that we want to make part of them public. Others
we cannot publish as they contain copyrighted data. I understand
we can reunite them using GIT sub modules.
Does this make sense?
While splitting we want to *loose* history information for some of the
projects. That is easy: simply create a new repository from the current
files. For some however we would like to *preserve* the history. This
means we would like to pick a hierarchy with its history. After quite
a bit of reading, I get the impression this cannot be done. Am I right?
Is the only way to create a GIT repositiory right away from a subset of
the CVS for which we want to preserve the history?
Thanks --- Jan
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Workflow: split repository?
2007-10-12 12:21 Workflow: split repository? Jan Wielemaker
@ 2007-10-12 14:30 ` Jan Hudec
2007-10-12 14:57 ` Jan Wielemaker
0 siblings, 1 reply; 3+ messages in thread
From: Jan Hudec @ 2007-10-12 14:30 UTC (permalink / raw)
To: Jan Wielemaker; +Cc: Git Mailing List
[-- Attachment #1: Type: text/plain, Size: 2633 bytes --]
On Fri, Oct 12, 2007 at 14:21:39 +0200, Jan Wielemaker wrote:
> Hi,
>
> I've got a big active project, until yesterday managed using CVS. As
> with any distributed academic research project the repository has become
> a nice mess where most files are in the wrong place and there are
> several almost independent sub-projects living in directories.
>
> The plan is/was to
>
> * Convert everything to GIT (done, through cvs2svn)
> * Everyone keeps hacking on their bits, while one is starting
> to reorganise the structure by moving files and directories
> and changing import headers, and other file references in
> a GIT branch.
> * Now we merge the continued work and the reorganisation to
> end up with a nice clean hierarchy :-)
> * Split the big project into multiple projects. One of the
> reasons is that we want to make part of them public. Others
> we cannot publish as they contain copyrighted data. I understand
> we can reunite them using GIT sub modules.
>
> Does this make sense?
It might make more sense to convert bit by bit, to separate git repositories.
Would save you some git-filter-branch work.
> While splitting we want to *loose* history information for some of the
> projects. That is easy: simply create a new repository from the current
> files. For some however we would like to *preserve* the history. This
> means we would like to pick a hierarchy with its history. After quite
> a bit of reading, I get the impression this cannot be done. Am I right?
It can, but you have to be aware of the pitfalls. Git allows you to create
a new history, which is defined modification of the original history. There
is git-filter-branch command, that can create a repository with just
a subtree and such. But it's a new, independent, history. You can't merge
between the old and new one (but you can rebase the few commits someone made
while you were converting) and anyone who has the old history in his repo
will still have it.
> Is the only way to create a GIT repositiory right away from a subset of
> the CVS for which we want to preserve the history?
No, it's not. It will save you work if you can do as much splitting as
possible during the conversion, ie. convert the bits you know will be
separate separately (and combine them using submodules as appropriate).
But if you have bits that will take a lot of work to factor out, you can
convert to git, make the other code ready to use a submodule and than use
git-filter-branch to extract the right bits of history for the submodule.
--
Jan 'Bulb' Hudec <bulb@ucw.cz>
[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 189 bytes --]
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Workflow: split repository?
2007-10-12 14:30 ` Jan Hudec
@ 2007-10-12 14:57 ` Jan Wielemaker
0 siblings, 0 replies; 3+ messages in thread
From: Jan Wielemaker @ 2007-10-12 14:57 UTC (permalink / raw)
To: Jan Hudec; +Cc: Git Mailing List
Hi Jan,
On Friday 12 October 2007 16:30, Jan Hudec wrote:
> On Fri, Oct 12, 2007 at 14:21:39 +0200, Jan Wielemaker wrote:
> > Does this make sense?
>
> It might make more sense to convert bit by bit, to separate git
> repositories. Would save you some git-filter-branch work.
> > Is the only way to create a GIT repositiory right away from a subset of
> > the CVS for which we want to preserve the history?
>
> No, it's not. It will save you work if you can do as much splitting as
> possible during the conversion, ie. convert the bits you know will be
> separate separately (and combine them using submodules as appropriate).
>
> But if you have bits that will take a lot of work to factor out, you can
> convert to git, make the other code ready to use a submodule and than use
> git-filter-branch to extract the right bits of history for the submodule.
Thanks! git-filter-branch looks a bit overwhelming, but I think I can
manage :-) I'll do the simple things in separate conversions.
Cheers --- Jan
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2007-10-12 15:03 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-10-12 12:21 Workflow: split repository? Jan Wielemaker
2007-10-12 14:30 ` Jan Hudec
2007-10-12 14:57 ` Jan Wielemaker
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).