* [GSOC] Git Configuration API improvements @ 2014-03-11 13:12 karthik nayak 2014-03-11 14:51 ` Matthieu Moy 0 siblings, 1 reply; 5+ messages in thread From: karthik nayak @ 2014-03-11 13:12 UTC (permalink / raw) To: Git List; +Cc: Jeff King, mhagger, Matthieu.Moy Hello to all, I'm Karthik Nayak, a Computer Science student from Bangalore India. I will be applying for GSOC 2014. This is my first time applying for GSOC. I have been using Git for a long time now, so it would be an ideal organisation for me to contribute to. As suggested I completed the Micro Project under the guidance of Eric Sunshine.[1] I went through the Idea's page[2] and the mails corresponding to the topic[3]. Currently we have multiple invocation of git_config() in an individual invocation of git() which is not efficient. Also, it is hard to implement new features, as mentioned -- such as allowing a configuration to be unset. The basic idea is to use a data structure to store the config, the first time the git_config() is called. And refer to this each and every time we invoke git_config() further. Jeff suggested [4] to use a tree or a mapping of keys to values which are stored in a string. I think that the idea of storing using the config as a tree data structure would be really advantageous. This would allow us to easily implement modifications later, and can help to easily take care of duplicates being created on multiple usage of config set and unset. As, whenever a node's children have been deleted the node can also be automatically deleted. The tree can be a struct with values, header link and link to other configs. This way we can also create functions to work on the tree easily. Would be great to hear your thoughts on this topic and also I plan to start creating a proposal. Nice to have any suggestions or feedback of any kind. Thank you, Karthik [1] : http://article.gmane.org/gmane.comp.version-control.git/243695/match=karthik+188+gmail+com [2] : http://git.github.io/SoC-2014-Ideas.html [3] : http://article.gmane.org/gmane.comp.version-control.git/243500/match=git+configuration+caching [4] : http://article.gmane.org/gmane.comp.version-control.git/243542 ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [GSOC] Git Configuration API improvements 2014-03-11 13:12 [GSOC] Git Configuration API improvements karthik nayak @ 2014-03-11 14:51 ` Matthieu Moy 2014-03-11 16:19 ` karthik nayak 0 siblings, 1 reply; 5+ messages in thread From: Matthieu Moy @ 2014-03-11 14:51 UTC (permalink / raw) To: karthik nayak; +Cc: Git List, Jeff King, mhagger karthik nayak <karthik.188@gmail.com> writes: > Currently we have multiple invocation of git_config() in an > individual invocation of git() which is not efficient. Also, it is > hard to implement new features, I think efficiency is not a real concern here. Config files are small and easy to parse, so parsing them multiple times isn't really noticeable from the performance point of view. OTOH, the extensibility is a real concern, and that should be the main motivation for the project. -- Matthieu Moy http://www-verimag.imag.fr/~moy/ ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [GSOC] Git Configuration API improvements 2014-03-11 14:51 ` Matthieu Moy @ 2014-03-11 16:19 ` karthik nayak 2014-03-14 4:31 ` Jeff King 0 siblings, 1 reply; 5+ messages in thread From: karthik nayak @ 2014-03-11 16:19 UTC (permalink / raw) To: Matthieu Moy; +Cc: Git List, Jeff King, Michael Haggerty On Tue, Mar 11, 2014 at 8:21 PM, Matthieu Moy <Matthieu.Moy@grenoble-inp.fr> wrote: > karthik nayak <karthik.188@gmail.com> writes: > >> Currently we have multiple invocation of git_config() in an >> individual invocation of git() which is not efficient. Also, it is >> hard to implement new features, > > I think efficiency is not a real concern here. Config files are small > and easy to parse, so parsing them multiple times isn't really > noticeable from the performance point of view. > > OTOH, the extensibility is a real concern, and that should be the main > motivation for the project. > > -- > Matthieu Moy > http://www-verimag.imag.fr/~moy/ Hello Matthieu, Thanks. I understand what you mean. extensibility is the main motivation of the project, i think that by implementing the cache system we can fix the small problems (reappearing of headers while setting and unsetting configs) and also implement new features like to unset a config easily. ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [GSOC] Git Configuration API improvements 2014-03-11 16:19 ` karthik nayak @ 2014-03-14 4:31 ` Jeff King 2014-03-15 16:30 ` karthik nayak 0 siblings, 1 reply; 5+ messages in thread From: Jeff King @ 2014-03-14 4:31 UTC (permalink / raw) To: karthik nayak; +Cc: Matthieu Moy, Git List, Michael Haggerty On Tue, Mar 11, 2014 at 09:49:33PM +0530, karthik nayak wrote: > On Tue, Mar 11, 2014 at 8:21 PM, Matthieu Moy > <Matthieu.Moy@grenoble-inp.fr> wrote: > > karthik nayak <karthik.188@gmail.com> writes: > > > >> Currently we have multiple invocation of git_config() in an > >> individual invocation of git() which is not efficient. Also, it is > >> hard to implement new features, > > > > I think efficiency is not a real concern here. Config files are small > > and easy to parse, so parsing them multiple times isn't really > > noticeable from the performance point of view. > > > > OTOH, the extensibility is a real concern, and that should be the main > > motivation for the project. > > Thanks. I understand what you mean. extensibility is the main motivation of the > project, i think that by implementing the cache system we can fix the > small problems > (reappearing of headers while setting and unsetting configs) and also > implement new features > like to unset a config easily. I think the most interesting part of the config idea is turning the fetching of config variables "inside out". That is, right now we turn control over to the config code, which invokes our callbacks. So we see all variables in sequence, and pick out the ones that are interesting. We implement precedence with a "last one wins" technique where we keep overwriting a variable with subsequent config options. This can lead to difficult ordering situations, such as when a config option _might_ be respected based on another factor (e.g., the presence of a command line option, as in the status.branch option). It also means that it's impossible to tell after the fact whether a value was set explicitly on the command line, by config, or if we are using a system default. Most of the time this doesn't matter, but there are a few cases where we care, and we end up having to manually manage a separate flag in each case. By the phrase "inside out" above, I mean that we could read all config, and then fetch individual values as we need them. We do need to care about efficiency here, but only insofar as we don't cause any regressions (that is, the current system is fine speed-wise, we just need to make sure that we do not create new problems by calling a slow lookup in a tight loop or anything like that). -Peff ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [GSOC] Git Configuration API improvements 2014-03-14 4:31 ` Jeff King @ 2014-03-15 16:30 ` karthik nayak 0 siblings, 0 replies; 5+ messages in thread From: karthik nayak @ 2014-03-15 16:30 UTC (permalink / raw) To: Jeff King; +Cc: Matthieu Moy, Git List, Michael Haggerty On Fri, Mar 14, 2014 at 10:01 AM, Jeff King <peff@peff.net> wrote: > On Tue, Mar 11, 2014 at 09:49:33PM +0530, karthik nayak wrote: > >> On Tue, Mar 11, 2014 at 8:21 PM, Matthieu Moy >> <Matthieu.Moy@grenoble-inp.fr> wrote: >> > karthik nayak <karthik.188@gmail.com> writes: >> > >> >> Currently we have multiple invocation of git_config() in an >> >> individual invocation of git() which is not efficient. Also, it is >> >> hard to implement new features, >> > >> > I think efficiency is not a real concern here. Config files are small >> > and easy to parse, so parsing them multiple times isn't really >> > noticeable from the performance point of view. >> > >> > OTOH, the extensibility is a real concern, and that should be the main >> > motivation for the project. >> >> Thanks. I understand what you mean. extensibility is the main motivation of the >> project, i think that by implementing the cache system we can fix the >> small problems >> (reappearing of headers while setting and unsetting configs) and also >> implement new features >> like to unset a config easily. > > I think the most interesting part of the config idea is turning the > fetching of config variables "inside out". > > That is, right now we turn control over to the config code, which > invokes our callbacks. So we see all variables in sequence, and pick out > the ones that are interesting. We implement precedence with a "last one > wins" technique where we keep overwriting a variable with subsequent > config options. > > This can lead to difficult ordering situations, such as when a config > option _might_ be respected based on another factor (e.g., the presence > of a command line option, as in the status.branch option). > > It also means that it's impossible to tell after the fact whether a > value was set explicitly on the command line, by config, or if we are > using a system default. Most of the time this doesn't matter, but there > are a few cases where we care, and we end up having to manually manage > a separate flag in each case. > > By the phrase "inside out" above, I mean that we could read all config, > and then fetch individual values as we need them. We do need to care > about efficiency here, but only insofar as we don't cause any > regressions (that is, the current system is fine speed-wise, we just > need to make sure that we do not create new problems by calling a slow > lookup in a tight loop or anything like that). > > -Peff Hello Jeff, Like you said yes, this will be a complete "inside out" change. Thanks for summing it up, really helpful. Currently i am going through the code and understanding how it works currently. Simultaneously working on the proposal[1], would be great to have your feedback on that. Thanks, Karthik [1] https://gist.github.com/KarthikNayak/98569dd34326f7e6813a ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2014-03-15 16:30 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2014-03-11 13:12 [GSOC] Git Configuration API improvements karthik nayak 2014-03-11 14:51 ` Matthieu Moy 2014-03-11 16:19 ` karthik nayak 2014-03-14 4:31 ` Jeff King 2014-03-15 16:30 ` karthik nayak
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).