* [PATCH] Add unix domain socket support to HTTP transport.
@ 2024-02-21 9:14 Leslie Cheng via GitGitGadget
2024-02-21 22:09 ` Eric Wong
2024-02-23 1:58 ` [PATCH v2] " Leslie Cheng via GitGitGadget
0 siblings, 2 replies; 7+ messages in thread
From: Leslie Cheng via GitGitGadget @ 2024-02-21 9:14 UTC (permalink / raw)
To: git; +Cc: Leslie Cheng, Leslie Cheng
From: Leslie Cheng <leslie.cheng5@gmail.com>
This changeset introduces an `http.unixSocket` option so that users can
proxy their git over HTTP remotes to a unix domain socket. In terms of
why, since UDS are local and git already has a local protocol: some
corporate environments use a UDS to proxy requests to internal resources
(ie. source control), so this change would support those use-cases. This
proxy can occasionally be necessary to attach MFA tokens or client
certificates for CLI tools.
The implementation leverages `--unix-socket` option [0] via the
`CURLOPT_UNIX_SOCKET_PATH` flag available with libcurl [1].
`GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH` and `NO_UNIX_SOCKETS` were kept
separate so that we can spit out better error messages for users if git
was compiled with `NO_UNIX_SOCKETS`.
[0] https://curl.se/docs/manpage.html#--unix-socket
[1] https://curl.se/libcurl/c/CURLOPT_UNIX_SOCKET_PATH.html
Signed-off-by: Leslie Cheng <leslie@lc.fyi>
---
Add unix domain socket support to HTTP transport.
Published-As: https://github.com/gitgitgadget/git/releases/tag/pr-git-1681%2Flcfyi%2Flcfyi%2Fadd-unix-socket-support-v1
Fetch-It-Via: git fetch https://github.com/gitgitgadget/git pr-git-1681/lcfyi/lcfyi/add-unix-socket-support-v1
Pull-Request: https://github.com/git/git/pull/1681
Documentation/config/http.txt | 5 ++
git-curl-compat.h | 7 +++
http.c | 23 +++++++++
t/t5565-http-unix-domain-socket.sh | 80 ++++++++++++++++++++++++++++++
4 files changed, 115 insertions(+)
create mode 100755 t/t5565-http-unix-domain-socket.sh
diff --git a/Documentation/config/http.txt b/Documentation/config/http.txt
index 2d4e0c9b869..bf48cbd599a 100644
--- a/Documentation/config/http.txt
+++ b/Documentation/config/http.txt
@@ -277,6 +277,11 @@ http.followRedirects::
the base for the follow-up requests, this is generally
sufficient. The default is `initial`.
+http.unixSocket::
+ Connect through this Unix domain socket via HTTP, instead of using the
+ network. If set, this config takes precendence over `http.proxy` and
+ is incompatible with the proxy options (see `curl(1)`).
+
http.<url>.*::
Any of the http.* options above can be applied selectively to some URLs.
For a config key to match a URL, each element of the config key is
diff --git a/git-curl-compat.h b/git-curl-compat.h
index fd96b3cdffd..f0f3bec0e17 100644
--- a/git-curl-compat.h
+++ b/git-curl-compat.h
@@ -74,6 +74,13 @@
#define GIT_CURL_HAVE_CURLE_SSL_PINNEDPUBKEYNOTMATCH 1
#endif
+/**
+ * CURLOPT_UNIX_SOCKET_PATH was added in 7.40.0, released in January 2015.
+ */
+#if LIBCURL_VERSION_NUM >= 0x074000
+#define GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH 1
+#endif
+
/**
* CURL_HTTP_VERSION_2 was added in 7.43.0, released in June 2015.
*
diff --git a/http.c b/http.c
index e73b136e589..8cfdcaeac82 100644
--- a/http.c
+++ b/http.c
@@ -79,6 +79,9 @@ static const char *http_proxy_ssl_ca_info;
static struct credential proxy_cert_auth = CREDENTIAL_INIT;
static int proxy_ssl_cert_password_required;
+#if defined(GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH) && !defined(NO_UNIX_SOCKETS)
+static const char *curl_unix_socket_path;
+#endif
static struct {
const char *name;
long curlauth_param;
@@ -455,6 +458,20 @@ static int http_options(const char *var, const char *value,
return 0;
}
+ if (!strcmp("http.unixsocket", var)) {
+#ifdef GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH
+#ifndef NO_UNIX_SOCKETS
+ return git_config_string(&curl_unix_socket_path, var, value);
+#else
+ warning(_("Unix socket support unavailable in this build of Git"));
+ return 0;
+#endif
+#else
+ warning(_("Unix socket support is not supported with cURL < 7.40.0"));
+ return 0;
+#endif
+ }
+
if (!strcmp("http.cookiefile", var))
return git_config_pathname(&curl_cookie_file, var, value);
if (!strcmp("http.savecookies", var)) {
@@ -1203,6 +1220,12 @@ static CURL *get_curl_handle(void)
}
init_curl_proxy_auth(result);
+#if defined(GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH) && !defined(NO_UNIX_SOCKETS)
+ if (curl_unix_socket_path) {
+ curl_easy_setopt(result, CURLOPT_UNIX_SOCKET_PATH, curl_unix_socket_path);
+ }
+#endif
+
set_curl_keepalive(result);
return result;
diff --git a/t/t5565-http-unix-domain-socket.sh b/t/t5565-http-unix-domain-socket.sh
new file mode 100755
index 00000000000..4ebcdfaa515
--- /dev/null
+++ b/t/t5565-http-unix-domain-socket.sh
@@ -0,0 +1,80 @@
+#!/bin/sh
+
+test_description="test fetching through http via unix domain socket"
+
+. ./test-lib.sh
+. "$TEST_DIRECTORY"/lib-httpd.sh
+
+test -z "$NO_UNIX_SOCKETS" || {
+ skip_all='skipping http-unix-socket tests, unix sockets not available'
+ test_done
+}
+
+UDS_TO_TCP_FIFO=uds_to_tcp
+TCP_TO_UDS_FIFO=tcp_to_uds
+UDS_PID=
+TCP_PID=
+UDS_SOCKET="$(pwd)/uds.sock"
+UNRESOLVABLE_ENDPOINT=http://localhost:4242
+
+start_proxy_unix_to_tcp() {
+ local socket_path="$UDS_SOCKET"
+ local host=127.0.0.1
+ local port=$LIB_HTTPD_PORT
+
+ rm -f "$UDS_TO_TCP_FIFO"
+ rm -f "$TCP_TO_UDS_FIFO"
+ rm -f "$socket_path"
+ mkfifo "$UDS_TO_TCP_FIFO"
+ mkfifo "$TCP_TO_UDS_FIFO"
+ nc -klU "$socket_path" <tcp_to_uds >uds_to_tcp &
+ UDS_PID=$!
+
+ nc "$host" "$port" >tcp_to_uds <uds_to_tcp &
+ TCP_PID=$!
+
+ test_atexit 'stop_proxy_unix_to_tcp'
+}
+
+stop_proxy_unix_to_tcp() {
+ kill "$UDS_PID"
+ kill "$TCP_PID"
+ rm -f "$UDS_TO_TCP_FIFO"
+ rm -f "$TCP_TO_UDS_FIFO"
+}
+
+start_httpd
+start_proxy_unix_to_tcp
+
+test_expect_success 'setup repository' '
+ test_commit foo &&
+ git init --bare "$HTTPD_DOCUMENT_ROOT_PATH/repo.git" &&
+ git push --mirror "$HTTPD_DOCUMENT_ROOT_PATH/repo.git"
+'
+
+# sanity check that we can't clone normally
+test_expect_success 'cloning without UDS fails' '
+ test_must_fail git clone "$UNRESOLVABLE_ENDPOINT/smart/repo.git" clone
+'
+
+test_expect_success 'cloning with UDS succeeds' '
+ test_when_finished "rm -rf clone" &&
+ test_config_global http.unixsocket "$UDS_SOCKET" &&
+ git clone "$UNRESOLVABLE_ENDPOINT/smart/repo.git" clone
+'
+
+test_expect_success 'cloning with a non-existent http proxy fails' '
+ git clone $HTTPD_URL/smart/repo.git clone &&
+ rm -rf clone &&
+ test_config_global http.proxy 127.0.0.1:0 &&
+ test_must_fail git clone $HTTPD_URL/smart/repo.git clone
+'
+
+test_expect_success 'UDS socket takes precedence over http proxy' '
+ test_when_finished "rm -rf clone" &&
+ test_config_global http.proxy 127.0.0.1:0 &&
+ test_config_global http.unixsocket "$UDS_SOCKET" &&
+ git clone $HTTPD_URL/smart/repo.git clone
+'
+
+test_done
base-commit: 3e0d3cd5c7def4808247caf168e17f2bbf47892b
--
gitgitgadget
^ permalink raw reply related [flat|nested] 7+ messages in thread* Re: [PATCH] Add unix domain socket support to HTTP transport. 2024-02-21 9:14 [PATCH] Add unix domain socket support to HTTP transport Leslie Cheng via GitGitGadget @ 2024-02-21 22:09 ` Eric Wong 2024-02-22 3:04 ` Leslie Cheng 2024-02-23 1:58 ` [PATCH v2] " Leslie Cheng via GitGitGadget 1 sibling, 1 reply; 7+ messages in thread From: Eric Wong @ 2024-02-21 22:09 UTC (permalink / raw) To: Leslie Cheng via GitGitGadget; +Cc: git, Leslie Cheng, Leslie Cheng Leslie Cheng via GitGitGadget <gitgitgadget@gmail.com> wrote: > Subject: Re: [PATCH] Add unix domain socket support to HTTP transport. No need for trailing `.' in commit message titles <snip> > @@ -455,6 +458,20 @@ static int http_options(const char *var, const char *value, > return 0; > } > > + if (!strcmp("http.unixsocket", var)) { > +#ifdef GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH > +#ifndef NO_UNIX_SOCKETS > + return git_config_string(&curl_unix_socket_path, var, value); > +#else > + warning(_("Unix socket support unavailable in this build of Git")); > + return 0; > +#endif > +#else > + warning(_("Unix socket support is not supported with cURL < 7.40.0")); > + return 0; > +#endif > + } Personally, I'd hoist the #ifdef part into a standalone function since I find mixing CPP and C conditionals confusing. disclaimer: I'm an easily confused person and don't usually program in C, though. <snip> > --- /dev/null > +++ b/t/t5565-http-unix-domain-socket.sh <snip> > + nc -klU "$socket_path" <tcp_to_uds >uds_to_tcp & > + UDS_PID=$! > + > + nc "$host" "$port" >tcp_to_uds <uds_to_tcp & `nc' isn't widely installed, its supported flags vary between implementations, and our test suite doesn't currently use it. I suggest either using a small bit of Perl or writing a t/helper program to do its job. Finally, hard tabs should be used for indentation throughout. I'll wait on others to comment since I haven't looked at git hacking in a while. Anyways, I think this feature could be useful for me, too :> Thanks. ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH] Add unix domain socket support to HTTP transport. 2024-02-21 22:09 ` Eric Wong @ 2024-02-22 3:04 ` Leslie Cheng 0 siblings, 0 replies; 7+ messages in thread From: Leslie Cheng @ 2024-02-22 3:04 UTC (permalink / raw) To: Eric Wong, Leslie Cheng via GitGitGadget; +Cc: git, Leslie Cheng > No need for trailing `.' in commit message titles Will fix in the next patch, sorry! > Personally, I'd hoist the #ifdef part into a standalone function > since I find mixing CPP and C conditionals confusing. > > disclaimer: I'm an easily confused person and don't usually > program in C, though. I considered extracting it out, but the other conditionals in this function follow a similar pattern so I didn't want to change it. However, my use here is also the first time there's both an #ifdef and nested #ifndef, which I agree makes it a bit confusing to grok. I'm open to changing it, but I'll let it sit and marinate for a bit. > `nc' isn't widely installed, its supported flags vary between > implementations, and our test suite doesn't currently use it. > I suggest either using a small bit of Perl or writing a t/helper > program to do its job. > > Finally, hard tabs should be used for indentation throughout. > > I'll wait on others to comment since I haven't looked at git > hacking in a while. > > Anyways, I think this feature could be useful for me, too :> > Thanks. Good catch, I'll fix in the next patch. I've subbed `nc` out for a simple Perl script to pipe back and forth, just making sure CI is happy about this before submitting. ^ permalink raw reply [flat|nested] 7+ messages in thread
* [PATCH v2] Add unix domain socket support to HTTP transport 2024-02-21 9:14 [PATCH] Add unix domain socket support to HTTP transport Leslie Cheng via GitGitGadget 2024-02-21 22:09 ` Eric Wong @ 2024-02-23 1:58 ` Leslie Cheng via GitGitGadget 2024-02-23 8:37 ` Junio C Hamano 1 sibling, 1 reply; 7+ messages in thread From: Leslie Cheng via GitGitGadget @ 2024-02-23 1:58 UTC (permalink / raw) To: git; +Cc: Eric Wong, Leslie Cheng, Leslie Cheng, Leslie Cheng From: Leslie Cheng <leslie.cheng5@gmail.com> This changeset introduces an `http.unixSocket` option so that users can proxy their git over HTTP remotes to a unix domain socket. In terms of why, since UDS are local and git already has a local protocol: some corporate environments use a UDS to proxy requests to internal resources (ie. source control), so this change would support those use-cases. This proxy can occasionally be necessary to attach MFA tokens or client certificates for CLI tools. The implementation leverages `--unix-socket` option [0] via the `CURLOPT_UNIX_SOCKET_PATH` flag available with libcurl [1]. `GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH` and `NO_UNIX_SOCKETS` were kept separate so that we can spit out better error messages for users if git was compiled with `NO_UNIX_SOCKETS`. [0] https://curl.se/docs/manpage.html#--unix-socket [1] https://curl.se/libcurl/c/CURLOPT_UNIX_SOCKET_PATH.html Signed-off-by: Leslie Cheng <leslie@lc.fyi> --- Add unix domain socket support to HTTP transport Changes since v1: * Updated test to use Perl instead of nc to proxy between UDS and TCP socket; I chose not to split this out into a library since its use is hyper-specific and has a dependency on lib-httpd.sh Published-As: https://github.com/gitgitgadget/git/releases/tag/pr-git-1681%2Flcfyi%2Flcfyi%2Fadd-unix-socket-support-v2 Fetch-It-Via: git fetch https://github.com/gitgitgadget/git pr-git-1681/lcfyi/lcfyi/add-unix-socket-support-v2 Pull-Request: https://github.com/git/git/pull/1681 Range-diff vs v1: 1: 3e531632329 ! 1: 2af5cc8089b Add unix domain socket support to HTTP transport. @@ Metadata Author: Leslie Cheng <leslie.cheng5@gmail.com> ## Commit message ## - Add unix domain socket support to HTTP transport. + Add unix domain socket support to HTTP transport This changeset introduces an `http.unixSocket` option so that users can proxy their git over HTTP remotes to a unix domain socket. In terms of @@ t/t5565-http-unix-domain-socket.sh (new) + test_done +} + -+UDS_TO_TCP_FIFO=uds_to_tcp -+TCP_TO_UDS_FIFO=tcp_to_uds -+UDS_PID= -+TCP_PID= ++if ! test_have_prereq PERL ++then ++ skip_all='skipping http-unix-socket tests; perl not available' ++ test_done ++fi ++ ++SOCKET_PROXY_PIDFILE="$(pwd)/proxy.pid" +UDS_SOCKET="$(pwd)/uds.sock" -+UNRESOLVABLE_ENDPOINT=http://localhost:4242 ++UNRESOLVABLE_ENDPOINT=http://unresolved + +start_proxy_unix_to_tcp() { -+ local socket_path="$UDS_SOCKET" -+ local host=127.0.0.1 -+ local port=$LIB_HTTPD_PORT -+ -+ rm -f "$UDS_TO_TCP_FIFO" -+ rm -f "$TCP_TO_UDS_FIFO" -+ rm -f "$socket_path" -+ mkfifo "$UDS_TO_TCP_FIFO" -+ mkfifo "$TCP_TO_UDS_FIFO" -+ nc -klU "$socket_path" <tcp_to_uds >uds_to_tcp & -+ UDS_PID=$! -+ -+ nc "$host" "$port" >tcp_to_uds <uds_to_tcp & -+ TCP_PID=$! -+ -+ test_atexit 'stop_proxy_unix_to_tcp' ++ test_atexit 'stop_proxy_unix_to_tcp' ++ ++ perl -Mstrict -MIO::Select -MIO::Socket::INET -MIO::Socket::UNIX -e ' ++ my $uds_path = $ARGV[0]; ++ my $host = $ARGV[1]; ++ my $port = $ARGV[2]; ++ my $pidfile = $ARGV[3]; ++ ++ open(my $fh, ">", $pidfile) or die "failed to create pidfile"; ++ print $fh "$$"; ++ close($fh); ++ ++ my $uds = IO::Socket::UNIX->new( ++ Local => $uds_path, ++ Type => SOCK_STREAM, ++ Listen => 5, ++ ) or die "failed to create unix domain socket"; ++ ++ while (my $conn = $uds->accept()) { ++ my $tcp_client = IO::Socket::INET->new( ++ PeerAddr => $host, ++ PeerPort => $port, ++ Proto => "tcp", ++ ) or die "failed to create TCP socket"; ++ ++ my $sel = IO::Select->new($conn, $tcp_client); ++ ++ while (my @ready = $sel->can_read(10)) { ++ foreach my $socket (@ready) { ++ my $other = ($socket == $conn) ? $tcp_client : $conn; ++ my $data; ++ my $bytes = $socket->sysread($data, 4096); ++ ++ if ($bytes) { ++ $other->syswrite($data, $bytes); ++ } else { ++ $socket->close(); ++ } ++ } ++ } ++ } ++ ' "$UDS_SOCKET" "127.0.0.1" "$LIB_HTTPD_PORT" "$SOCKET_PROXY_PIDFILE" & ++ SOCKET_PROXY_PID=$! +} + +stop_proxy_unix_to_tcp() { -+ kill "$UDS_PID" -+ kill "$TCP_PID" -+ rm -f "$UDS_TO_TCP_FIFO" -+ rm -f "$TCP_TO_UDS_FIFO" ++ kill -9 "$(cat "$SOCKET_PROXY_PIDFILE")" ++ rm -f "$SOCKET_PROXY_PIDFILE" ++ rm -f "$UDS_SOCKET" +} + +start_httpd @@ t/t5565-http-unix-domain-socket.sh (new) + +# sanity check that we can't clone normally +test_expect_success 'cloning without UDS fails' ' -+ test_must_fail git clone "$UNRESOLVABLE_ENDPOINT/smart/repo.git" clone ++ test_must_fail git clone "$UNRESOLVABLE_ENDPOINT/smart/repo.git" clone +' + +test_expect_success 'cloning with UDS succeeds' ' -+ test_when_finished "rm -rf clone" && ++ test_when_finished "rm -rf clone" && + test_config_global http.unixsocket "$UDS_SOCKET" && + git clone "$UNRESOLVABLE_ENDPOINT/smart/repo.git" clone +' + +test_expect_success 'cloning with a non-existent http proxy fails' ' -+ git clone $HTTPD_URL/smart/repo.git clone && -+ rm -rf clone && -+ test_config_global http.proxy 127.0.0.1:0 && -+ test_must_fail git clone $HTTPD_URL/smart/repo.git clone ++ git clone $HTTPD_URL/smart/repo.git clone && ++ rm -rf clone && ++ test_config_global http.proxy 127.0.0.1:0 && ++ test_must_fail git clone $HTTPD_URL/smart/repo.git clone +' + +test_expect_success 'UDS socket takes precedence over http proxy' ' -+ test_when_finished "rm -rf clone" && -+ test_config_global http.proxy 127.0.0.1:0 && -+ test_config_global http.unixsocket "$UDS_SOCKET" && -+ git clone $HTTPD_URL/smart/repo.git clone ++ test_when_finished "rm -rf clone" && ++ test_config_global http.proxy 127.0.0.1:0 && ++ test_config_global http.unixsocket "$UDS_SOCKET" && ++ git clone $HTTPD_URL/smart/repo.git clone +' + +test_done Documentation/config/http.txt | 5 ++ git-curl-compat.h | 7 ++ http.c | 23 ++++++ t/t5565-http-unix-domain-socket.sh | 109 +++++++++++++++++++++++++++++ 4 files changed, 144 insertions(+) create mode 100755 t/t5565-http-unix-domain-socket.sh diff --git a/Documentation/config/http.txt b/Documentation/config/http.txt index 2d4e0c9b869..bf48cbd599a 100644 --- a/Documentation/config/http.txt +++ b/Documentation/config/http.txt @@ -277,6 +277,11 @@ http.followRedirects:: the base for the follow-up requests, this is generally sufficient. The default is `initial`. +http.unixSocket:: + Connect through this Unix domain socket via HTTP, instead of using the + network. If set, this config takes precendence over `http.proxy` and + is incompatible with the proxy options (see `curl(1)`). + http.<url>.*:: Any of the http.* options above can be applied selectively to some URLs. For a config key to match a URL, each element of the config key is diff --git a/git-curl-compat.h b/git-curl-compat.h index fd96b3cdffd..f0f3bec0e17 100644 --- a/git-curl-compat.h +++ b/git-curl-compat.h @@ -74,6 +74,13 @@ #define GIT_CURL_HAVE_CURLE_SSL_PINNEDPUBKEYNOTMATCH 1 #endif +/** + * CURLOPT_UNIX_SOCKET_PATH was added in 7.40.0, released in January 2015. + */ +#if LIBCURL_VERSION_NUM >= 0x074000 +#define GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH 1 +#endif + /** * CURL_HTTP_VERSION_2 was added in 7.43.0, released in June 2015. * diff --git a/http.c b/http.c index e73b136e589..8cfdcaeac82 100644 --- a/http.c +++ b/http.c @@ -79,6 +79,9 @@ static const char *http_proxy_ssl_ca_info; static struct credential proxy_cert_auth = CREDENTIAL_INIT; static int proxy_ssl_cert_password_required; +#if defined(GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH) && !defined(NO_UNIX_SOCKETS) +static const char *curl_unix_socket_path; +#endif static struct { const char *name; long curlauth_param; @@ -455,6 +458,20 @@ static int http_options(const char *var, const char *value, return 0; } + if (!strcmp("http.unixsocket", var)) { +#ifdef GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH +#ifndef NO_UNIX_SOCKETS + return git_config_string(&curl_unix_socket_path, var, value); +#else + warning(_("Unix socket support unavailable in this build of Git")); + return 0; +#endif +#else + warning(_("Unix socket support is not supported with cURL < 7.40.0")); + return 0; +#endif + } + if (!strcmp("http.cookiefile", var)) return git_config_pathname(&curl_cookie_file, var, value); if (!strcmp("http.savecookies", var)) { @@ -1203,6 +1220,12 @@ static CURL *get_curl_handle(void) } init_curl_proxy_auth(result); +#if defined(GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH) && !defined(NO_UNIX_SOCKETS) + if (curl_unix_socket_path) { + curl_easy_setopt(result, CURLOPT_UNIX_SOCKET_PATH, curl_unix_socket_path); + } +#endif + set_curl_keepalive(result); return result; diff --git a/t/t5565-http-unix-domain-socket.sh b/t/t5565-http-unix-domain-socket.sh new file mode 100755 index 00000000000..2f9c53ab14f --- /dev/null +++ b/t/t5565-http-unix-domain-socket.sh @@ -0,0 +1,109 @@ +#!/bin/sh + +test_description="test fetching through http via unix domain socket" + +. ./test-lib.sh +. "$TEST_DIRECTORY"/lib-httpd.sh + +test -z "$NO_UNIX_SOCKETS" || { + skip_all='skipping http-unix-socket tests, unix sockets not available' + test_done +} + +if ! test_have_prereq PERL +then + skip_all='skipping http-unix-socket tests; perl not available' + test_done +fi + +SOCKET_PROXY_PIDFILE="$(pwd)/proxy.pid" +UDS_SOCKET="$(pwd)/uds.sock" +UNRESOLVABLE_ENDPOINT=http://unresolved + +start_proxy_unix_to_tcp() { + test_atexit 'stop_proxy_unix_to_tcp' + + perl -Mstrict -MIO::Select -MIO::Socket::INET -MIO::Socket::UNIX -e ' + my $uds_path = $ARGV[0]; + my $host = $ARGV[1]; + my $port = $ARGV[2]; + my $pidfile = $ARGV[3]; + + open(my $fh, ">", $pidfile) or die "failed to create pidfile"; + print $fh "$$"; + close($fh); + + my $uds = IO::Socket::UNIX->new( + Local => $uds_path, + Type => SOCK_STREAM, + Listen => 5, + ) or die "failed to create unix domain socket"; + + while (my $conn = $uds->accept()) { + my $tcp_client = IO::Socket::INET->new( + PeerAddr => $host, + PeerPort => $port, + Proto => "tcp", + ) or die "failed to create TCP socket"; + + my $sel = IO::Select->new($conn, $tcp_client); + + while (my @ready = $sel->can_read(10)) { + foreach my $socket (@ready) { + my $other = ($socket == $conn) ? $tcp_client : $conn; + my $data; + my $bytes = $socket->sysread($data, 4096); + + if ($bytes) { + $other->syswrite($data, $bytes); + } else { + $socket->close(); + } + } + } + } + ' "$UDS_SOCKET" "127.0.0.1" "$LIB_HTTPD_PORT" "$SOCKET_PROXY_PIDFILE" & + SOCKET_PROXY_PID=$! +} + +stop_proxy_unix_to_tcp() { + kill -9 "$(cat "$SOCKET_PROXY_PIDFILE")" + rm -f "$SOCKET_PROXY_PIDFILE" + rm -f "$UDS_SOCKET" +} + +start_httpd +start_proxy_unix_to_tcp + +test_expect_success 'setup repository' ' + test_commit foo && + git init --bare "$HTTPD_DOCUMENT_ROOT_PATH/repo.git" && + git push --mirror "$HTTPD_DOCUMENT_ROOT_PATH/repo.git" +' + +# sanity check that we can't clone normally +test_expect_success 'cloning without UDS fails' ' + test_must_fail git clone "$UNRESOLVABLE_ENDPOINT/smart/repo.git" clone +' + +test_expect_success 'cloning with UDS succeeds' ' + test_when_finished "rm -rf clone" && + test_config_global http.unixsocket "$UDS_SOCKET" && + git clone "$UNRESOLVABLE_ENDPOINT/smart/repo.git" clone +' + +test_expect_success 'cloning with a non-existent http proxy fails' ' + git clone $HTTPD_URL/smart/repo.git clone && + rm -rf clone && + test_config_global http.proxy 127.0.0.1:0 && + test_must_fail git clone $HTTPD_URL/smart/repo.git clone +' + +test_expect_success 'UDS socket takes precedence over http proxy' ' + test_when_finished "rm -rf clone" && + test_config_global http.proxy 127.0.0.1:0 && + test_config_global http.unixsocket "$UDS_SOCKET" && + git clone $HTTPD_URL/smart/repo.git clone +' + +test_done base-commit: 3e0d3cd5c7def4808247caf168e17f2bbf47892b -- gitgitgadget ^ permalink raw reply related [flat|nested] 7+ messages in thread
* Re: [PATCH v2] Add unix domain socket support to HTTP transport 2024-02-23 1:58 ` [PATCH v2] " Leslie Cheng via GitGitGadget @ 2024-02-23 8:37 ` Junio C Hamano 2024-02-23 15:43 ` Junio C Hamano 0 siblings, 1 reply; 7+ messages in thread From: Junio C Hamano @ 2024-02-23 8:37 UTC (permalink / raw) To: Leslie Cheng via GitGitGadget; +Cc: git, Eric Wong, Leslie Cheng, Leslie Cheng "Leslie Cheng via GitGitGadget" <gitgitgadget@gmail.com> writes: > Subject: Re: [PATCH v2] Add unix domain socket support to HTTP transport Perhaps Subject: [PATCH] http: enable proxying via unix-domain socket to follow the usual "<area>: <description>" format? > From: Leslie Cheng <leslie.cheng5@gmail.com> > > This changeset introduces an `http.unixSocket` option so that users can "This changeset introduces" -> "Introduce". There may be other gotchas that might use help from Documentation/SubmittingPatches, but I didn't read too carefully. Besides, it is a single patch, not a set of changes ;-). `http.unixSocket` is a configuration variable. It may be confusing to use the word "option". Speaking of options, shouldn't there be a command line option that overrides the configured value? We should honor the usual http.<url>.VARIABLE convention where http.<url>.VARIABLE that is destination-specific overrides a more generic http.VARIABLE configuration variable. > proxy their git over HTTP remotes to a unix domain socket. In terms of > why, since UDS are local and git already has a local protocol: some > corporate environments use a UDS to proxy requests to internal resources > (ie. source control), so this change would support those use-cases. This "ie." -> "i.e.,"? > proxy can occasionally be necessary to attach MFA tokens or client > certificates for CLI tools. > > The implementation leverages `--unix-socket` option [0] via the > `CURLOPT_UNIX_SOCKET_PATH` flag available with libcurl [1]. There is a feature in libcURL library, that is enabled by setting the CURLOPT_UNIX_SOCKET_PATH option via the curl_easy_setopt() call, and their command line utility. You do the same to implement this feature. But when you are not adding "--unix-socket" option to any of our commands, mention of that option name makes it more confusing than necessary. The usual way to compose a log message of this project is to - Give an observation on how the current system work in the present tense (so no need to say "Currently X is Y", just "X is Y"), and discuss what you perceive as a problem in it. - Propose a solution (optional---often, problem description trivially leads to an obvious solution in reader's minds). - Give commands to the codebase to "become like so". in this order. How about following that convention, perhaps like: In some corporate environments, the proxy server listens to a local unix domain socket for requests, instead of listening to a network port. Even though we have http.proxy (and more destination specific http.<url>.proxy) configuration variables to specify the network address/port of a proxy, that would not help if your proxy does not listen to the network. Introduce an `http.unixSocket` (and `http.<url>.unixSocket`) configuration variables that specify the path to a unix domain socket for such a proxy. Recent versions of libcURL library added CURLOPT_UNIX_SOCKET_PATH to support "curl --unix-socket <path>"---use the same mechanism to implement it. > `GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH` and `NO_UNIX_SOCKETS` were kept > separate so that we can spit out better error messages for users if git > was compiled with `NO_UNIX_SOCKETS`. Unlike NO_UNIX_SOCKETS, GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH is entirely internal to your implementation and not surfaced to neither the end-users or the binary packagers. Because of that, I suspect that any description that has to use that name probably falls on the other side of "too much implementation details" to be useful to help future developers.. Besides, I suspect that GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH might not be the optimum approach. See below. > diff --git a/Documentation/config/http.txt b/Documentation/config/http.txt > index 2d4e0c9b869..bf48cbd599a 100644 > --- a/Documentation/config/http.txt > +++ b/Documentation/config/http.txt > @@ -277,6 +277,11 @@ http.followRedirects:: > the base for the follow-up requests, this is generally > sufficient. The default is `initial`. > > +http.unixSocket:: > + Connect through this Unix domain socket via HTTP, instead of using the > + network. If set, this config takes precendence over `http.proxy` and > + is incompatible with the proxy options (see `curl(1)`). Talking about precedence between this and http.proxy is good thing, but one very important piece of information is missing. What value does it take? The absolute path of a unix-domain socket to pass the HTTP traffic over, instead of using the network. or something, perhaps? > diff --git a/git-curl-compat.h b/git-curl-compat.h > index fd96b3cdffd..f0f3bec0e17 100644 > --- a/git-curl-compat.h > +++ b/git-curl-compat.h > @@ -74,6 +74,13 @@ > #define GIT_CURL_HAVE_CURLE_SSL_PINNEDPUBKEYNOTMATCH 1 > #endif > > +/** > + * CURLOPT_UNIX_SOCKET_PATH was added in 7.40.0, released in January 2015. > + */ > +#if LIBCURL_VERSION_NUM >= 0x074000 > +#define GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH 1 > +#endif The "HAVE" part in GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH is a statement of a fact. If the version of cURL library we have is certain value, we have it. OK. > diff --git a/http.c b/http.c > index e73b136e589..8cfdcaeac82 100644 > --- a/http.c > +++ b/http.c > @@ -79,6 +79,9 @@ static const char *http_proxy_ssl_ca_info; > static struct credential proxy_cert_auth = CREDENTIAL_INIT; > static int proxy_ssl_cert_password_required; It might make the code easier to follow if you did: #if !defined(NO_CURLOPT_UNIX_SOCKET_PATH) && !defined(NO_UNIX_SOCKETS) #if defined(GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH) #define USE_CURLOPT_UNIX_SOCKET_PATH #endif #endif The points are (1) the users can decline to use CURLOPT_UNIX_SOCKET_PATH while still using unix domain socket for other purposes, and (2) you do not have to care if you HAVE it or not most of time; what matters more often is if the user told you to USE it. Hmm? > +#if defined(GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH) && !defined(NO_UNIX_SOCKETS) > +static const char *curl_unix_socket_path; > +#endif The guard here would become "#ifdef USE_CURLOPT_UNIX_SOCKET_PATH" if we wanted this to be conditional, but I think it is easier to make the variable unconditionally available; see below. > @@ -455,6 +458,20 @@ static int http_options(const char *var, const char *value, > return 0; > } > > + if (!strcmp("http.unixsocket", var)) { > +#ifdef GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH > +#ifndef NO_UNIX_SOCKETS > + return git_config_string(&curl_unix_socket_path, var, value); > +#else > + warning(_("Unix socket support unavailable in this build of Git")); > + return 0; > +#endif > +#else > + warning(_("Unix socket support is not supported with cURL < 7.40.0")); > + return 0; > +#endif > + } In general, it is inadvisable to issue a warning in the codepath that parses configuration variables, as the values we read may not be necessarily used. We could instead accept the given path into a variable unconditionally, and complain only before it gets used, near the call to curl_easy_setopt(). > if (!strcmp("http.cookiefile", var)) > return git_config_pathname(&curl_cookie_file, var, value); > if (!strcmp("http.savecookies", var)) { > @@ -1203,6 +1220,12 @@ static CURL *get_curl_handle(void) > } > init_curl_proxy_auth(result); > > +#if defined(GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH) && !defined(NO_UNIX_SOCKETS) > + if (curl_unix_socket_path) { > + curl_easy_setopt(result, CURLOPT_UNIX_SOCKET_PATH, curl_unix_socket_path); > + } > +#endif Here, the guard may become more like if (curl_unix_socket_path) { #ifdef USE_CURLOPT_UNIX_SOCKET_PATH curl_easy_setopt(...); #elif defined(NO_CURLOPT_UNIX_SOCKET_PATH) || defined(NO_UNIX_SOCKETS) warning(_("this build disables the unix-domain-socket feature")); #elif warning(_("your cURL library is too old")); #endif } ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v2] Add unix domain socket support to HTTP transport 2024-02-23 8:37 ` Junio C Hamano @ 2024-02-23 15:43 ` Junio C Hamano 2024-02-23 22:24 ` Leslie Cheng 0 siblings, 1 reply; 7+ messages in thread From: Junio C Hamano @ 2024-02-23 15:43 UTC (permalink / raw) To: Leslie Cheng via GitGitGadget; +Cc: git, Eric Wong, Leslie Cheng, Leslie Cheng Junio C Hamano <gitster@pobox.com> writes: > "Leslie Cheng via GitGitGadget" <gitgitgadget@gmail.com> writes: > >> Subject: Re: [PATCH v2] Add unix domain socket support to HTTP transport > > Perhaps > > Subject: [PATCH] http: enable proxying via unix-domain socket > > to follow the usual "<area>: <description>" format? > >> From: Leslie Cheng <leslie.cheng5@gmail.com> >> >> This changeset introduces an `http.unixSocket` option so that users can > > "This changeset introduces" -> "Introduce". There may be other > gotchas that might use help from Documentation/SubmittingPatches, > but I didn't read too carefully. > > Besides, it is a single patch, not a set of changes ;-). > > `http.unixSocket` is a configuration variable. It may be confusing > to use the word "option". Speaking of options, shouldn't there be a > command line option that overrides the configured value? > > We should honor the usual http.<url>.VARIABLE convention where > http.<url>.VARIABLE that is destination-specific overrides a more > generic http.VARIABLE configuration variable. Clarification. I know the above is automatically achieved, given the way we have laid urlmatch foundation to allow easy parsing for configuration variables structured this way. I did not mean that you'd need to do anything special; rather, I meant that we should advertise that we do in the commit log message. ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v2] Add unix domain socket support to HTTP transport 2024-02-23 15:43 ` Junio C Hamano @ 2024-02-23 22:24 ` Leslie Cheng 0 siblings, 0 replies; 7+ messages in thread From: Leslie Cheng @ 2024-02-23 22:24 UTC (permalink / raw) To: Junio C Hamano, Leslie Cheng via GitGitGadget Cc: git, Eric Wong, Leslie Cheng On 2024-02-23 12:37 a.m., Junio C Hamano wrote: > How about following that convention, perhaps like: > > In some corporate environments, the proxy server listens to a > local unix domain socket for requests, instead of listening to a > network port. Even though we have http.proxy (and more > destination specific http.<url>.proxy) configuration variables > to specify the network address/port of a proxy, that would not > help if your proxy does not listen to the network. > > Introduce an `http.unixSocket` (and `http.<url>.unixSocket`) > configuration variables that specify the path to a unix domain > socket for such a proxy. Recent versions of libcURL library > added CURLOPT_UNIX_SOCKET_PATH to support "curl --unix-socket > <path>"---use the same mechanism to implement it. This is excellent, thanks for the guidance (and all the other suggestions prior)! I'll update in the next patch. > Unlike NO_UNIX_SOCKETS, GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH is > entirely internal to your implementation and not surfaced to neither > the end-users or the binary packagers. Because of that, I suspect > that any description that has to use that name probably falls on the > other side of "too much implementation details" to be useful to help > future developers.. That's reasonable, I figured it would fit as a cover letter detail but I agree it's not relevant as a commit message that lives in the history of the project. I'll also update this in the next patch. > Talking about precedence between this and http.proxy is good thing, > but one very important piece of information is missing. What value > does it take? > > The absolute path of a unix-domain socket to pass the HTTP > traffic over, instead of using the network. > > or something, perhaps? I like that wording, I'll update in the next patch. > It might make the code easier to follow if you did: > > #if !defined(NO_CURLOPT_UNIX_SOCKET_PATH) && !defined(NO_UNIX_SOCKETS) > #if defined(GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH) > #define USE_CURLOPT_UNIX_SOCKET_PATH > #endif > #endif > > The points are > > (1) the users can decline to use CURLOPT_UNIX_SOCKET_PATH while > still using unix domain socket for other purposes, and > > (2) you do not have to care if you HAVE it or not most of time; > what matters more often is if the user told you to USE it. > > Hmm? Do you think this functionality is worth adding another macro to conditionally include it in the build? It felt out-of-the-way enough that we could just use the same `NO_UNIX_SOCKETS` macro to control for environments that don't support unix domain sockets. >> +#if defined(GIT_CURL_HAVE_CURLOPT_UNIX_SOCKET_PATH) && !defined(NO_UNIX_SOCKETS) >> +static const char *curl_unix_socket_path; >> +#endif > > The guard here would become "#ifdef USE_CURLOPT_UNIX_SOCKET_PATH" if > we wanted this to be conditional, but I think it is easier to make > the variable unconditionally available; see below. Agreed in general, I was looking to other patterns for conditional variables in file, e.g. https://github.com/git/git/blob/3c2a3fdc388747b9eaf4a4a4f2035c1c9ddb26d0/http.c#L66-L68 But, revisiting, this looks like an exception rather than the norm. > In general, it is inadvisable to issue a warning in the codepath > that parses configuration variables, as the values we read may not > be necessarily used. We could instead accept the given path into a > variable unconditionally, and complain only before it gets used, > near the call to curl_easy_setopt(). Similar to above, I followed what was already done for certain configuration variables (e.g. https://github.com/git/git/blob/3c2a3fdc388747b9eaf4a4a4f2035c1c9ddb26d0/http.c#L485-L501), but I agree with your feedback since this would result in constant warnings. To summarize, I'll do the following in the next patch: - change the wording of the commit message - move the conditional variables and parsing to a check at `curl_easy_setopt()` time I'm still undecided on whether I should introduce another macro specifically for this functionality, and I'd like to hear your thoughts on why it might make sense. ^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2024-02-23 22:24 UTC | newest] Thread overview: 7+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2024-02-21 9:14 [PATCH] Add unix domain socket support to HTTP transport Leslie Cheng via GitGitGadget 2024-02-21 22:09 ` Eric Wong 2024-02-22 3:04 ` Leslie Cheng 2024-02-23 1:58 ` [PATCH v2] " Leslie Cheng via GitGitGadget 2024-02-23 8:37 ` Junio C Hamano 2024-02-23 15:43 ` Junio C Hamano 2024-02-23 22:24 ` Leslie Cheng
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).