qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
* [Qemu-devel] Moving release tarballs to a CDN
@ 2017-11-08 15:33 Stefan Hajnoczi
  2017-11-08 16:19 ` Stefan Weil
  0 siblings, 1 reply; 4+ messages in thread
From: Stefan Hajnoczi @ 2017-11-08 15:33 UTC (permalink / raw)
  To: Michael Roth, Jeff Cody; +Cc: qemu-devel, Paolo Bonzini, Daniel P. Berrange

Hi Mike and Jeff,
qemu.org's bandwidth usage is dominated by release tarball downloads.
This puts qemu.org bandwidth usage in the 2+ TB/month range.

Many hosting providers do not offer this much bandwidth as part of a
basic package that would otherwise fit qemu.org's requirements.

Several people have suggested putting release tarballs on a mirror or
CDN to offload the bandwidth usage.  Jeff Cody and I looked at using
GitHub's "releases" feature to host release tarballs.

The following changes would be necessary:

1. Add qemu.org URL redirection to GitHub:
https://download.qemu.org/qemu-(\d+.\d+.\d+).tar.xz ->
https://github.com/qemu/qemu/releases/download/v\1/qemu-\1.tar.xz

2. Modify release script to immediately push to GitHub and then upload
tar.xz via API:
https://developer.github.com/v3/repos/releases/#upload-a-release-asset

3. Continue using https://download.qemu.org/qemu-\d+.\d+.\d+.tar.xz
URL for download links.

This way future QEMU releases would be hosted on GitHub with nice
qemu.org download links.  The signature file stays hosted on qemu.org
so it's possible to validate that the tarball has not been tampered
with.

How does this sound?

Stefan

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [Qemu-devel] Moving release tarballs to a CDN
  2017-11-08 15:33 [Qemu-devel] Moving release tarballs to a CDN Stefan Hajnoczi
@ 2017-11-08 16:19 ` Stefan Weil
  2017-11-08 17:11   ` Stefan Hajnoczi
  0 siblings, 1 reply; 4+ messages in thread
From: Stefan Weil @ 2017-11-08 16:19 UTC (permalink / raw)
  To: Stefan Hajnoczi; +Cc: Michael Roth, Jeff Cody, Paolo Bonzini, qemu-devel

Am 08.11.2017 um 16:33 schrieb Stefan Hajnoczi:
> Hi Mike and Jeff,
> qemu.org's bandwidth usage is dominated by release tarball downloads.
> This puts qemu.org bandwidth usage in the 2+ TB/month range.

Hi Stefan,

how much of this traffic is caused by web spiders?

>From my own binaries I know that the bots of the
different search engines cause most of the traffic,
if they are allowed to do so.

Usually they respect robots.txt. There is no
https://www.qemu.org/robots.txt currently.
Nor is there a https://download.qemu.org/robots.txt.
Adding both would reduce the downloads, maybe
enough to fix the problem.

Or do you see an advantage from bots which download
QEMU tarballs? robots.txt can also block only
selected bots.

Regards
Stefan

PS. There is a https://git.qemu.org/robots.txt.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [Qemu-devel] Moving release tarballs to a CDN
  2017-11-08 16:19 ` Stefan Weil
@ 2017-11-08 17:11   ` Stefan Hajnoczi
  2017-11-08 22:08     ` Jeff Cody
  0 siblings, 1 reply; 4+ messages in thread
From: Stefan Hajnoczi @ 2017-11-08 17:11 UTC (permalink / raw)
  To: Stefan Weil; +Cc: Michael Roth, Jeff Cody, Paolo Bonzini, qemu-devel

[-- Attachment #1: Type: text/plain, Size: 1088 bytes --]

On Wed, Nov 08, 2017 at 05:19:25PM +0100, Stefan Weil wrote:
> Am 08.11.2017 um 16:33 schrieb Stefan Hajnoczi:
> > Hi Mike and Jeff,
> > qemu.org's bandwidth usage is dominated by release tarball downloads.
> > This puts qemu.org bandwidth usage in the 2+ TB/month range.
> 
> Hi Stefan,
> 
> how much of this traffic is caused by web spiders?
> 
> From my own binaries I know that the bots of the
> different search engines cause most of the traffic,
> if they are allowed to do so.
> 
> Usually they respect robots.txt. There is no
> https://www.qemu.org/robots.txt currently.
> Nor is there a https://download.qemu.org/robots.txt.
> Adding both would reduce the downloads, maybe
> enough to fix the problem.
> 
> Or do you see an advantage from bots which download
> QEMU tarballs? robots.txt can also block only
> selected bots.
> 
> Regards
> Stefan
> 
> PS. There is a https://git.qemu.org/robots.txt.

Great idea!  It's an easy to try adding a robots.txt and check how
bandwidth uses changes over the next month.

Jeff: Want to try this?

Stefan

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 455 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [Qemu-devel] Moving release tarballs to a CDN
  2017-11-08 17:11   ` Stefan Hajnoczi
@ 2017-11-08 22:08     ` Jeff Cody
  0 siblings, 0 replies; 4+ messages in thread
From: Jeff Cody @ 2017-11-08 22:08 UTC (permalink / raw)
  To: Stefan Hajnoczi; +Cc: Stefan Weil, Michael Roth, Paolo Bonzini, qemu-devel

On Wed, Nov 08, 2017 at 05:11:20PM +0000, Stefan Hajnoczi wrote:
> On Wed, Nov 08, 2017 at 05:19:25PM +0100, Stefan Weil wrote:
> > Am 08.11.2017 um 16:33 schrieb Stefan Hajnoczi:
> > > Hi Mike and Jeff,
> > > qemu.org's bandwidth usage is dominated by release tarball downloads.
> > > This puts qemu.org bandwidth usage in the 2+ TB/month range.
> > 
> > Hi Stefan,
> > 
> > how much of this traffic is caused by web spiders?
> > 
> > From my own binaries I know that the bots of the
> > different search engines cause most of the traffic,
> > if they are allowed to do so.
> > 
> > Usually they respect robots.txt. There is no
> > https://www.qemu.org/robots.txt currently.
> > Nor is there a https://download.qemu.org/robots.txt.
> > Adding both would reduce the downloads, maybe
> > enough to fix the problem.
> > 
> > Or do you see an advantage from bots which download
> > QEMU tarballs? robots.txt can also block only
> > selected bots.
> > 
> > Regards
> > Stefan
> > 
> > PS. There is a https://git.qemu.org/robots.txt.
> 
> Great idea!  It's an easy to try adding a robots.txt and check how
> bandwidth uses changes over the next month.
> 
> Jeff: Want to try this?
> 
> Stefan

Yes, sure - I added a robots.txt to exclude .bz2 and .xz files, and we can
see how that affects bandwidth.  Right now, with our current hosting provider,
we are not near any bandwidth limit, but it makes sense to conserve
resources (unless there is any benefit we see to allowing bots to index
download.qemu.org binaries).


Thanks,

Jeff

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2017-11-08 22:08 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-11-08 15:33 [Qemu-devel] Moving release tarballs to a CDN Stefan Hajnoczi
2017-11-08 16:19 ` Stefan Weil
2017-11-08 17:11   ` Stefan Hajnoczi
2017-11-08 22:08     ` Jeff Cody

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).