All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Daniel P. Berrangé" <berrange@redhat.com>
To: "Alex Bennée" <alex.bennee@linaro.org>
Cc: qemu-devel@nongnu.org
Subject: Re: [RFC PATCH] readthedocs: don't build extra formats
Date: Tue, 5 Aug 2025 10:54:27 +0100	[thread overview]
Message-ID: <aJHU0w0gWe8i2uZ3@redhat.com> (raw)
In-Reply-To: <87cy9a5ec5.fsf@draig.linaro.org>

On Tue, Aug 05, 2025 at 10:04:42AM +0100, Alex Bennée wrote:
> Daniel P. Berrangé <berrange@redhat.com> writes:
> 
> > On Mon, Aug 04, 2025 at 05:29:59PM +0100, Alex Bennée wrote:
> >> We don't build the PDFs ourselves for the hosted docs and it looks
> >> like rtd can't manage building PDFs now they have gone over a certain
> >> size. Disable the extra formats so we can at least have the online
> >> stuff again.
> >
> > Regardless of build problems, IMHO, we should not have been building
> > the PDFs as no effort is being made to validate that the content is
> > formatting well under the layout constraints of PDFs
> 
> True.
> 
> I will say the one thing I have found PDFs good for is uploading the
> docs into a LLM context like NotebookLM. Otherwise you end up having to
> add individual links which a) is a pain and b) is a potential DDoS
> source if the model keeps hitting the host which as I'm sure everyone is
> aware is a problem for FLOSS archives at the moment.

Is there a "single page HTML" option that would service that need ?

In general PDFs are a pretty awful format for programatically
consuming text, because they have no logical content structure
like HTML docs, so I'd expect HTML is a better format to feed
into any tool either LLM or not.

With regards,
Daniel
-- 
|: https://berrange.com      -o-    https://www.flickr.com/photos/dberrange :|
|: https://libvirt.org         -o-            https://fstop138.berrange.com :|
|: https://entangle-photo.org    -o-    https://www.instagram.com/dberrange :|



  reply	other threads:[~2025-08-05  9:55 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-08-04 16:29 [RFC PATCH] readthedocs: don't build extra formats Alex Bennée
2025-08-05  7:11 ` Daniel P. Berrangé
2025-08-05  9:04   ` Alex Bennée
2025-08-05  9:54     ` Daniel P. Berrangé [this message]
2025-08-05 10:54       ` Alex Bennée
2025-08-12 23:38 ` Stefan Hajnoczi

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=aJHU0w0gWe8i2uZ3@redhat.com \
    --to=berrange@redhat.com \
    --cc=alex.bennee@linaro.org \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.