All of lore.kernel.org
 help / color / mirror / Atom feed
From: Luis Chamberlain <mcgrof@kernel.org>
To: Chuck Lever <cel@kernel.org>, Daniel Gomez <da.gomez@kruces.com>,
	kdevops@lists.linux.dev
Cc: Luis Chamberlain <mcgrof@kernel.org>
Subject: [PATCH v4 7/8] aws: add GPU instance defconfigs for AI/ML workloads
Date: Tue, 16 Sep 2025 17:34:48 -0700	[thread overview]
Message-ID: <20250917003451.2318229-8-mcgrof@kernel.org> (raw)
In-Reply-To: <20250917003451.2318229-1-mcgrof@kernel.org>

Add two AWS GPU instance defconfigs to support GPU-accelerated
workloads:

1. aws-gpu-p5-48xlarge: High-end configuration with 8x NVIDIA H100 80GB
   GPUs for large-scale AI/ML training and inference. Includes 500GB
   storage for models and datasets.

2. aws-gpu-g5-xlarge: Cost-effective single NVIDIA A10G 24GB GPU for
   development and smaller workloads. Includes 200GB storage.

Both configurations:
- Use Debian 12 for stability
- Enable kernel development workflows (KOTD)
- Configure 9P filesystem for host-guest development
- Set up in us-east-1 for best GPU availability
- Use GP3 EBS volumes for better performance

These defconfigs leverage the new dynamic Kconfig generation to
automatically include all available P5 and G5 instance sizes.

Generated-by: Claude AI
Signed-off-by: Luis Chamberlain <mcgrof@kernel.org>
---
 defconfigs/aws-gpu-g5-xlarge   | 36 ++++++++++++++++++++++++++++++++++
 defconfigs/aws-gpu-p5-48xlarge | 36 ++++++++++++++++++++++++++++++++++
 2 files changed, 72 insertions(+)
 create mode 100644 defconfigs/aws-gpu-g5-xlarge
 create mode 100644 defconfigs/aws-gpu-p5-48xlarge

diff --git a/defconfigs/aws-gpu-g5-xlarge b/defconfigs/aws-gpu-g5-xlarge
new file mode 100644
index 00000000..a2f0a21f
--- /dev/null
+++ b/defconfigs/aws-gpu-g5-xlarge
@@ -0,0 +1,36 @@
+# AWS G5.xlarge GPU instance configuration
+# 1x NVIDIA A10G 24GB GPU for cost-effective GPU development
+
+CONFIG_KDEVOPS_FIRST_RUN=y
+CONFIG_TERRAFORM=y
+CONFIG_TERRAFORM_AWS=y
+CONFIG_TERRAFORM_AWS_DATA_ENABLE=y
+
+# Instance configuration
+CONFIG_TERRAFORM_AWS_INSTANCE_FAMILY_G5=y
+CONFIG_TERRAFORM_AWS_INSTANCE_G5_XLARGE=y
+
+# Moderate root volume
+CONFIG_TERRAFORM_AWS_EBS_SIZE=200
+CONFIG_TERRAFORM_AWS_EBS_VOLUME_TYPE_GP3=y
+
+# US East 1 for availability
+CONFIG_TERRAFORM_AWS_REGION_US_EAST_1=y
+
+# Debian 12 for stability
+CONFIG_VAGRANT_DEBIAN=y
+CONFIG_VAGRANT_DEBIAN12=y
+
+# Use kdevops user
+CONFIG_KDEVOPS_SSH_CONFIG_USER_KDEVOPS=y
+
+# Workflows
+CONFIG_KDEVOPS_WORKFLOW_ENABLE_KOTD=y
+CONFIG_KDEVOPS_WORKFLOW_KOTD_ENABLE_LATEST=y
+
+# Enable git for kernel development
+CONFIG_BOOTLINUX_9P=y
+CONFIG_BOOTLINUX_TREE_LINUS=y
+
+# Networking
+CONFIG_KDEVOPS_ENABLE_TERRAFORM_SSH_COMPLETE_SETUP=y
\ No newline at end of file
diff --git a/defconfigs/aws-gpu-p5-48xlarge b/defconfigs/aws-gpu-p5-48xlarge
new file mode 100644
index 00000000..741b98e3
--- /dev/null
+++ b/defconfigs/aws-gpu-p5-48xlarge
@@ -0,0 +1,36 @@
+# AWS P5.48xlarge GPU instance configuration
+# 8x NVIDIA H100 80GB GPUs for AI/ML workloads
+
+CONFIG_KDEVOPS_FIRST_RUN=y
+CONFIG_TERRAFORM=y
+CONFIG_TERRAFORM_AWS=y
+CONFIG_TERRAFORM_AWS_DATA_ENABLE=y
+
+# Instance configuration
+CONFIG_TERRAFORM_AWS_INSTANCE_FAMILY_P5=y
+CONFIG_TERRAFORM_AWS_INSTANCE_P5_48XLARGE=y
+
+# Large root volume for models and datasets
+CONFIG_TERRAFORM_AWS_EBS_SIZE=500
+CONFIG_TERRAFORM_AWS_EBS_VOLUME_TYPE_GP3=y
+
+# US East 1 typically has best GPU availability
+CONFIG_TERRAFORM_AWS_REGION_US_EAST_1=y
+
+# Debian 12 for stability
+CONFIG_VAGRANT_DEBIAN=y
+CONFIG_VAGRANT_DEBIAN12=y
+
+# Use kdevops user
+CONFIG_KDEVOPS_SSH_CONFIG_USER_KDEVOPS=y
+
+# Workflows
+CONFIG_KDEVOPS_WORKFLOW_ENABLE_KOTD=y
+CONFIG_KDEVOPS_WORKFLOW_KOTD_ENABLE_LATEST=y
+
+# Enable git for kernel development
+CONFIG_BOOTLINUX_9P=y
+CONFIG_BOOTLINUX_TREE_LINUS=y
+
+# Networking
+CONFIG_KDEVOPS_ENABLE_TERRAFORM_SSH_COMPLETE_SETUP=y
\ No newline at end of file
-- 
2.51.0


  parent reply	other threads:[~2025-09-17  0:34 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-09-17  0:34 [PATCH v4 0/8] aws: add dynamic kconfig support Luis Chamberlain
2025-09-17  0:34 ` [PATCH v4 1/8] aws: prevent SSH key conflicts across multiple kdevops directories Luis Chamberlain
2025-09-17  3:36   ` Chuck Lever
2025-09-17  0:34 ` [PATCH v4 2/8] terraform/aws: Add scripts to gather provider resource information Luis Chamberlain
2025-09-17  0:34 ` [PATCH v4 3/8] aws: add optimized Kconfig generator using Chuck's scripts Luis Chamberlain
2025-09-17  3:58   ` Chuck Lever
2025-09-17  0:34 ` [PATCH v4 4/8] aws: integrate dynamic Kconfig generation with make targets Luis Chamberlain
2025-09-17  3:40   ` Chuck Lever
2025-09-17  7:05     ` Luis Chamberlain
2025-09-17  0:34 ` [PATCH v4 5/8] aws: add cloud billing support with make cloud-bill Luis Chamberlain
2025-09-17  0:34 ` [PATCH v4 6/8] aws: replace static Kconfig files with dynamically generated ones Luis Chamberlain
2025-09-17  0:34 ` Luis Chamberlain [this message]
2025-09-17  0:34 ` [PATCH v4 8/8] docs: add documentation for dynamic cloud configuration Luis Chamberlain

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20250917003451.2318229-8-mcgrof@kernel.org \
    --to=mcgrof@kernel.org \
    --cc=cel@kernel.org \
    --cc=da.gomez@kruces.com \
    --cc=kdevops@lists.linux.dev \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.