From: kernel test robot <lkp@intel.com>
To: Mallesh Koujalagi <mallesh.koujalagi@intel.com>,
intel-xe@lists.freedesktop.org, dri-devel@lists.freedesktop.org,
rodrigo.vivi@intel.com
Cc: llvm@lists.linux.dev, oe-kbuild-all@lists.linux.dev,
andrealmeid@igalia.com, christian.koenig@amd.com,
airlied@gmail.com, simona.vetter@ffwll.ch, mripard@kernel.org,
anshuman.gupta@intel.com, badal.nilawar@intel.com,
riana.tauro@intel.com, karthik.poosa@intel.com,
sk.anirban@intel.com, raag.jadav@intel.com,
Mallesh Koujalagi <mallesh.koujalagi@intel.com>
Subject: Re: [PATCH v2 1/5] Introduce Xe Uncorrectable Error Handling
Date: Fri, 20 Mar 2026 04:02:46 +0800
Message-ID: <202603200358.BacRkqob-lkp@intel.com>
In-Reply-To: <20260318064016.374656-8-mallesh.koujalagi@intel.com>
Hi Mallesh,
kernel test robot noticed the following build errors:
[auto build test ERROR on drm-xe/drm-xe-next]
[also build test ERROR on drm-misc/drm-misc-next drm/drm-next next-20260319]
[cannot apply to linus/master v7.0-rc4]
[If your patch was applied to the wrong git tree, kindly drop us a note.
When submitting a patch, we suggest using '--base' as documented in
https://git-scm.com/docs/git-format-patch#_base_tree_information]
url: https://github.com/intel-lab-lkp/linux/commits/Mallesh-Koujalagi/Introduce-Xe-Uncorrectable-Error-Handling/20260318-153303
base: https://gitlab.freedesktop.org/drm/xe/kernel.git drm-xe-next
patch link: https://lore.kernel.org/r/20260318064016.374656-8-mallesh.koujalagi%40intel.com
patch subject: [PATCH v2 1/5] Introduce Xe Uncorrectable Error Handling
config: i386-randconfig-017-20260319 (https://download.01.org/0day-ci/archive/20260320/202603200358.BacRkqob-lkp@intel.com/config)
compiler: clang version 20.1.8 (https://github.com/llvm/llvm-project 87f0227cb60147a26a1eeb4fb06e3b505e9c7261)
reproduce (this is a W=1 build): (https://download.01.org/0day-ci/archive/20260320/202603200358.BacRkqob-lkp@intel.com/reproduce)
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add the following tags
| Reported-by: kernel test robot <lkp@intel.com>
| Closes: https://lore.kernel.org/oe-kbuild-all/202603200358.BacRkqob-lkp@intel.com/
All errors (new ones prefixed by >>):
>> drivers/gpu/drm/xe/xe_ras.c:241:4: error: variable 'action' is used uninitialized whenever switch default is taken [-Werror,-Wsometimes-uninitialized]
     241 |                         default:
         |                         ^~~~~~~
   drivers/gpu/drm/xe/xe_ras.c:250:8: note: uninitialized use occurs here
     250 |                         if (action > final_action)
         |                             ^~~~~~
   drivers/gpu/drm/xe/xe_ras.c:227:4: note: variable 'action' is declared here
     227 |                         enum xe_ras_recovery_action action;
         |                         ^
   1 error generated.
vim +/action +241 drivers/gpu/drm/xe/xe_ras.c
177	
178	/**
179	 * xe_ras_process_errors - Process and contain hardware errors
180	 * @xe: xe device instance
181	 *
182	 * Get error details from system controller and return recovery
183	 * method. Called only from PCI error handling.
184	 *
185	 * Returns: recovery action to be taken
186	 */
187	enum xe_ras_recovery_action xe_ras_process_errors(struct xe_device *xe)
188	{
189		struct xe_sysctrl_mailbox_command command = {0};
190		struct xe_ras_get_error_response response;
191		enum xe_ras_recovery_action final_action;
192		size_t rlen;
193		int ret;
194	
195		/* Default action */
196		final_action = XE_RAS_RECOVERY_ACTION_RECOVERED;
197	
198		if (!xe->info.has_sysctrl)
199			return XE_RAS_RECOVERY_ACTION_RESET;
200	
201		xe_ras_prepare_sysctrl_command(&command, XE_SYSCTRL_CMD_GET_SOC_ERROR, NULL, 0,
202					       &response, sizeof(response));
203	
204		do {
205			memset(&response, 0, sizeof(response));
206			rlen = 0;
207	
208			ret = xe_sysctrl_send_command(xe, &command, &rlen);
209			if (ret || !rlen) {
210				xe_err(xe, "[RAS]: Sysctrl error ret %d\n", ret);
211				goto err;
212			}
213	
214			if (rlen != sizeof(response)) {
215				xe_err(xe, "[RAS]: Sysctrl response does not match len!!\n");
216				goto err;
217			}
218	
219			if (response.num_errors > XE_RAS_NUM_ERROR_ARR) {
220				xe_err(xe, "[RAS]: Number of errors out of bound (%d)\n",
221				       XE_RAS_NUM_ERROR_ARR);
222				goto err;
223			}
224	
225			for (int i = 0; i < response.num_errors; i++) {
226				struct xe_ras_error_array arr = response.error_arr[i];
227				enum xe_ras_recovery_action action;
228				struct xe_ras_error_class error_class;
229				u8 component;
230	
231				error_class = arr.error_class;
232				component = error_class.common.component;
233	
234				switch (component) {
235				case XE_RAS_COMPONENT_CORE_COMPUTE:
236					action = handle_compute_errors(xe, &arr);
237					break;
238				case XE_RAS_COMPONENT_SOC_INTERNAL:
239					action = handle_soc_internal_errors(xe, &arr);
240					break;
> 241			default:
242					xe_err(xe, "[RAS]: Unknown error component %u\n", component);
243					break;
244				}
245	
246				/*
247				 * Retain the highest severity action. Process and log all errors
248				 * and then take appropriate recovery action
249				 */
250				if (action > final_action)
251					final_action = action;
252			}
253	
254		} while (response.additional_errors);
255	
256		return final_action;
257	
258	err:
259		return XE_RAS_RECOVERY_ACTION_RESET;
260	}
261
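For reference, the warning fires because the default branch at line 241 leaves 'action' unassigned before the comparison at line 250. One possible way to silence it is to assign a value in the default branch. The following is only a standalone userspace sketch, not the driver code: the enum names, their ordering (higher value means more severe), the stub handlers, and the choice of a reset for unknown components are all assumptions made for illustration. Skipping the entry with 'continue' would be another option.

```c
#include <stdio.h>

/* Stand-in for enum xe_ras_recovery_action. Assumption: a higher value is more severe. */
enum recovery_action {
	ACTION_RECOVERED = 0,
	ACTION_RESET,
	ACTION_DISCONNECT,
};

/* Stand-in component IDs (hypothetical values). */
enum component {
	COMPONENT_CORE_COMPUTE = 1,
	COMPONENT_SOC_INTERNAL = 2,
};

/* Hypothetical handlers; the real ones inspect the error entry. */
static enum recovery_action handle_compute(void)
{
	return ACTION_RECOVERED;
}

static enum recovery_action handle_soc(void)
{
	return ACTION_RESET;
}

/* Same loop shape as the report, with 'action' assigned on every switch path. */
static enum recovery_action process_errors(const int *components, int n)
{
	enum recovery_action final_action = ACTION_RECOVERED;

	for (int i = 0; i < n; i++) {
		enum recovery_action action;

		switch (components[i]) {
		case COMPONENT_CORE_COMPUTE:
			action = handle_compute();
			break;
		case COMPONENT_SOC_INTERNAL:
			action = handle_soc();
			break;
		default:
			fprintf(stderr, "unknown component %d\n", components[i]);
			/* Assumption: an unknown component is treated as needing a reset. */
			action = ACTION_RESET;
			break;
		}

		/* Retain the highest severity action seen so far. */
		if (action > final_action)
			final_action = action;
	}

	return final_action;
}
```

With every path through the switch assigning 'action', clang's -Wsometimes-uninitialized no longer has a path on which the comparison reads an indeterminate value.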
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki